Award
 » Home » Bioinformatics » Software Developers » Transcription Factor motif (PERL)
 
Solutions Search! The Customized Life Science Search Engine
Search Site
Search Suppliers
Search Internet
Search over 6000 life science websites specifically selected by our expert scientist moderators.

Other Topics
8/24/2008 12:16 PM
How to start? (PDB Viewer ...
7/9/2008 07:31 PM
What is your programming ...
11/15/2007 03:40 PM
Sequence Search Solution
9/3/2007 08:27 PM
Bioinformatic Software De ...
9/3/2007 08:44 PM
Database Poll
3/26/2007 08:47 PM
Scibuntu (ubuntu linux di ...
3/14/2007 03:28 PM
Transcription Factor moti ...
2/14/2007 06:23 AM
Bioinformaticians looking ...
8/9/2006 09:39 PM
extremely fast Smith-wate ...
4/20/2006 02:38 PM
Database Poll, What DBMS ...
4/19/2006 01:19 PM
Inference engine for comp ...
2/22/2006 10:56 PM
about the coupling of GPC ...
12/7/2005 01:37 PM
Software Development
6/21/2005 03:27 PM
Error in using SeqIOTools
6/14/2005 04:13 PM
PERL
1/13/2005 10:32 PM
HT-GO-FAT
Subscribet to topic
bottom of page RSS Feed Topic Feed
 Transcription Factor motif (PERL) [View Printable]
ABC

Frog Egg

[ Privacy ]
See
Similar
Scientists





Group: Member
Posts: 11
Joined: Mar 13, 2007







 Send a personal messsage to ABC Reply with a quote from this post Go to the top of the page

I want to use perl to find transcription factor DNA binding sites. For instance CACTTGAN. I only have basic perl writing comprehension but can follow a script fairly well. Thanks.

ABC
.........................
[ Privacy ]

 Posted Mar 14, 2007, 0:45 AM
jonatmudd

Frog Laureate

See
Similar
Scientists





Group: Member
Posts: 28
Joined: Oct 07, 2005







 Send a personal messsage to jonatmudd Reply with a quote from this post Go to the top of the page

Hi ABC,
It's not clear to me what you want to do--do you simply want to detect whether a particular sequence appears?
Or do you want the position of where it occurs? Or.....?

For the simplest case, to find a sequence of say CACTTGAN, all you have to do is the following for simple string matching

$All_Base_Pairs =~ /CACTTGAN/

The regular expression above is case sensitive, so if your file containing all the base pairs is in small case, make sure you try to match small case, not upper case.

$All_Base_Pairs = "ACTTTAGGGCACTTGANACCTATACCTATGG";

(I just made up some sequence including the one you are looking for.)

You made need to do some simple file manipulation if you have the sequences stored in files and don't want to tediously cut and paste them in.

For now, the above will return true if the binding site you are looking for is present.

Hope that gets you started...
.........................

Posted Mar 16, 2007, 16:00 PM
ABC

Frog Egg

[ Privacy ]
See
Similar
Scientists





Group: Member
Posts: 11
Joined: Mar 13, 2007







 Send a personal messsage to ABC Reply with a quote from this post Go to the top of the page

I'd like the find the sequence and it's position. My infile is just bare capital letter bases no fasta format or other characters.
.........................
[ Privacy ]

Posted Mar 16, 2007, 20:22 PM
jonatmudd

Frog Laureate

See
Similar
Scientists





Group: Member
Posts: 28
Joined: Oct 07, 2005







 Send a personal messsage to jonatmudd Reply with a quote from this post Go to the top of the page

Since you want to keep track of position, you'll need to do a progressive match. Try something like:

my $sequence = INFILE #where INFILE is however you choose to input the sequence within which you are searching

my $pattern = \"[CG]CACTTGA[ATCG]\"
while ($sequence =~ $pattern/gi) {
printf "Found a sequence at %d\n", pos($sequence)-length($pattern);
}
.........................

Posted Mar 16, 2007, 20:58 PM
ABC

Frog Egg

[ Privacy ]
See
Similar
Scientists





Group: Member
Posts: 11
Joined: Mar 13, 2007







 Send a personal messsage to ABC Reply with a quote from this post Go to the top of the page

Thanks, how would I go about printing the exact match since there is variation in the beginning and end bases?
.........................
[ Privacy ]

Posted Mar 17, 2007, 0:23 AM
jonatmudd

Frog Laureate

See
Similar
Scientists





Group: Member
Posts: 28
Joined: Oct 07, 2005







 Send a personal messsage to jonatmudd Reply with a quote from this post Go to the top of the page

hi there-
sorry for the slow reply.
i'm not a super expert at perl either, and that is a tricky question.

i know everyone loves perl because it is fast and free, but you might think about matlab. it has a very nice built in functions to do exactly what you want. probably only 3 lines of code. if you have matlab, interested and need help, let me know. i could rig it up no problem in a few minutes.
.........................

Posted Mar 20, 2007, 21:50 PM
ABC

Frog Egg

[ Privacy ]
See
Similar
Scientists





Group: Member
Posts: 11
Joined: Mar 13, 2007







 Send a personal messsage to ABC Reply with a quote from this post Go to the top of the page

I figured it out thanks for the assistance.
.........................
[ Privacy ]

Posted Mar 21, 2007, 16:34 PM
sichan

Frog Egg

See
Similar
Scientists





Group: Member
Posts: 22
Joined: Jul 30, 2008







 Send a personal messsage to sichan Reply with a quote from this post Go to the top of the page

A good book for basic Perl-ing for common bioinformatics tasks is James Tisdall's 'Beginning Perl for Bioinformatics.'
A slightly more advanced book would be its sequel, 'Mastering Perl for Bioinformatics', also written by Tisdall.
.........................

Posted Aug 04, 2008, 23:12 PM
top of page

Forum Jump