Gene PHATRDRAFT_51040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_51040 
Symbol 
ID7201972 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp803362 
End bp805478 
Gene Length2117 bp 
Protein Length607 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181264 
Protein GI219121835 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCCCCCGACG AGGGGCAGAT TGCCTCCCGT TTCATGTATC GATTGATACA TTTATGGTTC 
TCTTGTGAGG ATGAGGGAAG CTCAAATGGT TCGCTCGTGG AAATCATGGC TGAAGCTGTG
ACTCGACTTC CATCATTTCG GTTCGTACCG GCAACCAGCC AACTTTTGTC TCGTGTAGAA
AAAAGAAACA CAGGCTCTTT TCAAGAAACT TTGCACACGT TGATCCTCCG AATGTGCCAC
GATCATCCTT ACAATTGCCT AGTACAGGTA GTTTCGTTGG CCAATGGAAA GACTATAGGT
AGTGGTGTCA GCGGACGACA GGCAGAGGTA TTCCTCGAAA ATACGAGCGA CACGAAGGTT
GATGGCGCAA ATGAGATTCT TGCATCCCTT AGAACAAGCG AGCGCAAACC TCTTGGGAGG
TTGATGGAGG GTTACATCTC GCTCACTGAT GCATACGTCC ACTTGGCGAT GTATCCAACG
CATGACTTCC AGAAGGCCAA AAACAAAAAG TTTCCTTTTT CAGCAGTCAG CAAATCCCAC
GCCGAACGCC TGGACCAATG CCTAGGCGTG GGACGCCGCA AAGTGCCTCA CCCACCTTGT
GTCTTGACCA AACCACCGCC AATTCGTCCA GCAAGTGACT ACACCGACGA AACGGGGCAG
CTCATTGGGT CTGAGAGCGT CGTTGGTTTT GAGCAAGCCT TTTCTATCAC CGAGAGCGGT
CTGCATCGCC CCAAGATTGT CTACTGTCTT GGATCCAAAG GTGGTCGTTT TAAACAGCTG
GTAAAGGGAG AGGACGAGAT CCGACAGGAT GCCGTAATGG AGCAAGTTTT TGGTTACGTC
AACGAATTGT TGTCAAACGG GGATCTGTCA GACAGCCTAG ATGAGATCCG GCGGACGACT
GGAGCTGGCC ATTTGCGTTT AGTGACTTAC AATATTGTTC CTTTGAGTCC AGCAAGCGGG
GTAAGTGATA AAGGACTACT TACAGGACGT GCATCATCAA ACTCACTTCG TTTTCATTTC
TGCAGGTTCT AGAATGGGTC GATCATACCA TTCCATTCGG GGAGTTCATG ATGGACAAGA
AAGGTCACGT CGGTGCGCAT TCTCGGTATT ATCCTGGACA ATGGAGCAGC CTTGTTTGCC
GGGAGCAGCT GCGGAAAGCA CCGAAAAAGG AAAAACTTCA AGCCTTTAAT GCAATTTGCT
TAAACCACTC GCCCGTCTTT CGATACTTTT TCGTGGAGAG GTTTGGGCAC ACGCCAGAAT
TATGGCACGA GGCTCGGATG CGCTACACGC GGTCCGTTGC TGTCAATAGT ATTGTTGGGC
ACATTCTTGG GATCGGCGAT CGCCACTGCA GCAACATTCT TATTCATGAG GGGACTGGGG
AAGTCGTACA CATCGATTTT GGTATAGTCT TCGAACAAGG AAAGGTACGT TCCTTTTGAA
CCCTAAGAGG ATACCGCTCT TTTCTTCTTA ATGGAGTCGT ATCGCTCAAG GCATATTTTA
TTTCTTGACG ATAGCTCCTG AACACGCCCG AGCTAGTGCC GTTCCGACTG ACGCAAAATA
CAGTGGATGG GTTCGGCCCA GTGGGACTCG ACGGTACCTT TACCAAATCC GCCCAACGGA
CTTTATCCGT TCTCCGAAAG AATTCAAACG CGCTCCTGAC TATTCTGTCC GCGATTGTTT
CGGATCCGCT GTACAAATGG AGCGTAAGTC CAGTCAAAGC ACGGCTGCGG CAAGAGCAGC
AACAGCATCA GGATGAGGAA GAACAAGGGG AGAACAAACG CACATCGATG ACAACCACAA
CGAGTTCCAC CAAAGTTTCA CGTAGCCAAG GACAAGAAAA CGAAGCCGCG TCACATGCCA
TTCGGAGGAT ACAAGAAAAA CTCCAAGGCT ACGAGGATGG CACATCGGGC GAGCAACAAA
GCGTGGAAGG ACAGGTTCAG CTCCTGATTA ACTCCGCGAA GAACAAGGAC AATTTGTGTC
TCATGTTCTG TGGTTGGGCG CCGTGGGTGT AATTTCCACC AATAAATCTA CATTACTTCT
GTTTCGCGAG ACTCTACGAG AGTCAAAAAA CGGACAGTGA ATTACTGTAA TGAACTAAGT
AGATGTTGCC AAAATGC
 
Protein sequence
MYRLIHLWFS CEDEGSSNGS LVEIMAEAVT RLPSFRFVPA TSQLLSRVEK RNTGSFQETL 
HTLILRMCHD HPYNCLVQVV SLANGKTIGS GVSGRQAEVF LENTSDTKVD GANEILASLR
TSERKPLGRL MEGYISLTDA YVHLAMYPTH DFQKAKNKKF PFSAVSKSHA ERLDQCLGVG
RRKVPHPPCV LTKPPPIRPA SDYTDETGQL IGSESVVGFE QAFSITESGL HRPKIVYCLG
SKGGRFKQLV KGEDEIRQDA VMEQVFGYVN ELLSNGDLSD SLDEIRRTTG AGHLRLVTYN
IVPLSPASGV LEWVDHTIPF GEFMMDKKGH VGAHSRYYPG QWSSLVCREQ LRKAPKKEKL
QAFNAICLNH SPVFRYFFVE RFGHTPELWH EARMRYTRSV AVNSIVGHIL GIGDRHCSNI
LIHEGTGEVV HIDFGIVFEQ GKLLNTPELV PFRLTQNTVD GFGPVGLDGT FTKSAQRTLS
VLRKNSNALL TILSAIVSDP LYKWSVSPVK ARLRQEQQQH QDEEEQGENK RTSMTTTTSS
TKVSRSQGQE NEAASHAIRR IQEKLQGYED GTSGEQQSVE GQVQLLINSA KNKDNLCLMF
CGWAPWV