Gene PHATRDRAFT_40672 details

Gene Information

Locus tag: PHATRDRAFT_40672
Symbol:
ID: 7198582
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 110788
End bp: 114479
Gene Length: 3692 bp
Protein Length: 1034 aa
Translation table:
GC content: 60%
IMG OID:
Product: predicted protein
Protein accession: XP_002184736
Protein GI: 219129102
COG category:
COG ID:
TIGRFAM ID:
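
The reported gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state but which matches the listed values. The function names `gene_length` and `coding_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein, counting one stop codon."""
    return 3 * (protein_aa + 1)


span = gene_length(110788, 114479)
coding = coding_length(1034)
print(span)           # 3692, the reported gene length
print(coding)         # 3105
print(span - coding)  # 587 bp not accounted for by the protein
```

The 587 bp difference fits the record's statement that the gene is spliced, assuming the reported span covers exons and introns together.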


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAGCAG CGATTTCGCC TCGAAAGTAG ATCACCAAGG ATTTCGAGGG GTAAGCTCGC 
TACGGCGGAG CCCCTCAAAC CGGCATTTCG CTCGGAGACT CGGAAGGATT CAGGCTCGGC
ATTGCCGAAG GTTCTGTACT CGGTTCCAAG GATGGCACTG CACTCGGATC AGCCGAAGGT
ATTGCGGAAG GCGCCGCACT CGGACCGGCC GATGGAACAG CTGAAGGCAT TGCACTTGGT
ATAGCAGAAG GGACCGCACT CGGATCAGCC GATGGAGCCG CGCTGGGAAC CGCTGATGGC
ATCGCGGACG GAACACCAGA GGGGACCGCC GATGGAGCTT GGCTTGGAAC TGCCGAAGGC
ATAGCGCTGG GAACAGCTGA AGGTTGTGCC GATGGAGCCG CGCTAGGGAT CGCTGAAGGC
ATCGCGGACG GAACACCAGA GGGGACCGCC GATGGAGCTT GGCTTGGAAC TGCCGAAGGC
ATAGCGCTGG GAACAGCTGA AGGTTGTGCC GATGGAGCCG CGCTGGGAAC CGCTGATGGC
ATCGCGGACG GAACACCAGA GGGGACCGCC GATGGAGCTT GACTTGGAAC TGCCGAAGGC
ATAGCGCTGG GAACAGCTGA AGGTTGTGCC GATGGAGCCG CGCTGGGAAC CGCTGAAGGT
TGTGCCGATG GAGCCGCGCT GGGAACAGCT GAAGGTTGTG CCGATGGAGC CGCGCTAGGG
ACCGCTGAAG GCATCGCGGA CGGAACACCA GAGGGGACCG CCGATGGAGC TTGACTTGGA
ACTGCCGAAG GCATAGCGCT GGGAACAGCT GAAGGTTGTG CCGATGGAGC CGTGCTAGGA
ACCGCTGAAG GCATCGCGGA CGGAACACCG GACGGGACCG CCGATGGAGC TTGGCTGGGA
ACTGCCGAAG GCATAGCGCT GGGAACAGCT GAAGGTTGTG CCGATGGAGC CGCGCTGGGA
ACCGCTGAAG GTTGTGCCGA TGGAGCCGCG CTGGGAACAG CTGAAGGTTG TGCCGATGGA
GCCGCGCTAG GGACCGCTGA AGGCATCGCG GACGGAACAC CAGAGGGGAC CGCCGATGGA
GCTTGACTTG GAACTGCCGA AGGCATAGCG CTGGGAACAG CTGAAGGTTG TGCCGATGGA
GCCGTGCTAG GAACCGCTGA AGGCATCGCG GACGGAACAC CGGACGGGAC CGCCGATGGA
GCTTGGCTGG GAACTGCCGA AGGCATAGCG CTGGGAACAG CTGAAGGTTG TGCCGATGGA
GCCGCGCTGG GAACCGCTGA AGGTTGTGCC GATGGAGCCG CGCTAGGAAC CGCTGAAGGC
ATCGCGGACG GCACAACCGA TGGGACCGCC GATGGAGCTT GGCTTGGAAC TGCCGAAGGC
ATAGCTGAGG GAACAGCTGA AGGTTGTGCC GAAGGCATCT CGGAAGGAAC CGCTGAAGGT
TGTGCGGATG GAGCAGCGCT GGGAACCGCT GATGGCATCG CGGACGGCAC AACCGACGGG
ACCGCCGATG GAGCTTGGCT TGGAACTGCC GAAGGCATAG CGGACGGCAC AACCGACGGG
ACCGCCGATG GAGCCTCACT CGGAACTGCC GAAGGCATAG CGGAGGGAAC GACGGAAGGA
ACAGCCGACG GCATCGCGGA GGGAACCGCT GAAGGTAGTG CCGATGGAGC CGCGCTGGGA
ACTGCTGAAG GCATCGCAGA CGGCACAGCT GACGGGACCG CCGATGGAGC CTCACTCGGA
ACTGCCGAAG GCATAGCTGA GGGAACAGCT GAAGGTTGTG CCGAAGGCAT CTCGGAAGGA
ACCGCTGAAG GTTGTGCTGA CGGAGCCGCG CTGGGAATCG CTGAAGGCAT CGCGGACGGC
ACAACCGACG GGACCGCCGA TGGAGCCTCG CTTGGAACTG CCGAAGGCAT AGCGGACGGC
ACAACCGACG GGACCGCCGA TGGAGCTTCA CTCGGAACTG CCGAAGGCAT AGCGGAGGGA
ACGACGGAAG GAACAGCCGA CGGCATCGCG GAGGGAACCG CTGAAGGTAG TGCCGATGGA
GCCGCGCTGG GAACTGCTGA AGGCATCGCA GACGGCACAG CTGACGGGAC CGCCGATGGA
GCCTCACTCG GAACTGCCGA AGGCATAGCT GAGGGAACAG CTGAAGGTTG TGCCGAAGGC
ATCTCGGAAG GAACCGCTGA AGGTTGTGCT GACGGAGCCG CGCTGGGAAT CGCTGAAGGC
ATCGCGGACG GCACAACCGA CGGGACCGCC GATGGAGCCT CGCTTGGAAC TGCCGAAGGC
ATAGCTGAGG GAACGACGGA AGGAACTGCC GAAGGCATCG CGGAGGGACC TGTTGAAGGT
AATGCTGACG GAGCCGCGCT GGGAATCGCT GAAGGCATCG CGGACGGCAC AACCGACGGG
ACCGCCGATG GAGCTTCGGA CGGAGATGCC GAAGGGAGCG CACTCGGAAC CGCCGACGGT
AGGGCGGAAG GAAGCTCCGA TGGCTCTGCA GATGGGAGCG AGGTCGGAAC CTCGGACGGA
GATGCAGAGG GGAGCTCACT CGGAACGGCT GAGGGGACGG CAGAGGGTAC ATCCGAAGGG
AATTGCGAAG GAGCCTGCGA AGGAACTTCA GACGGAGCGC TGGATGGTGG AGCGCTGATA
GTGGGGGACT GTGATGGAGA GTCCGATGGA AACGCGGAAG GGAAGTCAGA CGGAGCTACA
GAAGGAACTT CCGAAGGCAC ATCCGAAGGG ACTGCTGATG GTGTCAGTGT TGGGGTTTCA
GAAGGAACCC CCGAAGGAAA ATCCGAAAAG ACCCCGGTCG GTGTCTGCGA GGGGCTGTCA
GAAGCATTTT GCGAGGGTGA TGTCGATGGA GACCGCGATG GCGCCTCGGA CTTTTTGCTT
TTGGCACTCT TCTTGCTTTG CTTCGAATTA TCCATCATAC CCTTTGTACT TTTCGGATTC
GGTACGGTAG GGACACGTGT GGGTAGAACC AGGCTAGGAC CCCCCGAAGG CAACACGGGA
GGAGCCTGGG TAGGCATCTC AATGGGAATA ATCGTAGGCA GCGTCGAAGG TGCCCCCGAA
GGGTTTGCCA AGGATGGAGA AGGGGTAGGA GCAAGCAGAA GCATGGTGGG AACGGACGAC
GGTACGTCCG ACTTCTTGCT CTTCGTGCTT TTGGCTCCTC CCTTTCCATC CTTGCCGCTA
CTCTTTCCAT CCTTACCGCT ACCCTTGCCT TCTCCATCGC CAGCTTTCTT ATTATTCGAG
GGAGGCATCG CCTTGCCTCC GTTATTGGAG ACGACGTTGA TGACGCTCGT TAAAACAGGA
TGCTTCCTCA GCTGCCGCAT TGCTCCGATA TCTTCCGCCG ACTCTACTGA AGAGCAGCCA
GCCGAAACGA CACCCAGCAA GGCTAAGGTG GAGAATCGCA TGATGCGTGG ATCGATACAA
ACGGGAGACT TAGCGATCAA TGGCGCTCTA ATTTTTGCGT TACCGTATGT TTTTGGTTAG
TCCAACGGCA AAATACGAGG AAAGTTCGAT CCGCTTGGAT TGGTTTGTTG GCTTGGTTTG
CAAAGTGCTC CAACGAGCGA TGCATGTGGT CTTTTATGTC GAGAATCGTG TTTAGGAATA
AGCGATTGAC AACGGGCGAA CGCTGCCCTC GAACACATAA TTCGGAAGAT TTAATACTTC
TATGCTCCTC CCACAGAGTA GTTGCGACTT GA
 
Protein sequence
MVAAISPRKL GIAEGSVLGS KDGTALGSAE GIAEGAALGP ADGTAEGIAL GIAEGTALGS 
ADGAALGTAD GIADGTPEGT ADGAWLGTAE GIALGTAEGC ADGAALGIAE GIADGTPEGT
ADGAWLGTAE GIALGTAEGI ALGTAEGCAD GAALGTAEGC ADGAALGTAE GIALGTAEGC
ADGAVLGTAE GIADGTPDGT ADGAWLGTAE GIALGTAEGC ADGAALGTAE GCADGAALGT
AEGIALGTAE GCADGAVLGT AEGIADGTPD GTADGAWLGT AEGIALGTAE GCADGAALGT
AEGCADGAAL GTAEGIADGT TDGTADGAWL GTAEGIAEGT AEGCAEGISE GTAEGCADGA
ALGTADGIAD GTTDGTADGA WLGTAEGIAD GTTDGTADGA SLGTAEGIAE GTTEGTADGI
AEGTAEGSAD GAALGTAEGI ADGTADGTAD GASLGTAEGI AEGTAEGCAE GISEGTAEGC
ADGAALGIAE GIADGTTDGT ADGASLGTAE GIADGTTDGT ADGASLGTAE GIAEGTTEGT
ADGIAEGTAE GSADGAALGT AEGIADGTAD GTADGASLGT AEGIAEGTAE GCAEGISEGT
AEGCADGAAL GIAEGIADGT TDGTADGASL GTAEGIAEGT TEGTAEGIAE GPVEGNADGA
ALGIAEGIAD GTTDGTADGA SDGDAEGSAL GTADGRAEGS SDGSADGSEV GTSDGDAEGS
SLGTAEGTAE GTSEGNCEGA CEGTSDGALD GGALIVGDCD GESDGNAEGK SDGATEGTSE
GTSEGTADGV SVGVSEGTPE GKSEKTPVGV CEGLSEAFCE GDVDGDRDGA SDFLLLALFL
LCFELSIIPF VLFGFGTVGT RVGRTRLGPP EGNTGGAWVG ISMGIIVGSV EGAPEGFAKD
GEGVGASRSM VGTDDGTSDF LLFVLLAPPF PSLPLLFPSL PLPLPSPSPA FLLFEGGIAL
PPLLETTLMT LVKTGCFLSC RIAPISSADS TEEQPAETTP SKAKVENRMM RGSIQTGDLA
INGALIFALP VVAT
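
The sequences above are printed in space-separated groups of 10 residues, 60 per line, so they need to be joined before lengths or base composition can be compared with the record. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C among all bases of a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, used here as a small sample.
sample = "ATGGTAGCAG CGATTTCGCC TCGAAAGTAG ATCACCAAGG ATTTCGAGGG GTAAGCTCGC "
dna = clean_sequence(sample)
print(len(dna))                    # 60 bases on a full line
print(round(gc_fraction(dna), 2))  # GC fraction of this sample line only
```

Running `clean_sequence` on the complete gene and protein blocks and taking `len` of each result should reproduce the listed 3692 bp and 1034 aa if the blocks were copied without loss.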