Gene Tery_0938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0938 
Symbol 
ID4245677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1474632 
End bp1477691 
Gene Length3060 bp 
Protein Length1019 aa 
Translation table11 
GC content31% 
IMG OID638106193 
Producthypothetical protein 
Protein accessionYP_720805 
Protein GI113474744 
COG category[S] Function unknown 
COG ID[COG1615] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.391922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAC ATATTATTTA TCAATTAAAA TGTAGGTTGA TTAAAGCATT TAAAAGAATC 
AAATCAATAC ATATATTAAT AATATTATTT TTTTCATTTG TATGCTGGGA AATACTTGGA
TTTTCAACTA ATATATTAGC TGATTTTTTA TGGTTTCAAG AGCTGAATTA TTTGCCTGTA
TTAATAAATA AACTGCAAAC AGAAACTTGG CTTTGGATAA CAACATTTTT AATTAGTATG
GGCTTTTTTT TAGTGAATTT GAGATTAGCC AGCTTTTTTA AATATTCAAA AAAACAAGTT
CGGATAACTG AAAGATCTGA AGAAATTATG CTAATTCCCC CAGTTACTAT ACCGTCAAGT
AAACTACGAA TTGAGCCGTC ACTAAGTCTA AGCTGGTTGT TATGTTGTAT ATTTGGATTA
ATTTTATTAG TAGGATTAAT TTTAACCCAT TATATTGACG TATTTACTAA TTACTTGTAT
CCTGATTTAA CAGTTGCTAA TGTATCTCCT CAAATCCCAT CAGAATTTAA CATCGAATCA
ATTTGTAAAA TACTCACCTC AATACTATCT AATTTGTGGT TATTAGGATT ATTTCTTCTA
TTATCTTTTG CTATAATTAT TAATCCTATA CTCTGGTTAA GTGTATTTGC AGTAGTTCTC
AGTTTAGTTT TTAGCTTTAT TCTTTCCAGC CATTGGGCAA ATATATTACA GCTGTTGCAT
GGGACTCCCT TTAATAAAAG TGAAGATTTA TTTCATATAG ATATAAGCTT TTATGTTTTT
CAACTCCCCG TTTTAGAGCT ATTAAGATTT TGGTTAATTG GATTATTTCT ATATGGATTT
GTTGCTTGTA TTTTGATATA TTTATTATCA GGAAAAAGTT TAAGCCAAGG AAATTTTTAT
CAATTTTCTC AACAGCAAGA AAAGCATCTT CACGGTTTAG GTGGAGGTTT TATATTAACC
ATAGCATTTA GTTACTTTAT AGCCTGTTTT GAGTTACTTT ACTCTCGCCG TGGTGTAGTT
TATGGTGCCG GTTATACCGA TATAAAAGTT CAGCTTCCAG CATATGTATT TTTAGGGATT
TTGGCGTTAC TAATTGCATT TTTTCTATTT TGGCAAGCAA TTTTTTCAGT CAAAAGTATT
CAGTCTTATA TTGAGGCAAG TTTGTGGTTT TTACGTTTAG GTCGTAAGAG AAAAAGAAAG
AAAAAAGTTA TTGCTAAACT ATTCGCTAAT AGCTATTCAT TAAGAGCAAT TTTGACATGG
TATTTAATAA TAGCAGTAAT TGCTGGTTGG TTAATACCAA AAATTGTACA AATGGCAATT
GTCCAACCTA ATGAGATAGA ACGAGAAATT CCTTATATTA AACGTAGCAT TACCTTTACT
AAAGAAGCTT ATATTGATGT AGATAAATTA GAAGTAGAAT TATTCGACCC AAATAATGAG
CTTACCTATG ATGACTTAAT AAATAATAAG TTAATCATTG AGAATATTCG TCTTTGGGAT
ACAAGACCAA TTTTACAAAC GAATCGTCAA TTGCAACAAA TTAGACCCTA CTATGAATTT
ATAAATGCTG ATATTGATCG TTATACATTT CTGAAAAAAG AGTCAGAAAG AACAAAAAAT
AATCTTACTA AAAAACAACA AGTAATTATA GCTGCTAGAG AACTAAACTA CGAATCTGTA
CCTCAGCCAG CACAGACTTG GGTCAATGAA CATTTAGTTT ATACTCATGG CTATGGTTTT
ACTCTTTCTC CAATTAATCA AGTTGAAAAA AATGGATTAC CAGAATATTT TGTGAAAAAT
ATTGGGCCAG ATCCTACTTT GGAAAAGAAT AGCACTTTAG AAGTATTAAA CAGGATTAGA
GACAGCATTC CCATCGGTAA ACCGAGAATT TATTATGGAG AACTTACTAA TACTAATATT
ATGACTTCTA CTGCACAAAG AAATAAAGAA TTAGATTACC CTAGTGGAGA AGCGAACTCT
TATAACACTT ATGATGGAAG TGGAGGAATT GTTATTGGTC AAGGGTGGCA AAGATGGATA
TTTGCTAAAT ATCTTAAAGA CTGGAAAATG TTATTAACTA ATGAATTTAT ACCTGAAACA
AAACTATTAT ATCGTCGTAA TATTAATGCT AGAGTCCGAA GTATAGCTCC ATTTCTACGT
TATGATCATG ACCCTTATTT AGTGGTGGCT GACCCTAACT TTGGTCATAA AAATATGAAT
CAAAAAAATC CTAATTATCT ATACTGGATT ATTGATGCTT ATACTACGAC TAATCACTAC
CCTTATTCTG ACCCAGAAAA TAATGAGTTT AACTATATTC GTAATTCAGT AAAAGTTGTA
ATTGATGCCT ATAATGGTTC AGTAAAATTC TATGTTGCTG ACCCAAAAGA CCCTATTATT
AGAACCTGGA AAAAAGCATT TTCAGATATG TTTAATTCCA TTGAAGAAAT GCCAACTAGT
CTTTATACTC ATATCCGCTA TCCACTAGAT TTATTTCAAG TACAATCTGA AGTTTTGTCA
ACTTATCATA TGGATGACCC TCGTGTATTT TATAATCGGG AAGACTTGTG GCGGGTTCCA
ATTGAGATTT ATGGGGCTCA ACAACAAAAA GTCAAACCTT ATTATCTAAT CACACAATTA
CCAACAGAAA CTTCAGAAGA ATTCATTTTA CTTCTACCTT ATACTCCAGC AAGTCGTAAT
AATTTAATTG CTTGGTTAGC AGCAAGATCG GATGGGGAAA ATTATGGTAA GTTACTGTTA
TATCAATTCC CTAAACAACG ATTAATATAT GGTATAGAAC AAATTGAAGC TTTGATTAAT
CAAGACCCAG AAATATCCCA GCAAATTTCT CTTTGGAATC GTCAAGGTTC AAAAGCAATT
AAAGGGAATT TATTAGTAAT TCCAATTAAT GAATCTCTGA TTTATGTTGA GCCTATTTAT
TTAGAAGCAG AGCAAAATAG TTTGCCAACT TTAAGAAGAG TAATTGTTTC TTATAAAAAC
CGAGTTGTTA TGAAGCCTAC TCTTGATGAA GCACTTCAGG AGGTTTTTCA AATACAATAA
 
Protein sequence
MKIHIIYQLK CRLIKAFKRI KSIHILIILF FSFVCWEILG FSTNILADFL WFQELNYLPV 
LINKLQTETW LWITTFLISM GFFLVNLRLA SFFKYSKKQV RITERSEEIM LIPPVTIPSS
KLRIEPSLSL SWLLCCIFGL ILLVGLILTH YIDVFTNYLY PDLTVANVSP QIPSEFNIES
ICKILTSILS NLWLLGLFLL LSFAIIINPI LWLSVFAVVL SLVFSFILSS HWANILQLLH
GTPFNKSEDL FHIDISFYVF QLPVLELLRF WLIGLFLYGF VACILIYLLS GKSLSQGNFY
QFSQQQEKHL HGLGGGFILT IAFSYFIACF ELLYSRRGVV YGAGYTDIKV QLPAYVFLGI
LALLIAFFLF WQAIFSVKSI QSYIEASLWF LRLGRKRKRK KKVIAKLFAN SYSLRAILTW
YLIIAVIAGW LIPKIVQMAI VQPNEIEREI PYIKRSITFT KEAYIDVDKL EVELFDPNNE
LTYDDLINNK LIIENIRLWD TRPILQTNRQ LQQIRPYYEF INADIDRYTF LKKESERTKN
NLTKKQQVII AARELNYESV PQPAQTWVNE HLVYTHGYGF TLSPINQVEK NGLPEYFVKN
IGPDPTLEKN STLEVLNRIR DSIPIGKPRI YYGELTNTNI MTSTAQRNKE LDYPSGEANS
YNTYDGSGGI VIGQGWQRWI FAKYLKDWKM LLTNEFIPET KLLYRRNINA RVRSIAPFLR
YDHDPYLVVA DPNFGHKNMN QKNPNYLYWI IDAYTTTNHY PYSDPENNEF NYIRNSVKVV
IDAYNGSVKF YVADPKDPII RTWKKAFSDM FNSIEEMPTS LYTHIRYPLD LFQVQSEVLS
TYHMDDPRVF YNREDLWRVP IEIYGAQQQK VKPYYLITQL PTETSEEFIL LLPYTPASRN
NLIAWLAARS DGENYGKLLL YQFPKQRLIY GIEQIEALIN QDPEISQQIS LWNRQGSKAI
KGNLLVIPIN ESLIYVEPIY LEAEQNSLPT LRRVIVSYKN RVVMKPTLDE ALQEVFQIQ