Gene Tery_0836 details

Gene Information

Locus tag: Tery_0836
Symbol:
ID: 4243130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 1327223
End bp: 1330339
Gene Length: 3117 bp
Protein Length: 1038 aa
Translation table: 11
GC content: 38%
IMG OID: 638106110
Product: phosphoenolpyruvate carboxylase
Protein accession: YP_720722
Protein GI: 113474661
COG category: [C] Energy production and conversion
COG ID: [COG2352] Phosphoenolpyruvate carboxylase
TIGRFAM ID:
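
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1327223
END_BP = 1330339
REPORTED_GENE_LENGTH = 3117     # bp
REPORTED_PROTEIN_LENGTH = 1038  # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length == REPORTED_GENE_LENGTH)                     # True
print(protein_length(length) == REPORTED_PROTEIN_LENGTH)  # True
```

Under those assumptions the fields agree: 1330339 - 1327223 + 1 = 3117 bp, and 3117 / 3 = 1039 codons, of which one is the stop codon, leaving 1038 residues.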


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTCAC TACTACACTC AAAAGAAATA ACCCTAGAAC CAGGGTTCAA AAATCCTAAA 
ATGACAGCTT CAGATCTGTT TCTACATAAT CGAATCAAAA TAGTTGAAAA TTTGTGGGAA
TCAGTGCTCA GACAAGAGTG TGGCCAAGAA TTGGTAGATA TACTCCAGAA GATGCGCTCG
GGTCATTCTC CAGAAGGACA AGCATCTGAC TTTCTAGGTT CAGAAATTGA ACAACTGATT
GAAAAATTAG AACTAAAAGA TGCAATTCGG GCAGCTCGAG CTTTTGCTCT ATATTTTCAG
CTAATTAATA TTGTCGAACA ACATTATGAA CAAAAAATTC AACAACTAGC TTACTCTCAC
AATAACAGTC TAGAAAAACT AATCTCTAAA GATGATATAG CTAAAGATCA TGATACCCGT
TCGCTGCGGG TTGAGAGTAT ACCAGTGTGG AACGATAACA GAATTAAACA TGAGGGAGAG
GGTACATTTC ATTATTTATT TCCCCTCCTA CAGACCCTCA ATGTACCATC ACAATTAATT
CAACGACTAA TTAATAATTT AGATATCCGT TTAGTGTTTA CAGCACACCC AACGGAAATT
GTTCGTCGTA CAATTAGAAC AAAACAAAGA CGTATCGCCA AAATTCTCCA ACAGCTCGAT
CAAGTAAATG AAAGCCTTTC AGAATCACAC GTAGATGAAG AACAAGTAAA TTCTTTAGTG
TCATCTTGGA AAATAGAATC CCTCAAAGAA CAGTTAACAG AAGAAATTCT TTTATGGTGG
CGTACAGATG AACTACATCA ATTTAAACCT AGTGTCTTAG ATGAAGTAGA AACTACTCTT
CATTATTTTA ACGAAGTCTT ATTTGATGCT ACTCCTGAAC TACATCGTAG GTTTAAGCAA
GCCTTACATA GTTCCTTTCC TTATCTGAAA CCACCAAGTT ACAATTTCTG TAAATTTGGT
TCCTGGGTAG GATCTGACCG TGATGGAAAT CCTTTTTGTA CACCAGCAGT GACTTGGCAA
ACAGCTTGTT ACCAACGTCA GATAGTATTA GAAAAATATC TTAATGCTAT TGATCGCCTC
AAAGAACTTT TGAGTTTATC TCTACATTGG AGTGATGTGT TACCAGAATT GTTAGATTCT
CTAGACCGTG ACCATATACA GATGTCAGAA GTTTACGATC AATGGGCAAT TCGCTATCGG
CAAGAACCTT ATCGTTTGAA GTTGTCTTAT ATCAAAAAGC GTTTAGAAAA TACTCGCGAT
CGCAATGCGC GTTTATATAA TGGTGATGAA GTTCAAAGAC AAAAAAAAGA AGTTCTTTCT
CAACGACAAA AACAAGTTCT CTCTCAATAT CAAGAAACAA AAAGCATTTA TCACTCTAGT
GCTGATTTTC TAGCTGAATT ACAACTAATT CAGCGTAACC TCAAAGAAAC TGGTTTAAGC
TGTAGTGACT TAGAAAATCT AATCTCTCAA GTAGAAATTT TTGGTTTTAA CTTAGCACGA
CTAGATATTC GTCAAGAGTC TTCAGTCCAT GAAGCGGCGA TCCAAGAGAT TACTGAATAT
TTACAAATTC TGCCTAAATC TTATATAGAA ATGTCGGAGG CAGAGCGGAC TGAATGGTTA
TCAACAGAGT TGCCTACTCG TCGCCCCTTA ATTCCCACAG AGTTACCTTT TTCTGAGAAA
ACCTGCGAAA TAATTAATAC TTTCCGAATG CTGCGAGAAT TACAGCTAGA GTTTGGTGAA
GAAATTTGCC AAACCTATAT TATTAGTATG AGTAGGGATG TTAGCGATCT ATTAGAAGTG
TTGTTGTTAG CTCAAGAAGC AGGACTTTAT GATCCGGCAA CTGGTGCGAG TAGCATTCAC
GTGGTTCCTT TGTTTGAGAC AGTGGAAGAC TTGAGAAGTG CTCCCAGGGT AATGCACGAT
TTATTTAAGT TGTCCCTATA TCGTGCAGGG CTTGCTGGTG GATATGATAA ATTATCAAAA
GAGCCAATTA ATGAATTAGT TAACGAAGCT CCTTATTTGC AAGAGGTGAT GTTGGGTTAC
TCAGATAGTA ATAAAGATTC TGGGTTTTTG AGTAGTAACT GGGAAATTCA TAAAGCTCAA
AAAGCTTTAT ACAAAGTAGG GGAAGAGCAT GGCATTGCTT TGCGTATCTT TCATGGACGC
GGTGGTTCTG TAGGACGTGG TGGCGGTCCA GCTTATAAAG CTATTTTGGC TCAACCTGGT
AAAAGCATTA GTGGGCGGAT TAAAATTACT GAACAAGGAG AGGTGCTCGC CTCTAAATAT
TCTCTGCCTC ACTTAGCAAT GTTTAACTTG GAAAATGTTA CTACTGCAGT AATTCAAGCT
AGTTTGCTAC ATACAGGGTT TGATGAAATT GAAACTTGGA ATCAAATTAT GGAGGAGTTG
GCAGTGCGAT CGCGTAGCCA TTACCGAAAT CTGATTTATG AACAAGAAGA TCTAGTAGAA
TTTTTCTATC AAGTTACACC AATGCCAGAA ATTAGTCAAC TACAAATTAG TTCCCGTCCA
GCTCGGCGGA AAAATGATAA GAAGAAAACA ATTTCTGGTT TAAGGGCAAT TCCTTGGGTA
TTTAGTTGGA CTCAAAGTCG CTTTCTCTTA CCTGCTTGGT ATGGTGTGGG AACTGCTTTA
CAAGAGTTTG TGGAAAAAGA ACCAGAAGAA CACCTAAAAC TTTTGCAATA CTTTTATGTA
AAGTGGCCTT TCTTTACTAC TGCGATTTCT AAAGTGGAGA TGACTTTAGC TAAGGTGGAT
TTGCAAATTG CTCATTATTA TGTGCGCGAA TTATCCAAAC CAGAAGACCG GGAACGCTTT
GAGACACTGT TTGAAGAAAT TACTATTGAG TATCACCTAA CACGGAACTT AGTGCTACAA
ATTTCTGGTC ATCAACGACC TTTGGATGGA GATCCAGATT TACAGCGTTC TGTACAATTA
CGTAATGCAA CTATTATTCC TCTGGGTATG TTGCAGGTAG CTTTGCTGAA ACGTCTGCGT
CAGCATGATA CAGGTACACC TGGTGTAATT AATTCTCGTT ATAGTAAGAG TGAGTTATTA
CGGGGTGCTT TGTTGACTCT TAATGGTATC GCTGCTGGTA TGCGAAATAC AGGTTGA
 
Protein sequence
MSSLLHSKEI TLEPGFKNPK MTASDLFLHN RIKIVENLWE SVLRQECGQE LVDILQKMRS 
GHSPEGQASD FLGSEIEQLI EKLELKDAIR AARAFALYFQ LINIVEQHYE QKIQQLAYSH
NNSLEKLISK DDIAKDHDTR SLRVESIPVW NDNRIKHEGE GTFHYLFPLL QTLNVPSQLI
QRLINNLDIR LVFTAHPTEI VRRTIRTKQR RIAKILQQLD QVNESLSESH VDEEQVNSLV
SSWKIESLKE QLTEEILLWW RTDELHQFKP SVLDEVETTL HYFNEVLFDA TPELHRRFKQ
ALHSSFPYLK PPSYNFCKFG SWVGSDRDGN PFCTPAVTWQ TACYQRQIVL EKYLNAIDRL
KELLSLSLHW SDVLPELLDS LDRDHIQMSE VYDQWAIRYR QEPYRLKLSY IKKRLENTRD
RNARLYNGDE VQRQKKEVLS QRQKQVLSQY QETKSIYHSS ADFLAELQLI QRNLKETGLS
CSDLENLISQ VEIFGFNLAR LDIRQESSVH EAAIQEITEY LQILPKSYIE MSEAERTEWL
STELPTRRPL IPTELPFSEK TCEIINTFRM LRELQLEFGE EICQTYIISM SRDVSDLLEV
LLLAQEAGLY DPATGASSIH VVPLFETVED LRSAPRVMHD LFKLSLYRAG LAGGYDKLSK
EPINELVNEA PYLQEVMLGY SDSNKDSGFL SSNWEIHKAQ KALYKVGEEH GIALRIFHGR
GGSVGRGGGP AYKAILAQPG KSISGRIKIT EQGEVLASKY SLPHLAMFNL ENVTTAVIQA
SLLHTGFDEI ETWNQIMEEL AVRSRSHYRN LIYEQEDLVE FFYQVTPMPE ISQLQISSRP
ARRKNDKKKT ISGLRAIPWV FSWTQSRFLL PAWYGVGTAL QEFVEKEPEE HLKLLQYFYV
KWPFFTTAIS KVEMTLAKVD LQIAHYYVRE LSKPEDRERF ETLFEEITIE YHLTRNLVLQ
ISGHQRPLDG DPDLQRSVQL RNATIIPLGM LQVALLKRLR QHDTGTPGVI NSRYSKSELL
RGALLTLNGI AAGMRNTG
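
The relationship between the gene sequence and the protein sequence above can be checked with a minimal translation sketch. It assumes the 10-base space-separated blocks are purely presentational, and it uses the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). The helper names `clean` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAGCTCAC TACTACACTC AAAAGAAATA ACCCTAGAAC CAGGGTTCAA AAATCCTAAA"
print(translate(first_line))  # MSSLLHSKEITLEPGFKNPK
```

The printed residues match the first two blocks of the protein sequence (MSSLLHSKEI TLEPGFKNPK). Passing the full gene sequence to `translate` allows the same comparison over the whole record.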