Gene Information |
Locus tag | Tery_0836 |
Symbol | |
ID | 4243130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1327223 |
End bp | 1330339 |
Gene Length | 3117 bp |
Protein Length | 1038 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638106110 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_720722 |
Protein GI | 113474661 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
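The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the helper names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the Gene Information rows above.
START_BP = 1327223
END_BP = 1330339


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3117 1038
```

Both results agree with the Gene Length and Protein Length rows, which suggests the inclusive-coordinate assumption holds for this record.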
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCAC TACTACACTC AAAAGAAATA ACCCTAGAAC CAGGGTTCAA AAATCCTAAA ATGACAGCTT CAGATCTGTT TCTACATAAT CGAATCAAAA TAGTTGAAAA TTTGTGGGAA TCAGTGCTCA GACAAGAGTG TGGCCAAGAA TTGGTAGATA TACTCCAGAA GATGCGCTCG GGTCATTCTC CAGAAGGACA AGCATCTGAC TTTCTAGGTT CAGAAATTGA ACAACTGATT GAAAAATTAG AACTAAAAGA TGCAATTCGG GCAGCTCGAG CTTTTGCTCT ATATTTTCAG CTAATTAATA TTGTCGAACA ACATTATGAA CAAAAAATTC AACAACTAGC TTACTCTCAC AATAACAGTC TAGAAAAACT AATCTCTAAA GATGATATAG CTAAAGATCA TGATACCCGT TCGCTGCGGG TTGAGAGTAT ACCAGTGTGG AACGATAACA GAATTAAACA TGAGGGAGAG GGTACATTTC ATTATTTATT TCCCCTCCTA CAGACCCTCA ATGTACCATC ACAATTAATT CAACGACTAA TTAATAATTT AGATATCCGT TTAGTGTTTA CAGCACACCC AACGGAAATT GTTCGTCGTA CAATTAGAAC AAAACAAAGA CGTATCGCCA AAATTCTCCA ACAGCTCGAT CAAGTAAATG AAAGCCTTTC AGAATCACAC GTAGATGAAG AACAAGTAAA TTCTTTAGTG TCATCTTGGA AAATAGAATC CCTCAAAGAA CAGTTAACAG AAGAAATTCT TTTATGGTGG CGTACAGATG AACTACATCA ATTTAAACCT AGTGTCTTAG ATGAAGTAGA AACTACTCTT CATTATTTTA ACGAAGTCTT ATTTGATGCT ACTCCTGAAC TACATCGTAG GTTTAAGCAA GCCTTACATA GTTCCTTTCC TTATCTGAAA CCACCAAGTT ACAATTTCTG TAAATTTGGT TCCTGGGTAG GATCTGACCG TGATGGAAAT CCTTTTTGTA CACCAGCAGT GACTTGGCAA ACAGCTTGTT ACCAACGTCA GATAGTATTA GAAAAATATC TTAATGCTAT TGATCGCCTC AAAGAACTTT TGAGTTTATC TCTACATTGG AGTGATGTGT TACCAGAATT GTTAGATTCT CTAGACCGTG ACCATATACA GATGTCAGAA GTTTACGATC AATGGGCAAT TCGCTATCGG CAAGAACCTT ATCGTTTGAA GTTGTCTTAT ATCAAAAAGC GTTTAGAAAA TACTCGCGAT CGCAATGCGC GTTTATATAA TGGTGATGAA GTTCAAAGAC AAAAAAAAGA AGTTCTTTCT CAACGACAAA AACAAGTTCT CTCTCAATAT CAAGAAACAA AAAGCATTTA TCACTCTAGT GCTGATTTTC TAGCTGAATT ACAACTAATT CAGCGTAACC TCAAAGAAAC TGGTTTAAGC TGTAGTGACT TAGAAAATCT AATCTCTCAA GTAGAAATTT TTGGTTTTAA CTTAGCACGA CTAGATATTC GTCAAGAGTC TTCAGTCCAT GAAGCGGCGA TCCAAGAGAT TACTGAATAT TTACAAATTC TGCCTAAATC TTATATAGAA ATGTCGGAGG CAGAGCGGAC TGAATGGTTA TCAACAGAGT TGCCTACTCG TCGCCCCTTA ATTCCCACAG AGTTACCTTT TTCTGAGAAA ACCTGCGAAA TAATTAATAC TTTCCGAATG CTGCGAGAAT TACAGCTAGA GTTTGGTGAA GAAATTTGCC AAACCTATAT TATTAGTATG AGTAGGGATG TTAGCGATCT ATTAGAAGTG 
TTGTTGTTAG CTCAAGAAGC AGGACTTTAT GATCCGGCAA CTGGTGCGAG TAGCATTCAC GTGGTTCCTT TGTTTGAGAC AGTGGAAGAC TTGAGAAGTG CTCCCAGGGT AATGCACGAT TTATTTAAGT TGTCCCTATA TCGTGCAGGG CTTGCTGGTG GATATGATAA ATTATCAAAA GAGCCAATTA ATGAATTAGT TAACGAAGCT CCTTATTTGC AAGAGGTGAT GTTGGGTTAC TCAGATAGTA ATAAAGATTC TGGGTTTTTG AGTAGTAACT GGGAAATTCA TAAAGCTCAA AAAGCTTTAT ACAAAGTAGG GGAAGAGCAT GGCATTGCTT TGCGTATCTT TCATGGACGC GGTGGTTCTG TAGGACGTGG TGGCGGTCCA GCTTATAAAG CTATTTTGGC TCAACCTGGT AAAAGCATTA GTGGGCGGAT TAAAATTACT GAACAAGGAG AGGTGCTCGC CTCTAAATAT TCTCTGCCTC ACTTAGCAAT GTTTAACTTG GAAAATGTTA CTACTGCAGT AATTCAAGCT AGTTTGCTAC ATACAGGGTT TGATGAAATT GAAACTTGGA ATCAAATTAT GGAGGAGTTG GCAGTGCGAT CGCGTAGCCA TTACCGAAAT CTGATTTATG AACAAGAAGA TCTAGTAGAA TTTTTCTATC AAGTTACACC AATGCCAGAA ATTAGTCAAC TACAAATTAG TTCCCGTCCA GCTCGGCGGA AAAATGATAA GAAGAAAACA ATTTCTGGTT TAAGGGCAAT TCCTTGGGTA TTTAGTTGGA CTCAAAGTCG CTTTCTCTTA CCTGCTTGGT ATGGTGTGGG AACTGCTTTA CAAGAGTTTG TGGAAAAAGA ACCAGAAGAA CACCTAAAAC TTTTGCAATA CTTTTATGTA AAGTGGCCTT TCTTTACTAC TGCGATTTCT AAAGTGGAGA TGACTTTAGC TAAGGTGGAT TTGCAAATTG CTCATTATTA TGTGCGCGAA TTATCCAAAC CAGAAGACCG GGAACGCTTT GAGACACTGT TTGAAGAAAT TACTATTGAG TATCACCTAA CACGGAACTT AGTGCTACAA ATTTCTGGTC ATCAACGACC TTTGGATGGA GATCCAGATT TACAGCGTTC TGTACAATTA CGTAATGCAA CTATTATTCC TCTGGGTATG TTGCAGGTAG CTTTGCTGAA ACGTCTGCGT CAGCATGATA CAGGTACACC TGGTGTAATT AATTCTCGTT ATAGTAAGAG TGAGTTATTA CGGGGTGCTT TGTTGACTCT TAATGGTATC GCTGCTGGTA TGCGAAATAC AGGTTGA
|
Protein sequence | MSSLLHSKEI TLEPGFKNPK MTASDLFLHN RIKIVENLWE SVLRQECGQE LVDILQKMRS GHSPEGQASD FLGSEIEQLI EKLELKDAIR AARAFALYFQ LINIVEQHYE QKIQQLAYSH NNSLEKLISK DDIAKDHDTR SLRVESIPVW NDNRIKHEGE GTFHYLFPLL QTLNVPSQLI QRLINNLDIR LVFTAHPTEI VRRTIRTKQR RIAKILQQLD QVNESLSESH VDEEQVNSLV SSWKIESLKE QLTEEILLWW RTDELHQFKP SVLDEVETTL HYFNEVLFDA TPELHRRFKQ ALHSSFPYLK PPSYNFCKFG SWVGSDRDGN PFCTPAVTWQ TACYQRQIVL EKYLNAIDRL KELLSLSLHW SDVLPELLDS LDRDHIQMSE VYDQWAIRYR QEPYRLKLSY IKKRLENTRD RNARLYNGDE VQRQKKEVLS QRQKQVLSQY QETKSIYHSS ADFLAELQLI QRNLKETGLS CSDLENLISQ VEIFGFNLAR LDIRQESSVH EAAIQEITEY LQILPKSYIE MSEAERTEWL STELPTRRPL IPTELPFSEK TCEIINTFRM LRELQLEFGE EICQTYIISM SRDVSDLLEV LLLAQEAGLY DPATGASSIH VVPLFETVED LRSAPRVMHD LFKLSLYRAG LAGGYDKLSK EPINELVNEA PYLQEVMLGY SDSNKDSGFL SSNWEIHKAQ KALYKVGEEH GIALRIFHGR GGSVGRGGGP AYKAILAQPG KSISGRIKIT EQGEVLASKY SLPHLAMFNL ENVTTAVIQA SLLHTGFDEI ETWNQIMEEL AVRSRSHYRN LIYEQEDLVE FFYQVTPMPE ISQLQISSRP ARRKNDKKKT ISGLRAIPWV FSWTQSRFLL PAWYGVGTAL QEFVEKEPEE HLKLLQYFYV KWPFFTTAIS KVEMTLAKVD LQIAHYYVRE LSKPEDRERF ETLFEEITIE YHLTRNLVLQ ISGHQRPLDG DPDLQRSVQL RNATIIPLGM LQVALLKRLR QHDTGTPGVI NSRYSKSELL RGALLTLNGI AAGMRNTG
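The sequences above are printed in space-separated blocks of ten, so they need normalizing before any analysis. The sketch below is one way to do that and to run two simple checks (GC fraction and a terminal stop codon). The function names are hypothetical, and the stop-codon set is the standard one for translation table 11 (TAA, TAG, TGA). Only short excerpts from the start and end of the gene sequence are used here, so the GC value printed applies to the excerpt, not to the 38% reported for the whole gene.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Strip the spaces/newlines that group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the normalized sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return normalize(seq)[-3:] in STOP_CODONS


# First two and last two blocks copied from the gene sequence above.
head = "ATGAGCTCAC TACTACACTC"
tail = "TGCGAAATAC AGGTTGA"

print(normalize(head)[:3])   # ATG
print(ends_with_stop(tail))  # True
print(gc_fraction(head))     # 0.45
```

Passing the full gene sequence to `gc_fraction` instead of `head` would give the value to compare against the GC content row.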