Gene Tery_5066 details


Gene Information

Locus tag: Tery_5066
Symbol:
ID: 4246721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7727163
End bp: 7728194
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 39%
IMG OID: 638109867
Product: pyruvate dehydrogenase (lipoamide)
Protein accession: YP_724443
Protein GI: 113478382
COG category: [C] Energy production and conversion
COG ID: [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit
TIGRFAM ID: [TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit
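
The coordinates and lengths in this record are mutually consistent, which a small Python sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 7727163
end_bp = 7728194

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1032
print(protein_length)   # 343
```

Under these assumptions the computed values match the reported Gene Length (1032 bp) and Protein Length (343 aa).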


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCAAG AACGCACCGT ACCTAAATTT GATACTAATA GTGTAAAAAT CACTAAGGAA 
AAAGCATTAA TTCTATATGA AGATATGGTT TTAGGGCGCT TATTTGAAGA TAAGTGTGCT
GAAATGTATT ACCGAGGCAA AATGTTTGGC TTTGTCCATC TTTATAATGG ACAAGAAGCA
GTTTCTTCCG GTGTAATTAA AGCTATGCGT CAAGATGAAG ATTTTGTTAG TAGCACCTAT
AGAGACCACG TTCATGCTCT AAGTGCTGGT GTACCTGCAC GAGAAGTAAT GGCTGAATTA
TTTGGCAAAG CTACAGGTTG TTCTAAAGGT CGTGGTGGTT CGATGCATAT GTTCTCAGCT
ACGCATAATC TATTAGGGGG TTATGCTTTT GTTGCTGAAG GTATTCCTGT CGCGACTGGG
GCAGCTTTTC AGAGTAAATA TCGTCGGGAA ACTATGGGAA ATCAAGCGGC TGACCAAGTA
ACGGCTTGTT TCTTTGGAGA TGGAGCTTGT AATAATGGTC AGTTTTTTGA ATGCCTAAAT
ATGGCTGCAC TGTGGAAACT ACCAATTATT TATGTAGTAG AAAATAACAA ATGGGCAATT
GGTATGGCTC ACGAACGCGC TACTTCTGAA CCGGAAATTT ATAAAAAAGC TCATGCTTTT
GGCATGGTAG GTGTAGAAGT TGATGGTATG GATATATTAG CAGTACATTC TGCTGCTCAA
GAGGCTGTTG CTCGGGCTCG TGCTGGCGAA GGACCAACTT TAATTGAAGC TTTGACTTAT
CGTTTTCGAG GACATTCTTT GGCAGACCCT GATGAATTAA GAGATCAAGA AGAAAAGCAA
TATTGGTTTT CTCGTGACCC CATTAAAAAA TTTACAACTT ACTTAACAGA AAATAATTTG
GTAGATGTTG CAGAATTAGT GGCAATTGAT AAAAAGATTG AGAATTTAAT TACTGAAGCT
GTTGATTTTG CTACAAATAG TCCAGAACCA GGTTCAGATG AACTATATCG GTATATTTTT
GCAGAGGATT AG
 
Protein sequence
MIQERTVPKF DTNSVKITKE KALILYEDMV LGRLFEDKCA EMYYRGKMFG FVHLYNGQEA 
VSSGVIKAMR QDEDFVSSTY RDHVHALSAG VPAREVMAEL FGKATGCSKG RGGSMHMFSA
THNLLGGYAF VAEGIPVATG AAFQSKYRRE TMGNQAADQV TACFFGDGAC NNGQFFECLN
MAALWKLPII YVVENNKWAI GMAHERATSE PEIYKKAHAF GMVGVEVDGM DILAVHSAAQ
EAVARARAGE GPTLIEALTY RFRGHSLADP DELRDQEEKQ YWFSRDPIKK FTTYLTENNL
VDVAELVAID KKIENLITEA VDFATNSPEP GSDELYRYIF AED
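
As a sanity check on the two sequences above, the following Python sketch translates a nucleotide sequence and computes a GC fraction. It uses the codon assignments shared by NCBI translation tables 1 and 11 and does not handle alternative start codons, which is harmless here because the gene begins with ATG. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names, not part of any database tooling, and only the first 30 bases and the final 12 bases of the gene are used for brevity.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq, to_stop=True):
    """Translate codon by codon; unknown codons become 'X'."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three display blocks and the last line of the gene sequence above.
gene_start = "ATGATCCAAG AACGCACCGT ACCTAAATTT"
gene_end = "GCAGAGGATT AG"

print(translate(gene_start))  # MIQERTVPKF
print(translate(gene_end))    # AED
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content.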