Gene Tery_3171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3171 
Symbol 
ID4243842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4839782 
End bp4841212 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content38% 
IMG OID638108180 
Productdihydrolipoamide dehydrogenase 
Protein accessionYP_722772 
Protein GI113476711 
COG category[C] Energy production and conversion 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID[TIGR01350] dihydrolipoamide dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.138028 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAG AATTTGATTA CGACTTAATA ATTATTGGTG CAGGTGTCGG TGGACATGGT 
GCAGCATTAC ACGCTACCAG TTGTGGCCTA AAAACAGCTA TTGTAGAAGT AGCAGAAATG
GGAGGTACTT GTGTTAACCG AGGTTGTATA CCATCTAAGG CACTTCTCGC AGCATCAGGT
AAAGTTCGAG AGTTACGAAA TGCTCACCAC TTAAAAACTT TGGGAATTGA GTTGGATAAT
GTTTCTTATG ACAGACAAGT AATGGCCACT CATGCAAGTA ACATTGTGAC CAAAATTAGA
GGTGACATGA GCAAAAGCCT TAAACGTCTG AGTGTAGATA TTATTACAGG GTGGGCTCAG
GTAGCAGGAA AACAAAAAGT TACGGTTAAG ACGGAAAAGG GAGAAGAAAA CTTTACTGCC
AAAGATATTA TACTTGCTCC TGGTTCAGTA CCTTTTGTTC CTCCTGGAAT AGAATTGGAT
GGTAAAACAG TATTTACCAG TGATGATGCT CTTAAACTAG ACTGGTTACC ACCTTGGGTT
GCAATTATTG GTAGTGGTTA TATAGGACTA GAATTTTCTG ATATTTACAC TGCCCTTGGA
TCTGAAATTA CGATGATTGA GGCATTAGAT AAGTTAATGC CTACTTTCGA TCCAGATATA
GCTAAGATTG CACAAAGAGT TCTAATTCAG TCAAGAGATA TTGAAGTAAA AGTAGGGAAG
TTGGCTATAA AGGTAGTTCC TGGATCTCCG GTAATTATTG AACTTGCCGA TGCCAAGACT
AAAGAAGTAG AAGAAATTAT AGAGGTTGAT GCTTGTCTAG TTGCCACAGG TCGCATTCCC
TATACAAAAG ATTTAGGACT AGATTCTGTA GCAGTAGAAA CTGATAAATA TGGATTTATT
CCAGTAAATA GCAAAATGGC AGTTTTGTCA AGTGGTGAAC CAGTACCTAA TTTATGGGCA
ATTGGTGATG CAACAGGAAA AATGATGTTG GCTCATGCAG CATCTGCCCA AGGAATAACA
GTGGTAGAAA ATATATGTGG TCGTGATCGA GAACCAGATT ATCTTAGTAT TCCGGCGGCA
GCTTTTACTC ATCCAGAAAT TAGCTATGTT GGTATGACAG AACCAGCAGC AAAAGATTTA
GGCCAAAAAC AGGGGTTTGA AGTGGCAAGT GTCAGAACTT ATTTTAAGGG TAATTCTAAG
GCGATAGCTG AAGATGAAAC AGATGGTATT GCTAAAGTAA TTTATCGTCA AGATACAGGA
GAATTATTAG GAGTACATAT TATTGGTCTT CATGCCTCTG ACTTAATTCA AGAAGCAGCA
AATGCTATAG CTAAAAAACA ATCTGTTAAT GAGTTATCTT TTAATGTACA TACTCATCCT
ACTTTATCAG AAGTTTTGGA TGAAGCATTT AAACGAGCCA CTGTTCACTA G
 
Protein sequence
MTQEFDYDLI IIGAGVGGHG AALHATSCGL KTAIVEVAEM GGTCVNRGCI PSKALLAASG 
KVRELRNAHH LKTLGIELDN VSYDRQVMAT HASNIVTKIR GDMSKSLKRL SVDIITGWAQ
VAGKQKVTVK TEKGEENFTA KDIILAPGSV PFVPPGIELD GKTVFTSDDA LKLDWLPPWV
AIIGSGYIGL EFSDIYTALG SEITMIEALD KLMPTFDPDI AKIAQRVLIQ SRDIEVKVGK
LAIKVVPGSP VIIELADAKT KEVEEIIEVD ACLVATGRIP YTKDLGLDSV AVETDKYGFI
PVNSKMAVLS SGEPVPNLWA IGDATGKMML AHAASAQGIT VVENICGRDR EPDYLSIPAA
AFTHPEISYV GMTEPAAKDL GQKQGFEVAS VRTYFKGNSK AIAEDETDGI AKVIYRQDTG
ELLGVHIIGL HASDLIQEAA NAIAKKQSVN ELSFNVHTHP TLSEVLDEAF KRATVH