Gene TM1040_3504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3504 
Symbol 
ID4075183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp540502 
End bp541899 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content62% 
IMG OID638005019 
Productdihydrolipoamide dehydrogenase 
Protein accessionYP_611738 
Protein GI99078480 
COG category[C] Energy production and conversion 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID[TIGR01350] dihydrolipoamide dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.305711 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCCT ATGACGTCAT CGTAATCGGC GCCGGTCCCG GCGGCTATGT CAGCGCAATC 
CGCTGCGCCC AGCTGGGCCT CAAGACCGCC ATCGTAGAAG GCCGCGAAAC CCTTGGCGGC
ACCTGCCTCA ACGTGGGCTG CATCCCCTCC AAGGCGCTCT TGCACGCAAC CCATATGCTG
CATGAAGCAG AGCACAACTT CGGCGCCATG GGTCTCAAGG GCAAAAGCCC CTCGGTCGAC
TGGAACCAGA TGAAATCCTA CAAGGATGAG GTCATCGGCC AGAACACCGG CGGTGTCGAG
TTCCTCATGA AGAAGAACAA GATCGACTGG ATCAAGGGCT GGGCGTCGAT CCCCGAGGCG
GGCAAGGTCA AAGTGGGCGA CGACACCCAT GAGGCCAAGA ACATCATCAT CGCCTCCGGC
TCCGTGCCCT CCGCCCTGCC GGGTGTCGAG GTCGACAACG ACAAGGGCCT TGTGGTCGAC
AGCACCGGCG CTCTGGAACT GCCCAAAGTC CCGAAGAAAA TGGTCGTGAT CGGCGCAGGC
GTCATTGGCC TCGAGCTCGG CTCGGTCTAC GCGCGCCTTG GCGCAGAGGT CACCGTGGTC
GAATATATGG ACGCGGTCTG TCCCGGCATG GACAAGGACG TCCAGCGCGG CTTCAAACGC
ATCCTTGAAA AGCAGGGCCT CAGCTTCATC ATGGGGGCCG CCGTCAAGGG CGTTGAAACC
ACGAAATCCA AGGCCAAAGT CTCTTATGAG CCCAAAAAAG GCGGCGACGC AGAGGTCATC
GAGGCCGATG TGGTGCTCGT CGCCACGGGG CGCAAACCCT ATGCCGAAGG CCTTGGCCTC
GACGCGCTTG GCGTCAAGAT GACCGAACGC GGCCAGATCG CCACCGACGC GCAATGGGCC
ACGAACGTCA AAGGCATCTA CGCCATCGGC GACGTCATCG AGGGTCCGAT GCTCGCGCAT
AAGGCCGAAG ACGAAGGCAT GGCCGTGGCC GAAGTGATCG CGGGCAAACA TGGCCACGTG
AATTACGGCG TCATTCCCGG TGTGGTCTAC ACCACCCCAG AGGTGGCGAC CGTCGGTGCC
ACCGAAGACG CGCTCAAGGC CGAAGGTCGC AAGATCAAGG TGGGCAAGTT CATGTTCATG
GGCAACGCCC GCGCCAAGGC CGTGCATCAG GCCGAGGGTG GTTTTGTGAA ACTGATTGCC
GACAAGGAAA CCGACCGCAT CCTCGGTGCG GCCATCATCG GCCCCGGTGC AGGCGATCTG
ATCCACGAGA TCTGTGTGGC GATGGAATTT GGCGCCTCCG CCGAGGATCT GGCGCTGACC
TGCCACGCGC ATCCGACCTA TTCCGAGGCC GTGCGCGAAG CAGCCCTTGC CTGCGGCGAC
GGCGCGATCC ACAGTTAA
 
Protein sequence
MASYDVIVIG AGPGGYVSAI RCAQLGLKTA IVEGRETLGG TCLNVGCIPS KALLHATHML 
HEAEHNFGAM GLKGKSPSVD WNQMKSYKDE VIGQNTGGVE FLMKKNKIDW IKGWASIPEA
GKVKVGDDTH EAKNIIIASG SVPSALPGVE VDNDKGLVVD STGALELPKV PKKMVVIGAG
VIGLELGSVY ARLGAEVTVV EYMDAVCPGM DKDVQRGFKR ILEKQGLSFI MGAAVKGVET
TKSKAKVSYE PKKGGDAEVI EADVVLVATG RKPYAEGLGL DALGVKMTER GQIATDAQWA
TNVKGIYAIG DVIEGPMLAH KAEDEGMAVA EVIAGKHGHV NYGVIPGVVY TTPEVATVGA
TEDALKAEGR KIKVGKFMFM GNARAKAVHQ AEGGFVKLIA DKETDRILGA AIIGPGAGDL
IHEICVAMEF GASAEDLALT CHAHPTYSEA VREAALACGD GAIHS