Gene Acid345_2791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2791 
Symbol 
ID4072414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3303505 
End bp3305352 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content61% 
IMG OID637984809 
Productdihydrolipoamide acetyltransferase 
Protein accessionYP_591866 
Protein GI94969818 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.319819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACAGG AATTTAAACT CCCCGAACTG GGTGAGAATA TTGCCAGCGG CGACCTGGTT 
CGTGTGATGG TGAAGCCCGG CGACACCGTC AAGGAAGGCC AGCCTGTGAT CGAACTCGAG
ACGGACAAGG CCGTCATCGA AGTTCCTTCC ACCGTCAGCG GCAAGGTACA GGAAGTCAAA
GTCCAAAAAG GCCAGAAGCT GAAAGTCGGC GCCATCATCT TCACCTATGG CGACGGCGCG
GCCGCCGCCC CGGTGCAGCC CGCGGCTCCT GCAAAGACCG AAGACAAACC CAAGGCCGAG
CCTAAGGCTG AAGCACCCAA GCAAGCTGCG CCTTCTGCTG CAAAACCTGC CGCGAGCACC
GGCACAAAAC AAACGATCGA GTTCAAGCTT CCGGAACTAG GCGAGAACAT AAAGCAAGGC
CAGCTCGTCC GCATCATCGC TAAGCAAGGT GCGAGTGTCA GCGACGGCCA ACCGATCCTC
GAACTTGAAA CCGACAAAGC TGTCATCGAA GTTCCCGCCA CATTGACCGG CACCATCAAA
GAAGTGCATG TGAAGGAAGG CGACAAGATC GGCGTTGGCC AAACGATCTT CACCGTCGAA
ACCACCGAGG GTAACACCCA GCCGCCACAT CCACACACCA ACACCGAGGG CAACACGCAA
CCGCCCACCG GCGGCGGCGC TTCCTCGAAC ACGGAAGGCA ATACTCAGCC CCCGCATCCG
CACTCCAACA CCGAAGGCAA TCCACAGCCG CCCACTGGCG GCGGTGGATC TTCTTCCGCG
ACCGCTGCTC GCGACTTCGA ACTCAGCGGA CAGCAACTTG CTCGCCTTCA GTTCGAACTC
GCTCTGCGCA GCGAAGGCAA AACCGAACGC GAGGCGCATC CGCCCGACGT TCGTGATCTT
GGCGTGCGGG TCTCTCTCAC GCCACTGACG CCGGGCCGTC CCACTGTCGC CGCGTCGCCC
ACGGTACGCC GTCTCGCTCG CGAAATCGGC GTCGACATCG TGCAGGTGAA GGGAACCGGT
CCCGGCGGAC GCATCAGCGA AGGGGATGTA AAGCTCTTCG CAAAGCAACT GATCGTGCGC
CTCCAGCACG AAGCCGCTAC CGCGAAGGCC GCTCCCAAGG TCGTGCTCCC CGACTTCAGC
AAGTGGGGCT CGATCGAGAA AGAGCAGATG CGCAGCATTC GGCGCAAAAC TGCTGAGCGT
CTCACGCAGG CCTGGACCAC CATCCCGCAC GTTACTCAGC ACGACCGCGC TGATATCACC
GAGCTCGAAA AGCTGCGCGA GAAGTTCGCG AAGCAAGCCG AAGCCGCGGG TGGCAAGCTC
ACAGTCACGG CCATCGCGCT CAAAGTCATT GCTGCCGCGA TGAAGAAGTT CCCCAAGTTC
AACGCGTCCA TCGATATCGA TCGCGAAGAA ATTATCTATA AGAAGTACGT GCACATCGGC
GTCGCCGTTG ACACTGAAGC CGGACTCCTC GTTCCCGTAC TTCGCAACGT GGACCAGAAA
AACGTCTATC AGATCGCGGC CGAGATGAAC GAACTCTCGA AGCGCGCGCG CGAACGCAAG
CTCAAGCCGG AAGAGATGGA AGGTGGCACC TTCACCATCA CCAACCTCGG TGGCATTGGC
GGCACATCAT TCACGCCGAT CGTGAACCTC CCCGAGGTTG CCATCCTGGG CCTCTCGCGC
GGACGCACTG AGCCCGTGTG GGTCAACGAT CACTTCGAGC CCCGGACGAT GCTCCCGCTC
TCGCTCAGCT ACGACCACCG CATCATTGAT GGTGCCGACG CCGCCCGCTA CCTTCGCTGG
GTCGCTGACG CGCTCGAACA ACCGGTGCTG CTGCTCCTGC AAGGTTGA
 
Protein sequence
MAQEFKLPEL GENIASGDLV RVMVKPGDTV KEGQPVIELE TDKAVIEVPS TVSGKVQEVK 
VQKGQKLKVG AIIFTYGDGA AAAPVQPAAP AKTEDKPKAE PKAEAPKQAA PSAAKPAAST
GTKQTIEFKL PELGENIKQG QLVRIIAKQG ASVSDGQPIL ELETDKAVIE VPATLTGTIK
EVHVKEGDKI GVGQTIFTVE TTEGNTQPPH PHTNTEGNTQ PPTGGGASSN TEGNTQPPHP
HSNTEGNPQP PTGGGGSSSA TAARDFELSG QQLARLQFEL ALRSEGKTER EAHPPDVRDL
GVRVSLTPLT PGRPTVAASP TVRRLAREIG VDIVQVKGTG PGGRISEGDV KLFAKQLIVR
LQHEAATAKA APKVVLPDFS KWGSIEKEQM RSIRRKTAER LTQAWTTIPH VTQHDRADIT
ELEKLREKFA KQAEAAGGKL TVTAIALKVI AAAMKKFPKF NASIDIDREE IIYKKYVHIG
VAVDTEAGLL VPVLRNVDQK NVYQIAAEMN ELSKRARERK LKPEEMEGGT FTITNLGGIG
GTSFTPIVNL PEVAILGLSR GRTEPVWVND HFEPRTMLPL SLSYDHRIID GADAARYLRW
VADALEQPVL LLLQG