Gene Cagg_2097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2097 
Symbol 
ID7267604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2571512 
End bp2572846 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content60% 
IMG OID643566931 
Productcatalytic domain of components of various dehydrogenase complexes 
Protein accessionYP_002463420 
Protein GI219848987 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACA TCAAAATGCC CCAACTCGGC GAGAGCGTTA CCGAAGGTAC TGTTGGCCGC 
TGGCTGAAAC GGCCCGGCGA ACCGGTCGCG AAATACGAAC CGTTGTTGGA AGTAGTTACC
GATAAGGTCG ATACGGAGGT ACCGGCGCCT GAAGCTGGTG TTCTCCACGA GATTTTGGTT
CCCGAAGGTG AGACGGTACG GGTGGGAACG GTGATCGCAC GGCTCGCCCC GGCAGGAGCT
GCGGTGAGCA CGCCAACACC GGTGGCTGCG ACGAGTGCTG TAGCAGTCTC CACGACCAGT
GCCTCGGCAA CCACCACGAC AACGGTGGCT CCACCGGCCA GTGATGGGCG TAATACCTAT
CTCTCGCCGG TCGTAGCGCG GTTGCTCGCC GAACACAACC TTGATCCGGC GCAGATTCGC
GGTACTGGAC AAGGCGGGCG TATTACCAAA CAAGATGTTA TGCGCTTCCT CGCCGAACGT
GAACGCCAGG CGGTGAACGC GCCGGCTCCC ACACCCGCGC CGGTCGCTGC TCCTACCCCG
GCTCCCACAC CCGCGCCGGT CGCTGCTCCT ACCCCGATGC CTACACCCGC GCCGGTCGCT
GCTCCTTCGC CGACGCCCGC GCCCACACCG GTTGAGATTC CTGCTGACGC TGAATTGGTG
CCATTGACGC CAATGCGGCG CAGCATTGCC GAGCATATGG CTCGCTCGGT ACGCACCTCG
CCGCACGTCA CAACTGTGAT GGAGGCAGAT TTGAGCCGCG TGCTCGCCCA TCGAGCCGCT
CATCAAGAAG CCTTTAATCG GCAAGGTGTG CGTCTCACGC TTACCCCATA TTTCATCATT
GCCGCTATTG CCGGACTACA AGCCGTACCG GTATTCAACG GCAGCTTCAC CGAGCAAGGT
ATCATCCTAC ATCGCCGTAT TAATGTGGGG ATTGCAGTTG CGCTCAACGA AGGTCTATTG
GTACCGGTCA TTCCTGATGC TGACGAGAAG AATCTGCTCG GCCTGGCCCG TGCCGTCAAC
GATCTCGCCG AACGGGCACG TACCAGGCGT TTGCGCCCAG AAGAAACACA AGGTGGTACG
TTCACCATCA CGAACCACGG GGTGACCGGT AGCCTGTTTG CAACGCCGAT CATCAATCAG
CCGCAGGCCG GTATTCTCGG TATCGGTGCA GTGGTCAAAC GACCGGTCGT TATTTCCCAA
AATGGGCTTG ATGCCATCGC CATTCGTCCG CTCTGCTATC TCTCGTTTAC CTTCGATCAC
CGGATTGCCG ATGGCGCCAC TGCCGACCGA TTCCTCGCTA CCGTTAAACA ACGGCTCGAA
CAGTGGGAGA GTTAA
 
Protein sequence
MIDIKMPQLG ESVTEGTVGR WLKRPGEPVA KYEPLLEVVT DKVDTEVPAP EAGVLHEILV 
PEGETVRVGT VIARLAPAGA AVSTPTPVAA TSAVAVSTTS ASATTTTTVA PPASDGRNTY
LSPVVARLLA EHNLDPAQIR GTGQGGRITK QDVMRFLAER ERQAVNAPAP TPAPVAAPTP
APTPAPVAAP TPMPTPAPVA APSPTPAPTP VEIPADAELV PLTPMRRSIA EHMARSVRTS
PHVTTVMEAD LSRVLAHRAA HQEAFNRQGV RLTLTPYFII AAIAGLQAVP VFNGSFTEQG
IILHRRINVG IAVALNEGLL VPVIPDADEK NLLGLARAVN DLAERARTRR LRPEETQGGT
FTITNHGVTG SLFATPIINQ PQAGILGIGA VVKRPVVISQ NGLDAIAIRP LCYLSFTFDH
RIADGATADR FLATVKQRLE QWES