Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1872 |
Symbol | |
ID | 5899327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2007778 |
End bp | 2009052 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641562362 |
Product | dehydrogenase catalytic domain-containing protein |
Protein accession | YP_001683499 |
Protein GI | 167645836 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCTCT ATCAGTTCCG GCTGCCCGAT ATCGGCGAAG GCGTCGCCGA GGCCGAAATC GTCGCCCTGC TGGTCAAGGT CGGCGATGTG GTCGAGGAAG ACCAGAACCT GGCCGAGGTG ATGACGGACA AGGCCACGGT TGAACTCAGC TCGCCCGTCG CCGGCGTCGT GACGGCCGTC CATGGTGAGA TCGGCGGCAT GATGCCGGTC GGCGCCGTGC TGATCGAATT CGAGAGCGAA GCGGGCGACG ATCGGGCTGT CGCCGCTCCG GCGTCGCCCC CTTCGGCCAC GCCGGCTCCC GCTACGGCGG CCACGCCTCG GAGTTCGGCG CCCGCCCCAA CCGTCTCAAC TGCGCCGCCC CCCGCGCCGG CGTCCCGCCG GGCCGCCTCG TCCGGCCGCC CGGCGGGAGA GGCGCCGCTC GCTGCTCCCT CGACCCGTCG GCGCGCCCTC GATCTGGGGG TTTCTCTGGT TCAGGTGCCC GGCACCGGTC CCGGCGGGCG GATCATGCCG GCAGATCTCG ACGCTTTCCT CGCCTCCGAT GGACAGAACG CGGGCGGTTC GGGCCTCGTC GCCCGCACAG GGGTCCATGA CACGCGCATC ATCGGATTAC GGCGCAAGAT CGCCGAGAAG ATGCAGGAGG CCAAGCGCCG CATCCCGCAC ATCAACTATG TCGAGGAATG CGATCTGACA GAGCTGGAAG CGCTGCGGCT CGACCTCAAC GAGCACCGCG CCGACGATCA GCCCAAGCTG ACGCTGTTGC CGTTCATCAT GCGGGCGATG GTCAAGGCCC TGCCGGACTT CCCGCAGATC AACGCCCACT ATGACGACGA CAACGGCGTG CTGCACGCGC ACGAAGGCGT CCACATCGGC ATCGCCACCC AGACGCCCAA CGGCCTGATC GTCCCGGTCG TGCGTCACGC CGAAGCGCGA GACATCTGGG ATTGCGCCCG CGAGGTCGCG CGGTTGGCCA AGGCCGTGCG CGACGGCTCG GCGGCCCGGG ACGAACTGTC CGGTTCGACC ATCACCCTGA CCAGCATGGG CCCCCTGGGG GGCATCGTCT CGACGCCGGT GATCAACCAT CCCGAGGTCG CCATCCTCAA TCCCAACAAG CTGGTGGACC GGCCGATGGT CCAGGGATCG TTCATCACCG TCCGCAAGAT GATGAACCTG TCCTCGGCCT TCGATCACCG CATCGTCGAC GGTTACGACG CCGCTCTGTT CGTCCAGCGC GTCAAGCGGC TGCTCGAGCA CCCCGCCCTG ATCTTCATGG ATTGA
|
Protein sequence | MGLYQFRLPD IGEGVAEAEI VALLVKVGDV VEEDQNLAEV MTDKATVELS SPVAGVVTAV HGEIGGMMPV GAVLIEFESE AGDDRAVAAP ASPPSATPAP ATAATPRSSA PAPTVSTAPP PAPASRRAAS SGRPAGEAPL AAPSTRRRAL DLGVSLVQVP GTGPGGRIMP ADLDAFLASD GQNAGGSGLV ARTGVHDTRI IGLRRKIAEK MQEAKRRIPH INYVEECDLT ELEALRLDLN EHRADDQPKL TLLPFIMRAM VKALPDFPQI NAHYDDDNGV LHAHEGVHIG IATQTPNGLI VPVVRHAEAR DIWDCAREVA RLAKAVRDGS AARDELSGST ITLTSMGPLG GIVSTPVINH PEVAILNPNK LVDRPMVQGS FITVRKMMNL SSAFDHRIVD GYDAALFVQR VKRLLEHPAL IFMD
|
| |