Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1409 |
Symbol | |
ID | 5898864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1498790 |
End bp | 1500286 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641561896 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_001683037 |
Protein GI | 167645374 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.671216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGACCA TCAGCCATTT CGTGAACGGA CAAACCTTTG AAGGGGCGTC GGGTCGCTTT GGCGACGTGT TCAATCCCAA CACCGGCGAG GTCCAGGCCC GCGTCCAGTT GGCCACCGAC GCCGAGCTCG ACGCCGCCGT ACAGGCCGCC GCCGCCGCCC AGATCGGCTG GGCCGCCACC AACCCGCAGC GCCGCGCCCG GGTGATGTTC GAGTTCAAGC GCCTGATCGA GCGCGACATG AACAGCCTAG CCGAGATCCT GTCGTCCGAG CACGGCAAGG TGGTCGCCGA CAGCAAGGGC GACATCCAGC GCGGCCTGGA GGTGATCGAG TTCGCCTGCG GCATCCCCCA CATCCTGAAG GGCGAATATA CCGAGGGCGC GGGCCCCGGC ATCGACGTCT ATTCAATGCG CCAGCCGCTG GGCGTCTGCG CCGGCATCAC CCCGTTCAAC TTCCCGGCCA TGATCCCGAT GTGGATGTTC GGCATCAGCA TCGCCGTGGG CAACACCTTC ATCCTCAAGC CGTCGGAGAA GGATCCGACG GTGCCGGTCA AGCTGGCCGA GCTGATGATG GAAGCCGGGG CTCCGGCCGG CGTGCTGAAC GTGGTGCACG GCGACAAGGT CTGCGTCGAC GCGATCCTGA CCCATCCGCT GATCCGCGCC GTCAGCTTCG TCGGTTCGTC GGACATCGCC CACTACGTCT ACCAGACCGG CACGGCGCAC GGTAAACGTG TCCAGGCCAT GGGCGGCGCC AAGAACCACG GCATTGTCCT GCCCGACGCC GACCTCGACC AGGTGGTCAA GGACTTGTCG GGCGCGGCCT TTGGTTCGGC GGGCGAGCGC TGCATGGCCC TGCCGGTGGT GGTTCCGGTC GGCCAGAAGA CCGCTGACGA ACTGCGCGAA CGGATGGTCG CCGAGATCGA GACGCTGCGG GTCGGCGTCT CCAGCGACCC GGCCGCCCAC TACGGCCCGG TGGTCAGCGC CCAGCACCGC GCCAAGATCG CGGACTACAT CCGTCTTGGC GTTGAAGAGG GCGCGGACTT GGTGGTCGAT GGCCGCGACT TTTCCATGCA GGGCTTCGAG AAGGGCTTCT TCATCGGCCC GTCGCTGTTC GACGGCGTCA AGAAGGGCAT GAAGACCTAT CAGGAAGAGA TCTTCGGACC GGTGTTGCAG ATCGTCCGCG CCGAGACCTT CGAAGAAGCC TTGGCCCTGC CGTCCGAGCA TCAGTACGGC AACGGCGTGG CGATCTTCAC CCGCAACGGC CGGGCGGCGC GCGAGTTCGC CAGCCGCGTC AATGTCGGCA TGGTCGGCAT CAACGTGCCG ATCCCGGTGC CGGTGGCCTA CCACACCTTC GGCGGCTGGA AGCGCAGCGC CTTTGGCGAC ACCAACCAGC ACGGCGTCGA GGGCGTGAAA TTCTACACCA AGGTCAAGAC GATCACCGCG CGGTGGCCCG AGGGCGACCA CGAGGGCGAC GCCTTCGTCA TTCCGACGAT GAAATAG
|
Protein sequence | MRTISHFVNG QTFEGASGRF GDVFNPNTGE VQARVQLATD AELDAAVQAA AAAQIGWAAT NPQRRARVMF EFKRLIERDM NSLAEILSSE HGKVVADSKG DIQRGLEVIE FACGIPHILK GEYTEGAGPG IDVYSMRQPL GVCAGITPFN FPAMIPMWMF GISIAVGNTF ILKPSEKDPT VPVKLAELMM EAGAPAGVLN VVHGDKVCVD AILTHPLIRA VSFVGSSDIA HYVYQTGTAH GKRVQAMGGA KNHGIVLPDA DLDQVVKDLS GAAFGSAGER CMALPVVVPV GQKTADELRE RMVAEIETLR VGVSSDPAAH YGPVVSAQHR AKIADYIRLG VEEGADLVVD GRDFSMQGFE KGFFIGPSLF DGVKKGMKTY QEEIFGPVLQ IVRAETFEEA LALPSEHQYG NGVAIFTRNG RAAREFASRV NVGMVGINVP IPVPVAYHTF GGWKRSAFGD TNQHGVEGVK FYTKVKTITA RWPEGDHEGD AFVIPTMK
|
| |