Gene Information |
Locus tag | Caul_0419 |
Symbol | |
ID | 5897693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 458929 |
End bp | 460620 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641560905 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_001682054 |
Protein GI | 167644391 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
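The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the protein length excludes the terminal stop codon; the record does not state either convention, but both agree with the values listed. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 458929
END_BP = 460620
GENE_LENGTH_BP = 1692
PROTEIN_LENGTH_AA = 563


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 460620 - 458929 + 1 gives 1692 bp, and 1692 / 3 - 1 gives 563 aa, matching the Gene Length and Protein Length rows.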
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCC AGACCCAGTT CGACGTCATC GTCGTCGGCT CCGGCATCAC CGGCGGCATG GCGGCCAAGG AGCTGACCGA ACGCGGCCTG AAGGTTTTGA TGATCGAGCG CGGCCCGATG ATCGAGCATG GCGCCGACTA CAAGACCGAG ATGACGCCGC CCTGGGAGCT GCCGTTTCGC GGCTATGGCG ACCCACAGGT GCTGGCGAGC GACTACCCGG TGCAGAGCAA GGGGCGATAT TTCGACGAGT GGACCTCGGC CCACTTCTGC AACGATCGCG AAAACCCCTA TCAGACCTCG GCGGAGAACC CCTTTCAGTG GCGGCGCTCC TACAATTTGG GCGGACGTTC TCTGGTGTGG GGCCGACAGA GCTTTCGCTG GAGCGCGCTG GACTTCGAGG CCAACAAGAA GGACGGACAC GGCGTCGACT GGCCGATCCG ATACGAGGAC CTGGCCCCCT GGTACGACCA TGTCGAGACC TTCATCGGCG TGCAGGGGTC AACCGAGCAC ATGCCCGCCT TGCCGGACGG CAAGTTCCAA CCGCCGTTTG AGCTGAACGT GGTGGAAAAG GCCGTGGCCG CCAAGATCGC CGCCACCTAC CCCGACCGTC GCCTGATCAT CTCGCGCTCG GCCCACCTCA CCCAGGAGAA GGAGGGCCGC GGGGTCTGCC AATCGCGCAG CATCTGCGCG CGCGGCTGCT CCTACGGGGC CTATTTCAGC ACCCAAAGCG CCAGCCTGCC GGCGGCTCAG GCCACTGGTC GCCTGACCCT GATCACCGAC AGCCTAGTCG ACACCCTGGA CTATGACCCG GCCACGCGGC GGGTCACCGG GGTCAAGGTC CTGGATCTCA AATCCAAGAC CAGCGCCACC TACACGGCCA AGGCGGTCTT CCTCTGCGCG GGAAGCTTCA ACAGCGTGGC GCTGTTGTTG CGCTCGAAGT CCGCGGCCAT GCCGGCGGGC CTGGCCAACG CCAGCGGCGT GCTTGGCCAG TACATCATGG ACCATGTCGG AGCGACCTCG GCGGCGGTCG CCATTCCAGG CTTCGCCGAC AAGACCACGT TTGGCAATCG GCCCACGGGC ACCATCGTTC CGCGCTTTCG AAACCTCTTA GCCCATGAGG ACACCGACTT CCTGCGCGGC TACAGCTTCT TTGGCTCTTC CATGCAGCTC AGCTGGCGCT TTGGCGAATC CACGCCGGGC CTGGGCACGG CGCTCAAAGA CCGCCTACAC GCGCCTGGCC AATGGGTCAT GGCGCTTAAC GCCCACGGCG AACATCTGCC ACGGGCCGAA AACCGCATCA CGCTGGATCC CAACAGGGTG GACGCCAACG GGCAGGCGCA GCTGCGCATC GATTTCGCCT ATGGCGACAA CGAAAAGAAG ATGCTGCTCG ACGCCCAAAA GCAGGCCCTG GCCATGCTCG CCCCCATGGG CGGCAAGGTC AGCCGCTCCT CGGCCGATCT GAACCAAGGC GGCGCGACCG TTCACGAGAT GGGCGGGGCG CGCATGGGAC GTGACCCGAC CACCTCGGTG CTCAACGGCG AGAACCAGGC CCATGAGGTG ATCAATCTCT TCGTCACGGA CGGCGCCTGC ATGAGCTCGA GCGCCAGCGT CAATCCCTCC CTGACCTACA TGGCCCTGAC CGCCCGGGCC TGCGCCCGGG CGGCCAAACG GATCACCTCG GGGGCGCTGT GA
|
Protein sequence | MSGQTQFDVI VVGSGITGGM AAKELTERGL KVLMIERGPM IEHGADYKTE MTPPWELPFR GYGDPQVLAS DYPVQSKGRY FDEWTSAHFC NDRENPYQTS AENPFQWRRS YNLGGRSLVW GRQSFRWSAL DFEANKKDGH GVDWPIRYED LAPWYDHVET FIGVQGSTEH MPALPDGKFQ PPFELNVVEK AVAAKIAATY PDRRLIISRS AHLTQEKEGR GVCQSRSICA RGCSYGAYFS TQSASLPAAQ ATGRLTLITD SLVDTLDYDP ATRRVTGVKV LDLKSKTSAT YTAKAVFLCA GSFNSVALLL RSKSAAMPAG LANASGVLGQ YIMDHVGATS AAVAIPGFAD KTTFGNRPTG TIVPRFRNLL AHEDTDFLRG YSFFGSSMQL SWRFGESTPG LGTALKDRLH APGQWVMALN AHGEHLPRAE NRITLDPNRV DANGQAQLRI DFAYGDNEKK MLLDAQKQAL AMLAPMGGKV SRSSADLNQG GATVHEMGGA RMGRDPTTSV LNGENQAHEV INLFVTDGAC MSSSASVNPS LTYMALTARA CARAAKRITS GAL
|
| |
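The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA even though the gene is on the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only short fragments of the sequence are used here.

```python
# Build the codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Drop the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence as listed in the record.
    print(translate("ATGAGCGGCC AGACCCAGTT CGACGTCATC"))
```

Translating the first three 10-base blocks this way gives MSGQTQFDVI, the first block of the listed protein sequence. Applying `gc_fraction` to the full gene sequence would be the way to compare against the GC content row.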