Gene Information |
Locus tag | Caul_3245 |
Symbol | |
ID | 5900700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3507830 |
End bp | 3509455 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641563750 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_001684870 |
Protein GI | 167647207 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
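The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the record does not state either convention explicitly. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
start_bp, end_bp = 3507830, 3509455

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1626, matches "Gene Length"
print(protein_length_aa(length))  # 541, matches "Protein Length"
```

Under these assumptions the three fields agree with one another: 1626 bp is 542 codons, and dropping the stop codon leaves 541 residues.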
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCGA GCTTCGACTA CATCGTCGTC GGCGGCGGCT CGGCGGGCAG CGTGGTGGCC GCCCGCCTAA GCGAAAGATC GGATCTGCAA ATCCTGCTGC TGGAGGCGGG CGGACGCGAT CGCGGCCTGC TTCTGCAAAT GCCTCTGGCC TTCCGCCTGC TCCGGGCCAA GATGTTGTTC GACTGGGGCC TGTCCTCCGA ACCGGAGCCT TACGCCAATG ACCGCAGCAT CCCGGCCGCC CGAGGCCGGG TCCTGGGAGG CAGTTCGTCG GTCAACGGCA TGATGTATTC GCGCGGCCAC CCGCGCGACT ACGACCAATG GGCGCAGATG GGAGCCCAGG GCTGGTCGTT CGAGGAGGTC CTGCCCTATT TCAGGCGATC CGAGGACAAC TGGCGCGGGG CGTCCCACTG GCACGGCGCC GGCGGGCCGC TGTCGGTCTC GCCCATGTCG CACGACGACC CTCTTGTGCG GGCCATCGAG GCGACGGCCC GGGGATTGGG TTATCCCGTC ACCGATGACT TCGAGGGAGA GCAGCCCGAG GGTTTCGGCC TGCCGGACCT GACCGTTCGC AACGGGCGGC GCGCCAGCGC CTCGCAAGCC TATCTGCACC CGGCCCGGCG CCGAACAAAC CTGACGGTCG TGACGTCCGC CCACGTTCGA CGGGTGTTGA TCGAAGGCGG CCGAGCGGTC GGCGTCGTCT ACCAGGTCGA TGGCCGGGAG CGGACGGCGC GCTGCGACCG GGAGGTAGTG CTATGCGGCG GCGCCTATGC CTCGCCCCAA CTCCTGATGC TGTCGGGCGT GGGGCCAGCC GACCACCTGC GCGATCACGG CATCGACGTT CTGGCCGACC TTCCGCAGGT CGGCCGAAAC CTCCAGGAAC ACCCGCTGAC GCCGATGGGC TTTCGCGGCA AGAAGCCGTT CGACTTCGGC GGCCAGCTTC GCGCCGACAA AGTGGCCCTG GCCGCAGCGC GCTGGCGCCT GACGGGCCAG GGCTTGATGG CCACCCAACC CCTGACCTCC ATCGCCTTCC ACAAATCCAG GCCGGGACTG GAGCGACCGG ACATCGAGAC CATGTTCATG CCCACCAGCC TGGACGCCAA GGTCTGGTTC CCCGGCGCGC GCAAACGGGC CGACGACATG CTGACCGTCC TCAATGTCGC CTTGCGGCCC AGCAGCCGCG GGGCGGTGAC GCTGCGTTCC GCCGATCCCA TGGCCAAGCC GAAGATCCTG TTCAACCTCT TGTCGGATCC CGACGACATG GCGCTTCTGC GCCACAGCCT GCGCTGGACT CGCGAGCTCC TGCGCCAGGG GCCGATCGCC GACTATGTGG GCGAGGAAGT CTTCCCGGGG CCGGCCCTGC AAAGCGACGC TCAGCTCGAC GCCTTCACTC GGGCCTCCAG CGTCACCGCC CAGCACCCGG TCGGCACGTG CCGCATGGGC CAGGACGCCG GCGCCGTGGT CGATCCGCGT CTGCGGGTGA GGGGCCTGCA AGGCCTGCGG GTCGCCGACG CCTCGGTGAT GCCGACCCTG ATCGGCGGCC ACACCAATGC GCCGGCGATC ATGATCGGCG AGCGCGCCGC GGCGATGATG CTGGAGGACG CCCAGGGCGC GCCGCCCAGG GCCTAG
|
Protein sequence | MASSFDYIVV GGGSAGSVVA ARLSERSDLQ ILLLEAGGRD RGLLLQMPLA FRLLRAKMLF DWGLSSEPEP YANDRSIPAA RGRVLGGSSS VNGMMYSRGH PRDYDQWAQM GAQGWSFEEV LPYFRRSEDN WRGASHWHGA GGPLSVSPMS HDDPLVRAIE ATARGLGYPV TDDFEGEQPE GFGLPDLTVR NGRRASASQA YLHPARRRTN LTVVTSAHVR RVLIEGGRAV GVVYQVDGRE RTARCDREVV LCGGAYASPQ LLMLSGVGPA DHLRDHGIDV LADLPQVGRN LQEHPLTPMG FRGKKPFDFG GQLRADKVAL AAARWRLTGQ GLMATQPLTS IAFHKSRPGL ERPDIETMFM PTSLDAKVWF PGARKRADDM LTVLNVALRP SSRGAVTLRS ADPMAKPKIL FNLLSDPDDM ALLRHSLRWT RELLRQGPIA DYVGEEVFPG PALQSDAQLD AFTRASSVTA QHPVGTCRMG QDAGAVVDPR LRVRGLQGLR VADASVMPTL IGGHTNAPAI MIGERAAAMM LEDAQGAPPR A
|
| |
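The sequence fields are printed in space-separated blocks of ten characters. A minimal sketch of how they might be cleaned and checked against the other fields follows. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the strand is "-"), and it uses the NCBI codon assignments, which for amino acids are the same in translation table 11 as in the standard table. All function names are hypothetical, and only the first 30 bases of the record are used as sample input.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace that separates the ten-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = clean_sequence("ATGGCTTCGA GCTTCGACTA CATCGTCGTC")
print(translate(prefix))  # MASSFDYIVV, the first block of the protein sequence
print(f"{gc_fraction(prefix):.0%}")  # GC content of this 30-base prefix only
```

Running `gc_fraction` on the full cleaned gene sequence gives a value to compare with the GC content field, and `translate` on the full sequence gives one to compare with the protein sequence field.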