Gene Caul_3245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3245 
Symbol 
ID5900700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3507830 
End bp3509455 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content69% 
IMG OID641563750 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001684870 
Protein GI167647207 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCGA GCTTCGACTA CATCGTCGTC GGCGGCGGCT CGGCGGGCAG CGTGGTGGCC 
GCCCGCCTAA GCGAAAGATC GGATCTGCAA ATCCTGCTGC TGGAGGCGGG CGGACGCGAT
CGCGGCCTGC TTCTGCAAAT GCCTCTGGCC TTCCGCCTGC TCCGGGCCAA GATGTTGTTC
GACTGGGGCC TGTCCTCCGA ACCGGAGCCT TACGCCAATG ACCGCAGCAT CCCGGCCGCC
CGAGGCCGGG TCCTGGGAGG CAGTTCGTCG GTCAACGGCA TGATGTATTC GCGCGGCCAC
CCGCGCGACT ACGACCAATG GGCGCAGATG GGAGCCCAGG GCTGGTCGTT CGAGGAGGTC
CTGCCCTATT TCAGGCGATC CGAGGACAAC TGGCGCGGGG CGTCCCACTG GCACGGCGCC
GGCGGGCCGC TGTCGGTCTC GCCCATGTCG CACGACGACC CTCTTGTGCG GGCCATCGAG
GCGACGGCCC GGGGATTGGG TTATCCCGTC ACCGATGACT TCGAGGGAGA GCAGCCCGAG
GGTTTCGGCC TGCCGGACCT GACCGTTCGC AACGGGCGGC GCGCCAGCGC CTCGCAAGCC
TATCTGCACC CGGCCCGGCG CCGAACAAAC CTGACGGTCG TGACGTCCGC CCACGTTCGA
CGGGTGTTGA TCGAAGGCGG CCGAGCGGTC GGCGTCGTCT ACCAGGTCGA TGGCCGGGAG
CGGACGGCGC GCTGCGACCG GGAGGTAGTG CTATGCGGCG GCGCCTATGC CTCGCCCCAA
CTCCTGATGC TGTCGGGCGT GGGGCCAGCC GACCACCTGC GCGATCACGG CATCGACGTT
CTGGCCGACC TTCCGCAGGT CGGCCGAAAC CTCCAGGAAC ACCCGCTGAC GCCGATGGGC
TTTCGCGGCA AGAAGCCGTT CGACTTCGGC GGCCAGCTTC GCGCCGACAA AGTGGCCCTG
GCCGCAGCGC GCTGGCGCCT GACGGGCCAG GGCTTGATGG CCACCCAACC CCTGACCTCC
ATCGCCTTCC ACAAATCCAG GCCGGGACTG GAGCGACCGG ACATCGAGAC CATGTTCATG
CCCACCAGCC TGGACGCCAA GGTCTGGTTC CCCGGCGCGC GCAAACGGGC CGACGACATG
CTGACCGTCC TCAATGTCGC CTTGCGGCCC AGCAGCCGCG GGGCGGTGAC GCTGCGTTCC
GCCGATCCCA TGGCCAAGCC GAAGATCCTG TTCAACCTCT TGTCGGATCC CGACGACATG
GCGCTTCTGC GCCACAGCCT GCGCTGGACT CGCGAGCTCC TGCGCCAGGG GCCGATCGCC
GACTATGTGG GCGAGGAAGT CTTCCCGGGG CCGGCCCTGC AAAGCGACGC TCAGCTCGAC
GCCTTCACTC GGGCCTCCAG CGTCACCGCC CAGCACCCGG TCGGCACGTG CCGCATGGGC
CAGGACGCCG GCGCCGTGGT CGATCCGCGT CTGCGGGTGA GGGGCCTGCA AGGCCTGCGG
GTCGCCGACG CCTCGGTGAT GCCGACCCTG ATCGGCGGCC ACACCAATGC GCCGGCGATC
ATGATCGGCG AGCGCGCCGC GGCGATGATG CTGGAGGACG CCCAGGGCGC GCCGCCCAGG
GCCTAG
 
Protein sequence
MASSFDYIVV GGGSAGSVVA ARLSERSDLQ ILLLEAGGRD RGLLLQMPLA FRLLRAKMLF 
DWGLSSEPEP YANDRSIPAA RGRVLGGSSS VNGMMYSRGH PRDYDQWAQM GAQGWSFEEV
LPYFRRSEDN WRGASHWHGA GGPLSVSPMS HDDPLVRAIE ATARGLGYPV TDDFEGEQPE
GFGLPDLTVR NGRRASASQA YLHPARRRTN LTVVTSAHVR RVLIEGGRAV GVVYQVDGRE
RTARCDREVV LCGGAYASPQ LLMLSGVGPA DHLRDHGIDV LADLPQVGRN LQEHPLTPMG
FRGKKPFDFG GQLRADKVAL AAARWRLTGQ GLMATQPLTS IAFHKSRPGL ERPDIETMFM
PTSLDAKVWF PGARKRADDM LTVLNVALRP SSRGAVTLRS ADPMAKPKIL FNLLSDPDDM
ALLRHSLRWT RELLRQGPIA DYVGEEVFPG PALQSDAQLD AFTRASSVTA QHPVGTCRMG
QDAGAVVDPR LRVRGLQGLR VADASVMPTL IGGHTNAPAI MIGERAAAMM LEDAQGAPPR
A