Gene Caul_0419 details


Gene Information

Locus tag: Caul_0419
Symbol:
ID: 5897693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 458929
End bp: 460620
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 66%
IMG OID: 641560905
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_001682054
Protein GI: 167644391
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
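
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the record does not state either convention. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 458929
end_bp = 460620
gene_length_bp = 1692
protein_length_aa = 563


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under those assumptions both checks hold for this record: 460620 - 458929 + 1 = 1692 bp, and 1692 / 3 - 1 = 563 aa.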


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCC AGACCCAGTT CGACGTCATC GTCGTCGGCT CCGGCATCAC CGGCGGCATG 
GCGGCCAAGG AGCTGACCGA ACGCGGCCTG AAGGTTTTGA TGATCGAGCG CGGCCCGATG
ATCGAGCATG GCGCCGACTA CAAGACCGAG ATGACGCCGC CCTGGGAGCT GCCGTTTCGC
GGCTATGGCG ACCCACAGGT GCTGGCGAGC GACTACCCGG TGCAGAGCAA GGGGCGATAT
TTCGACGAGT GGACCTCGGC CCACTTCTGC AACGATCGCG AAAACCCCTA TCAGACCTCG
GCGGAGAACC CCTTTCAGTG GCGGCGCTCC TACAATTTGG GCGGACGTTC TCTGGTGTGG
GGCCGACAGA GCTTTCGCTG GAGCGCGCTG GACTTCGAGG CCAACAAGAA GGACGGACAC
GGCGTCGACT GGCCGATCCG ATACGAGGAC CTGGCCCCCT GGTACGACCA TGTCGAGACC
TTCATCGGCG TGCAGGGGTC AACCGAGCAC ATGCCCGCCT TGCCGGACGG CAAGTTCCAA
CCGCCGTTTG AGCTGAACGT GGTGGAAAAG GCCGTGGCCG CCAAGATCGC CGCCACCTAC
CCCGACCGTC GCCTGATCAT CTCGCGCTCG GCCCACCTCA CCCAGGAGAA GGAGGGCCGC
GGGGTCTGCC AATCGCGCAG CATCTGCGCG CGCGGCTGCT CCTACGGGGC CTATTTCAGC
ACCCAAAGCG CCAGCCTGCC GGCGGCTCAG GCCACTGGTC GCCTGACCCT GATCACCGAC
AGCCTAGTCG ACACCCTGGA CTATGACCCG GCCACGCGGC GGGTCACCGG GGTCAAGGTC
CTGGATCTCA AATCCAAGAC CAGCGCCACC TACACGGCCA AGGCGGTCTT CCTCTGCGCG
GGAAGCTTCA ACAGCGTGGC GCTGTTGTTG CGCTCGAAGT CCGCGGCCAT GCCGGCGGGC
CTGGCCAACG CCAGCGGCGT GCTTGGCCAG TACATCATGG ACCATGTCGG AGCGACCTCG
GCGGCGGTCG CCATTCCAGG CTTCGCCGAC AAGACCACGT TTGGCAATCG GCCCACGGGC
ACCATCGTTC CGCGCTTTCG AAACCTCTTA GCCCATGAGG ACACCGACTT CCTGCGCGGC
TACAGCTTCT TTGGCTCTTC CATGCAGCTC AGCTGGCGCT TTGGCGAATC CACGCCGGGC
CTGGGCACGG CGCTCAAAGA CCGCCTACAC GCGCCTGGCC AATGGGTCAT GGCGCTTAAC
GCCCACGGCG AACATCTGCC ACGGGCCGAA AACCGCATCA CGCTGGATCC CAACAGGGTG
GACGCCAACG GGCAGGCGCA GCTGCGCATC GATTTCGCCT ATGGCGACAA CGAAAAGAAG
ATGCTGCTCG ACGCCCAAAA GCAGGCCCTG GCCATGCTCG CCCCCATGGG CGGCAAGGTC
AGCCGCTCCT CGGCCGATCT GAACCAAGGC GGCGCGACCG TTCACGAGAT GGGCGGGGCG
CGCATGGGAC GTGACCCGAC CACCTCGGTG CTCAACGGCG AGAACCAGGC CCATGAGGTG
ATCAATCTCT TCGTCACGGA CGGCGCCTGC ATGAGCTCGA GCGCCAGCGT CAATCCCTCC
CTGACCTACA TGGCCCTGAC CGCCCGGGCC TGCGCCCGGG CGGCCAAACG GATCACCTCG
GGGGCGCTGT GA
 
Protein sequence
MSGQTQFDVI VVGSGITGGM AAKELTERGL KVLMIERGPM IEHGADYKTE MTPPWELPFR 
GYGDPQVLAS DYPVQSKGRY FDEWTSAHFC NDRENPYQTS AENPFQWRRS YNLGGRSLVW
GRQSFRWSAL DFEANKKDGH GVDWPIRYED LAPWYDHVET FIGVQGSTEH MPALPDGKFQ
PPFELNVVEK AVAAKIAATY PDRRLIISRS AHLTQEKEGR GVCQSRSICA RGCSYGAYFS
TQSASLPAAQ ATGRLTLITD SLVDTLDYDP ATRRVTGVKV LDLKSKTSAT YTAKAVFLCA
GSFNSVALLL RSKSAAMPAG LANASGVLGQ YIMDHVGATS AAVAIPGFAD KTTFGNRPTG
TIVPRFRNLL AHEDTDFLRG YSFFGSSMQL SWRFGESTPG LGTALKDRLH APGQWVMALN
AHGEHLPRAE NRITLDPNRV DANGQAQLRI DFAYGDNEKK MLLDAQKQAL AMLAPMGGKV
SRSSADLNQG GATVHEMGGA RMGRDPTTSV LNGENQAHEV INLFVTDGAC MSSSASVNPS
LTYMALTARA CARAAKRITS GAL
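
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the method used by the database: it builds the codon table by hand instead of using a bioinformatics library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which codons may act as start codons). Only the first and last lines of the gene sequence are translated here; `translate`, `first_line`, and `last_line` are names introduced for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Amino acid assignments are those shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence in this record. Each full line
# holds 60 bases (20 codons), so the last line also starts in frame.
first_line = "ATGAGCGGCC AGACCCAGTT CGACGTCATC GTCGTCGGCT CCGGCATCAC CGGCGGCATG"
last_line = "GGGGCGCTGT GA"

print(translate(first_line))
print(translate(last_line))
```

The first line translates to the first 20 residues of the protein sequence (MSGQTQFDVI VVGSGITGGM), and the last line gives the final three residues GAL followed by the stop codon, printed as `*`.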