Gene Caul_3744 details

Gene Information

Locus tag: Caul_3744
Symbol:
ID: 5901206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4059489
End bp: 4061153
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 69%
IMG OID: 641564267
Product: choline dehydrogenase
Protein accession: YP_001685369
Protein GI: 167647706
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID: [TIGR01810] choline dehydrogenase
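
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (a common convention, not stated in the record) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 4059489
end_bp = 4061153
gene_length_bp = 1665
protein_length_aa = 554

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and adds no residue to the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 1665 0 554
```

Under those assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.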


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0339169
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGCA CGCGCTACGA TTACGTCATC ATCGGCGCCG GCTCGGCCGG CTGCGTCCTG 
GCGGCTCGGC TGACCGAGGA CGCCAACGTC AAGGTCCTGC TGCTGGAGGC CGGCGGCAAG
AACACCTCGA TCCTGGTCAA GATGCCCGCC GGGGTGGGCG AGTTGATCAA GGCCAAGGGC
GATCAGAACT GGGGCTTCTG GACCGAGGCC GAGCCGCACC TGAATGACCG CAAGCTGTGG
TGGCCGCGCG GCAAGGGCCT GGGCGGCAGC TCGGCCATCA ACGGCATGAT CTACATCCGC
GGCCACGCCC GCGACTATGA CCAGTGGCGG CAGATGGGGC TGTCGGGCTG GTCCTATGCC
GAGGTGCTGC CCTACTTCAA GCGTTCGGAG ACCCACCATG GCGGCGGCGA CGCCTATCAC
GGCGGGGCCG GACCGCTGCA CGTGTCGGGC GGCGAGAGCA AGAGCCCGTT CTACCCCGCC
CTGATCGAGG CCGGCCGCCA GGCGGGCCAT GCGACCACGA AGGATTTCAA CGGCTTCCGG
CAGGAAGGCT TTGGCCCCTA CGATCTGACG ATCCGCGACG GCAAGCGCTG GAGCGCGGCG
GCGGCCTATC TGACGGCGGC CCTGGCCCGT CCGAACCTGA CCTGCGTGAC CGAGGCCCGC
ACCACGCGGA TCCTGATCGA GAACGGCAAG GCGATCGGCG TGGAATATGT GGTCGGGACC
GATCCGGCGC GGCTGGTCGC CCATGCCGAC GCCGAGGTGC TGCTCAGCGC CGGCGCCGTG
CAGTCGCCGC ATATCCTGCA GCTGTCGGGC GTCGGCGATC CGGACGACCT GAAGGCCCAC
GGCATCGCCC CCGTGCACGA GGCCAAGGGC GTCGGCGCCA ACCTGCAGGA TCACCTGGAC
GTCTGCCTGT CGTGGACCAG CAAGAACCTG GTCACCGCCT ATTCGGCCAA CAAGGGCCTC
AAGAAGCTGG GCACGGGCCT GTCCTACATG CTGCTGGGCA AGGGCCTGGG TCGTCAGCAG
TTCCTGGAGA GCGGAGCCTT CCTGAAGTCG CGCCCCGATC TGGACCGCCC CGACCTGCAG
ATCCACGGCG TGCTGGCGAT CATGCAGGAC CACGGCAAGA CGATGATCGA GAAGGACGGC
TTCACCCTGC ACGTCTGCCA GCTTCGTCCC GAGAGTCGCG GAAAGGTCGG GTTGCGCTCG
GCCGACCCGT TCGACGACCC GACCATCCTG GGCAACTACC TGGCGACCGA CGAGGACCGG
CGCGCGATCC GCGAGGGGGT GCGCATCGGC CGCGACGTGG CCGCCCAGGC GGCGCTGGAT
CCCTATCGGG AGTCCGAATA CGCGCCAGGC GCCGACATCA AGACCGACGC CGAGATCGAC
GCCTGGGTCC GTGCCAAGGC CGAGACCATC TATCACCCGG TCGGCACCTG CCGCATGGGC
GCGGCGGGCG ACCCGCTGGC CGTGGTCGAT GACCAGCTGC GCGTACAGGG GATCGAAGGC
CTGCGGGTGA TCGACGCCTC GGTGATGCCC ACCCTGATCG GCGGCAACAC CAACGCCCCC
ACGATCATGA TCGCCGAACG GGCTTCCGAC CTGATCCGCG GCAAGGTCCT GCTGCCGCCG
GTCGAGGTTC CGGTGTTCGA GGACGGAAGG GCGGTCGCGG CTTAA
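
The gene sequence is printed in groups of 10 bases separated by spaces, so it needs to be cleaned before any computation. As a minimal sketch, the helpers below (`clean_sequence` and `gc_percent`, both hypothetical names) strip the whitespace and compute the percentage of G and C bases. Applied to the full sequence, this is one way the GC content field of a record like this could be recomputed; the example here uses only the first line.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (group spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used as a small example input.
first_line = "ATGGCCAGCA CGCGCTACGA TTACGTCATC ATCGGCGCCG GCTCGGCCGG CTGCGTCCTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```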
 
Protein sequence
MASTRYDYVI IGAGSAGCVL AARLTEDANV KVLLLEAGGK NTSILVKMPA GVGELIKAKG 
DQNWGFWTEA EPHLNDRKLW WPRGKGLGGS SAINGMIYIR GHARDYDQWR QMGLSGWSYA
EVLPYFKRSE THHGGGDAYH GGAGPLHVSG GESKSPFYPA LIEAGRQAGH ATTKDFNGFR
QEGFGPYDLT IRDGKRWSAA AAYLTAALAR PNLTCVTEAR TTRILIENGK AIGVEYVVGT
DPARLVAHAD AEVLLSAGAV QSPHILQLSG VGDPDDLKAH GIAPVHEAKG VGANLQDHLD
VCLSWTSKNL VTAYSANKGL KKLGTGLSYM LLGKGLGRQQ FLESGAFLKS RPDLDRPDLQ
IHGVLAIMQD HGKTMIEKDG FTLHVCQLRP ESRGKVGLRS ADPFDDPTIL GNYLATDEDR
RAIREGVRIG RDVAAQAALD PYRESEYAPG ADIKTDAEID AWVRAKAETI YHPVGTCRMG
AAGDPLAVVD DQLRVQGIEG LRVIDASVMP TLIGGNTNAP TIMIAERASD LIRGKVLLPP
VEVPVFEDGR AVAA
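
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand using only the standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted), which is sufficient here because this gene starts with ATG. The `translate` function is a hypothetical helper; it stops at the first stop codon and raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

# Codon table in the conventional TCAG order; "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence; matches the first 10 residues of the protein.
print(translate("ATGGCCAGCA CGCGCTACGA TTACGTCATC"))  # MASTRYDYVI
```

The same function applied to the last 15 bases of the gene, `GCGGTCGCGG CTTAA`, yields `AVAA`, which agrees with the end of the listed protein sequence.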