Gene Caul_4606 details

Gene Information

Locus tag: Caul_4606
Symbol:
ID: 5902068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4982095
End bp: 4983843
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 67%
IMG OID: 641565125
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_001686224
Protein GI: 167648561
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
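
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(4982095, 4983843)
print(length)                     # 1749
print(protein_length_aa(length))  # 582
```

Both results agree with the Gene Length and Protein Length fields of this record.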


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.159853
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAATC TGAACGGTCG CGCGCGTCGG AAGAACACCT ATGACGCCAT CGTCGTGGGC 
AGCGGCATCA CTGGCGGCTG GGCGGCCAAG GAGCTGACCC AGAAGGGTCT CAAGACCCTG
GTGTTGGAGC GCGGCCCGAT GGTCCGCCAC ATCGAGGACT ATCCCACCGC CACCGTGGCT
CCCTGGGAGA CCAAGTACCC GCAAGGGCAA TTGCCGCAGG AAGAGCTGGC CGCTCACTAC
CCCGTGCAGC GCCGCACCAA CTATACGATG ACGGAGTACA CCAAGCACTT CTTCGTCCGT
GACGACCAGG ACCCCTATGT CGAGGAAAGC CGCTTCGACT GGATTCGCGG CTATCACGTC
GGCGGACGGT CGCTGACCTG GGGTCGCCAG AGCTATCGCC ACAGCCCCAT CGACTTCGAG
GCCAACGCCC GCGAGGGGAT CGGCGTCGAC TGGCCGATCC GCTACGAGGA GCTCGCGCCC
TGGTACGACC ATGTCGAGCG CTTCATCGGC GTATCGGGCC AGGCCGAGGG CCTGCCCCAC
CTGCCCGACG GCCACTATCA GCCGCCGATG GAGATGAACT GCGTCGAGAA GGCCTTCAAG
GCCAAGTCCG AGGCGCGATT CCCCGAGCGG CGGATCACCA TGGGGCGCAC CGCCCACCTG
ACCGAGCCGA CCGAGGAGCA ACTGAGTCTG GGTCGCACCA AGTGCCAGTA CCGCAACATG
TGCATGCGCG GCTGCCCGTT CGGCGGCTAT TACAGCTCCA ATTCCGGCGG GTTGGTGGCC
GCCGAGCGCA CCGGCAACAT GGTGATCCGG CCCGGCTCGA TCGTCACCTC GCTGATCTAT
GACGAGAAGG CCGGCAAGGC GACGGGGGTT CGCATCCTGG ACGCGACGAC CCGCCAGGAA
GAGGAGTTCT ACGCCGACGT CGTGTTCCTG TGCGCCTCGG CCTTCAACTC GGCCTGGATC
ATGATGAACT CGACCAGTTC GCGGTTCCCC AACGGCTTTG GCAACGGCAG CGACCAGCTC
GGCCGCAACG TCATGGACCA CCATCTGGGC GTGGGGGCGT CAGGCGAGGC GCCTGAATTC
GCGGACATGT ACTATTCCGG CCGTCGGCCC AATGGCATCT ACGTCCCGCG CTTCCGCAAT
CTGGGCGACG CCGCCACCAA GCGCAGCGAC TACCTGCGGG GCTTCGGCTA CCAGGGCGGC
GCCTCTCGCC AGGGTTGGGA CCGCGATCTT GGCAAGCACG ACGGCGAAGG CGGCGGCTTC
GGGGCGGCGC GCAAGGCCGC GCTCAGCCAG GCGGGACCCT GGACCATCGG CCTGGGCGGG
TTCGGCGAGA TCCTGCCCTA TCAGGACAAT CGCCTCACCC TGAACCACGA GACCCGGGAC
CAGTTTGGCC TGCCGACCCT GTCGATGAAC GTCACGATCC GCGAGAACGA GATCGCCATG
CGCCGCGACA TGCAGACGGC GGCGGCCGAG ATGCTGGAGG CGGCGGGCTT CAAGAACGTC
CAGGGCCGCG ACGGCGGCTA TTCGCCGGGG CTGGGCATCC ACGAGATGGG CACCGCCCGC
ATGGGTCGCG ATCCCAAGAC CTCGGTGCTG AACGCCCACA ACCAGGTCCA TGAGTGCAGG
AACGTCTATG TCACCGATGG GGCGGCGATG ACTTCGGCAT CGTGCGTGAA CCCGTCCCTT
ACCTATATGG CGCTGACGGC GCGGGCGGCC GACCACGCCG TCCAGGCGCG CAAGCGGGGA
GAGCTGTGA
 
Protein sequence
MANLNGRARR KNTYDAIVVG SGITGGWAAK ELTQKGLKTL VLERGPMVRH IEDYPTATVA 
PWETKYPQGQ LPQEELAAHY PVQRRTNYTM TEYTKHFFVR DDQDPYVEES RFDWIRGYHV
GGRSLTWGRQ SYRHSPIDFE ANAREGIGVD WPIRYEELAP WYDHVERFIG VSGQAEGLPH
LPDGHYQPPM EMNCVEKAFK AKSEARFPER RITMGRTAHL TEPTEEQLSL GRTKCQYRNM
CMRGCPFGGY YSSNSGGLVA AERTGNMVIR PGSIVTSLIY DEKAGKATGV RILDATTRQE
EEFYADVVFL CASAFNSAWI MMNSTSSRFP NGFGNGSDQL GRNVMDHHLG VGASGEAPEF
ADMYYSGRRP NGIYVPRFRN LGDAATKRSD YLRGFGYQGG ASRQGWDRDL GKHDGEGGGF
GAARKAALSQ AGPWTIGLGG FGEILPYQDN RLTLNHETRD QFGLPTLSMN VTIRENEIAM
RRDMQTAAAE MLEAAGFKNV QGRDGGYSPG LGIHEMGTAR MGRDPKTSVL NAHNQVHECR
NVYVTDGAAM TSASCVNPSL TYMALTARAA DHAVQARKRG EL