Gene Caul_1348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1348 
Symbol 
ID5898803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1431937 
End bp1433604 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content69% 
IMG OID641561835 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001682976 
Protein GI167645313 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCGA TCATCGAGGC CGACTACATC GTTGTGGGCG CCGGCTCGGC CGGTTGCGTG 
CTGGCGGCGA GGCTGTCGGA GGACGGCCGG TACAAGGTGC TGCTGCTGGA AGCCGGCGGC
GACGACCGGC CGACCAGGAA CCCCTCGCAG TTCCTGTCGA ACCTGATGAT CCACATCCCG
GTCGGCTACG CCCAGACCCT GAAGGACCCC AAGGTCAACT GGCTGTATGA GACCGAGCCG
GACCCTGGCA CCGGCGGGCG CTCGCACGTC TGGCCACGCG GCAAGGTGCT GGGCGGCTCA
TCGTCGATCA ACGCCATGCT CTATGTGCGC GGCCAGCGTG ACGACTATGA CGGCTGGCGG
CAGATGGGCA ATTCCGGCTG GGGCTGGGAC GATGTCCTGC CCTATTTCCG CAAGTCCCAG
AACCAGGAGC GCGGGGCCTG CGATCTGCAC GCCACCGGCG GGCCGCTCAA CGTCGCCGAC
ATGCGCGACG GCCACGCGGT GTCGCAGCTG TTGATCGACG CCTGCCACGA GGCCGGAATC
CCGCGCATCG TCGATCTGAA CGGCGAGCAG CAGGAGGGCG CGACCTGGTT CCAGGTCACC
CAGAAGAACG GCCAGCGCTG TTCGTCGGCC GTGGCCTATC TGCACCCGGC CATGGGCCGG
CCGAACCTGA GGGTCGAGAC CAACGCCCTG GCCCGCCGCG TGCTGTTCGA GGGCAAGCGC
GCGGTCGGGG TCGAGTTCAG CCAGAACGGC GTGGTCCGCA CCGCCAAGGC CCGGGCCGAG
GTGATCCTGG CCGGCGGGGC GGTCAACTCG CCCCAACTTC TCCAACTCTC CGGCGTCGGC
CCCGGCGCGC TGCTGGCCGA GCACGGGATC GCCGTGGTCC ACGACCTGCG GGGCGTCGGC
GAGAACCTTC AGGACCACTA TGTCACCGGC GCGCGCTACC GCCTGAAGGC CGGGACGGTG
TCGGTCAACG AGCAGAGCAA GGGCGCGCGA CTAGCCGGCG AGGCGCTGAA GTACCTGTTC
ACCCGCAAGG GCCTGCTGAC CCTGTCGGCG GCCCATGTCG CGGCTTTCTG CAAGTCGCGG
CCGGACCTGG CCAGCCCCGA CCTCCAGTTC CACATCCTGC CGGCCACCAT GGACCTGGCG
AAACTGTTCA ACGAGCAGAA GATGGAGCTG GAAAGCGCGC CCGGCCTGAC CATCGCGCCC
TGCCAACTGC GGCCCGAGAG CCGAGGTCAT ATCCGCATCA AGTCGGCCGA CCCTACCGCC
TATCCGGCGA TCTTCGCCAA CTACCTGTCC AATCCGCTGG ATCAGGAAGT CACGGTCGCG
GGCCTGCGCT GGGCCCGCAA GATCGCCGCC CAACCGTCCA TCGCGCCGCT GATCGACCAC
GAGATGAACC CCGGCCCGGG CTTCGAGAGT GACTTCATGC TGCTGGAATA TGCCCGCGCC
TCGGGCTCGA CGATCTATCA CCCGGTCGGC ACCTGCCAGA TGGGCGCCGG ACCGATGGCC
GTGGTCGACA GCGAGCTGCG GGTGCGCGGC GTCAGCGGCC TGCGGGTGGT CGACGCTTCG
ATCATGCCCT GCCTGGTCTC GGGCAACACC AACGCCCCGA CCATCATGAT CGCCGAAAAG
GGCGCGGACA TGATCCGCCA GGCCGCCAGG ACCGAGGTCG CCGCCTGA
 
Protein sequence
MTSIIEADYI VVGAGSAGCV LAARLSEDGR YKVLLLEAGG DDRPTRNPSQ FLSNLMIHIP 
VGYAQTLKDP KVNWLYETEP DPGTGGRSHV WPRGKVLGGS SSINAMLYVR GQRDDYDGWR
QMGNSGWGWD DVLPYFRKSQ NQERGACDLH ATGGPLNVAD MRDGHAVSQL LIDACHEAGI
PRIVDLNGEQ QEGATWFQVT QKNGQRCSSA VAYLHPAMGR PNLRVETNAL ARRVLFEGKR
AVGVEFSQNG VVRTAKARAE VILAGGAVNS PQLLQLSGVG PGALLAEHGI AVVHDLRGVG
ENLQDHYVTG ARYRLKAGTV SVNEQSKGAR LAGEALKYLF TRKGLLTLSA AHVAAFCKSR
PDLASPDLQF HILPATMDLA KLFNEQKMEL ESAPGLTIAP CQLRPESRGH IRIKSADPTA
YPAIFANYLS NPLDQEVTVA GLRWARKIAA QPSIAPLIDH EMNPGPGFES DFMLLEYARA
SGSTIYHPVG TCQMGAGPMA VVDSELRVRG VSGLRVVDAS IMPCLVSGNT NAPTIMIAEK
GADMIRQAAR TEVAA