Gene Rsph17029_0856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0856 
Symbol 
ID4897811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp873871 
End bp875517 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content70% 
IMG OID640111441 
Productcholine dehydrogenase 
Protein accessionYP_001042739 
Protein GI126461625 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGATT ACATCATCGT CGGCGCCGGC TCGGCCGGCT GTGCCATGGC CTATCGCCTC 
GGCGAGGCGG GCCGCTCGGT GCTGGTCGTC GAGCATGGCG GCACGGATGC CGGGCCCTTC
ATCCAGATGC CCGCGGCGCT CTCCTATCCG ATGAACATGG GGATCTACGA CTGGGGCCTG
AAGACCGAGC CCGAGCCGCA TCTGGACGGG CGCGTGCTCG CCACGCCGCG CGGCAAGGTG
ATCGGCGGCT CCTCCTCGAT CAACGGCATG GTCTATGTCC GCGGTCACGC GCGCGATTTC
GACCATTGGG CCGAGAGCGG GGCCACCGGC TGGGGCTTTG CCGATGTGCT GCCCTATTTC
AAGCGGATGG AGAACTGGCA CGTCCCCGGC GATGTCGAGT GGCGCGGCCA TGACGGGCCG
CTGCACGTCA CGCGCGGCCC GCGCTCCAAT CCCCTGTTCA ACGCCTTCAT CGAGGCGGGG
CGGCAGGCGG GCTATCCGGT GACCGACGAT TACAACGGCG CGGCGCAGGA GGGCTTCGGC
CCGATGGAGG CCACGATCTG GCAGGGCCGG CGCTGGTCGG CCGCCAATGC CTATCTCAGG
CCCGCGATGA AGCGGTTCGG GGTCCAGCTC ACCCGTGCCC TCGCGCTGAA GGTGGTGATC
GAGGAGGGCC GTGCCGTCGG CGTCGAGGTG CAGCGCCGCG GCGGCCGCGA GGTGATCCGG
GCCGGCCGCG AGGTGATCCT CGCCGCCTCC TCGCTCAACA CGCCGAAGCT TCTGATGCTG
TCGGGCATCG GGTCCGCCGC GCATCTGGCC GAGCACGGGA TCCCGGTCGT GGCCGACCGG
CCGGGCGTGG GCCGGAACCT CCAGGACCAT CTGGAGGTCT ACATGCAGTT CGCGAGCCTC
CAGCCGGTCA CGCTCTTCAA GCACTGGAAC CTGCGCGGCA AGGTCAGCAT CGGCGCGCAG
TGGCTGTTTA CCGGGCGCGG GCTCGGGGCC TCGAACCAGT TCGAGGCCTG CGCCTTCATC
CGCTCGAAGC CGGGCGTGGA TTATCCCGAC ATCCAGTATC ACTTCCTGCC CATTGCCGTG
CGCTACGACG GCAAGGCCGC CGCCGAGGGC CACGGCTTCC AGGTGCATGT GGGGCCGATG
CGCTCGCCCT CGCGCGGGTC GGTGACGCTG CGCTCGGCCG ATCCCGAGGC GCCGCCGGTG
ATCCGCTTCA ACTACATGTC GACCGAAGAG GATTGGCAGG ACTTCCGCCG CTGCGTCCGC
CTCACGCGCG AGATCTTCGG GCAGGAGGCC TTCAAGCCCT TCGTCCGGCA CGAGATCCAG
CCGGGGGCGG CCTGCGCTTC GGACGCGGAG ATCGACGCCT TCATCCGCGC GCATGTCGAG
AGCGCCTACC ACCCCTGCGG GACGGCGCGG ATCGGGCGGG CGGACGATCC GATGGCGGTC
GTCGATCCCG AGTGCCGGGT GATCGGGGTC GAGGGGCTGC GCGTGGCCGA CAGTTCGATC
TTCCCGCGGG TGACGAACGG CAACCTCAAC GCGCCCTCGA TCATGGTGGG CGAGAAGGCG
TCCGACCATA TCCTCGGCCG GACCCCGCTC GCGCCGCTGA ACCTCGAGCC GGTGACGAAC
CCGAACTGGC GCACGGCCCA ACGCTGA
 
Protein sequence
MFDYIIVGAG SAGCAMAYRL GEAGRSVLVV EHGGTDAGPF IQMPAALSYP MNMGIYDWGL 
KTEPEPHLDG RVLATPRGKV IGGSSSINGM VYVRGHARDF DHWAESGATG WGFADVLPYF
KRMENWHVPG DVEWRGHDGP LHVTRGPRSN PLFNAFIEAG RQAGYPVTDD YNGAAQEGFG
PMEATIWQGR RWSAANAYLR PAMKRFGVQL TRALALKVVI EEGRAVGVEV QRRGGREVIR
AGREVILAAS SLNTPKLLML SGIGSAAHLA EHGIPVVADR PGVGRNLQDH LEVYMQFASL
QPVTLFKHWN LRGKVSIGAQ WLFTGRGLGA SNQFEACAFI RSKPGVDYPD IQYHFLPIAV
RYDGKAAAEG HGFQVHVGPM RSPSRGSVTL RSADPEAPPV IRFNYMSTEE DWQDFRRCVR
LTREIFGQEA FKPFVRHEIQ PGAACASDAE IDAFIRAHVE SAYHPCGTAR IGRADDPMAV
VDPECRVIGV EGLRVADSSI FPRVTNGNLN APSIMVGEKA SDHILGRTPL APLNLEPVTN
PNWRTAQR