Gene Rsph17025_2310 details

Gene Information

Locus tag: Rsph17025_2310
Symbol:
ID: 5084960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 2352958
End bp: 2354604
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 69%
IMG OID: 640483873
Product: choline dehydrogenase
Protein accession: YP_001168504
Protein GI: 146278345
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID: [TIGR01810] choline dehydrogenase
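
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and that the gene length counts the terminal stop codon. The function names are illustrative and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus the stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
start_bp, end_bp = 2352958, 2354604
gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the check gives 1647 bp and 548 aa, which matches the Gene Length and Protein Length fields.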


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.568209
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGATT ATGTGATCGT GGGTGCCGGC TCGGCCGGCT GTGCCATGGC CTACCGGCTG 
GGCGAGGCGG GCCGTTCGGT GCTGGTCATC GAGCATGGGG GGACCGATGC CGGCCCCTTC
ATCCAGATGC CGGCGGCGCT CTCCTATCCG ATGAACATGG GGATCTACGA CTGGGGCCTG
AAGACCGAAC CCGAGCCGCA TCTGGGCGGG CGGGTGCTGG CCACGCCGCG CGGCAAGGTG
ATCGGGGGCT CGTCCTCGAT CAACGGGATG GTCTATGTCC GCGGCCATGC GCGCGACTTC
GACCACTGGG CCGAGGCGGG GGCGGCGGGC TGGGGCTTTG CCGAAGTGCT GCCCTATTTC
AAGCGGATGG AAAACTGGCA CGTTCCGGGC GACGTGGACT GGCGCGGCCA TGACGGGCCG
CTGCATGTGA CGCGGGGGCC ACGTTCGAAC CCGCTGTTCA ACGCCTTCAT CGAGGCGGGC
CGGCAGGCCG GCTATCCGGT CACCGACGAC TACAACGGCG CCGCGCAGGA GGGCTTCGGC
CCGATGGAGG CCACGATCTG GCAGGGCCGG CGCTGGTCGG CGGCCAACGC CTATCTCAAG
CCCGCCATGA AGCGGTTCGG CGTCAAGGTC ACGCGCGCGC TCGCGCTGCG GGTGGTGATC
GAGGAGGGCC GGGCGGTCGG CGTCGAGGTG CAGCGCCGCG GGCGGCGCGA GGTGATCCGG
GCGGGTCGCG AGGTGATCCT CGCCGCCTCC TCGATCAACA CGCCGAAGCT GCTGATGCTG
TCGGGCATCG GGCCGGCCGC GCATCTGGCC GAGCATGGGC TGCCCGTGGT GGCCGATCGG
CCGGGCGTGG GCCGGAACCT CCAGGATCAT CTGGAGGTCT ACATGCAATA CGCAAGCCTC
CTGCCGGTCA CGCTCTTCAA GCACTGGAAC CTGCGCGGCA AGGTGATGGT CGGCGCGCAG
TGGCTGTTTA CGGGGCGCGG CCTTGGCGCC TCGAACCAGT TCGAGGCCTG CGCCTTCATC
CGCTCCAGGC CCGGGGTGGA TTATCCCGAC ATCCAGTATC ACTTCCTGCC GATCGCCGTG
CGCTATGACG GCAAGGCCGC GGCCGAGGGG CACGGTTTTC AGGTCCATGT CGGACCGATG
CGCTCGCCCT CGCGCGGGTC GGTCACGCTG CGGTCGGCCG ATCCCGAGGC CGCCCCGGTA
ATCCGCTTCA ACTACATGTC CACGCCCGAG GACTGGGAGG ACTTCCGCCG CTGCATCCGC
CTCACGCGCG AGATCTTCGG GCAGGAGGCC TTCCGCCCCT TCGTGAAGGG CGAGATCCAA
CCCGGGCCGG CCTGCCAGTC CGATGACGAG ATCGACGCCT TCATCCGCGA GCATGTCGAG
AGCGCCTACC ACCCGTGCGG CACGGCGCGG ATGGGGCGGG CGGACGATCC GATGGCGGTG
GTCGATCCCG AATGCCGGGT GATCGGCGTG GCGGGGCTGC GCGTGGCCGA CAGTTCGATC
TTCCCGCGGG TGACGAACGG AAACCTGAAC GCCCCCTCGA TGATGGTGGG CGAGAAGGCG
GCGGACCATG TGCTCGGCCG GACGCCGCTG GCGCCCTTGA ACCACGAGCC CGTGGTGAAC
CCGAACTGGC GCGTGGCGCA GCGGTGA
 
Protein sequence
MFDYVIVGAG SAGCAMAYRL GEAGRSVLVI EHGGTDAGPF IQMPAALSYP MNMGIYDWGL 
KTEPEPHLGG RVLATPRGKV IGGSSSINGM VYVRGHARDF DHWAEAGAAG WGFAEVLPYF
KRMENWHVPG DVDWRGHDGP LHVTRGPRSN PLFNAFIEAG RQAGYPVTDD YNGAAQEGFG
PMEATIWQGR RWSAANAYLK PAMKRFGVKV TRALALRVVI EEGRAVGVEV QRRGRREVIR
AGREVILAAS SINTPKLLML SGIGPAAHLA EHGLPVVADR PGVGRNLQDH LEVYMQYASL
LPVTLFKHWN LRGKVMVGAQ WLFTGRGLGA SNQFEACAFI RSRPGVDYPD IQYHFLPIAV
RYDGKAAAEG HGFQVHVGPM RSPSRGSVTL RSADPEAAPV IRFNYMSTPE DWEDFRRCIR
LTREIFGQEA FRPFVKGEIQ PGPACQSDDE IDAFIREHVE SAYHPCGTAR MGRADDPMAV
VDPECRVIGV AGLRVADSSI FPRVTNGNLN APSMMVGEKA ADHVLGRTPL APLNHEPVVN
PNWRVAQR
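
The sequences above are printed in space-separated blocks of ten residues, so they need light cleaning before any computation. The sketch below shows one way to do that, along with a GC-fraction calculation and a basic sanity check for a complete coding sequence. All function names are hypothetical. The stop codons are those of translation table 11. Requiring an ATG start is a deliberately strict assumption, since table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, with an ATG start and a stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Toy input for illustration only; paste the full gene sequence to check the record.
toy = clean_sequence("ATGGCC\nTGA")
print(toy, looks_like_complete_cds(toy), round(gc_fraction(toy), 2))
```

Running the same functions on the full gene sequence would allow its length, start and stop codons, and GC content to be compared with the Gene Length and GC content fields.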