Gene Rsph17029_4024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4024 
Symbol 
ID4899060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1170380 
End bp1171981 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content69% 
IMG OID640114627 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001045874 
Protein GI126464761 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGCCG ATTACATCAT CGTGGGGGCG GGCAGCGCAG GCTGCGTGCT GGCGAACCGC 
CTGTCCAAGG ATCCCTCGAA CCGCGTGCTG CTGATCGAGG CGGGCAAGCG CGACAATTAC
CACTGGGTGC ATATCCCGGT GGGCTATCTC TACTGCATCA ACAATCCCCG CACCGACTGG
TGCTTCACCA CCGAGCCCGA GGAAGGGCTC GAGGGTCGCA GCCTGATCTA TCCCCGCGGC
AAGGTGCTCG GCGGCTGTTC CTCGATCAAC GGCATGATCT ACATGCGCGG GCAGGCCGAG
GATTACGACG GCTGGCGCCA GATGGGCTGC ACCGGCTGGG GCTGGGACGA TGTGCTGCCC
CTCTTCCGCC GCCAGCAGGA CCACCACCGC GGCGAAAGCG AACATCACGG CGCGGGCGGC
GAATGGCGGG TGGAGCGGGC GCGGGTCCGC TGGGCAGTGC TCGACGCTTT CCTCGATGCG
GCCGAGCAGG CGGGCATCCC GCGGACCGAG GATTTCAACC GCGGCTCGAA CGAGGGCGGC
GGCTATTTCG ACGTGAACCA GAGGTCCGGC ATCCGCTGGA ACACGGCCAA GGCCTTCCTG
AAGCCCGCCC TCTCCCGCCC GAACCTGCGC GTCGTGACCG AGGCGCAGGT CGAGCGGCTG
ATCGTCGAGG CGGGCGAGGT GCGGGGCGTG CTCTACCGGC AGGGCGGCAC CCTGCACGAG
GCCCGGGCGC GGCGCGAGAC GGTCCTTGCG GCGGGTGCCA TCGGCTCGCC GCACATTCTG
GAGCTTTCGG GCATCGGCGA TCCCGAGGTG CTGCGCGCGG CGGGCGTCGA GCCGCAGGTC
GCCGTGCCGG GCGTGGGCGC GAACCTGCAG GATCACCTGC AGCTCCGCCT CGTCTTCAAG
GTGCGGGGCG TGCCCACGCT GAACGAGAAG GCCACCAGCC TCTTCGGCCG TGCCGCGATC
GGGGCGGAAT ATCTCCTGCG CCGGTCGGGG CCGATGTCGA TGGCACCGAG TCAGGTCGGG
ATCTTCACCC GCTCCGGCTC CGAGAAGGCC ACGCCCGATC TCGAGTTCCA TGTCCAGCCG
GTCTCGCTCG ACAAGTTCGG CGACAAGGTC CACCCCTTCC CCGGCATGAC GGCGAGCGTC
TGCAACCTTC GCCCCGAAAG CCGCGGCAGC GTCCATCTGA AAAGCCCCGA TCCCGCGCGT
CAGCCCGCCA TCGCGCCGCA CTATCTTTCG ACCGAGGGCG ACCGCGAGGT GGCGGTGCGC
TCGATCCAGA TCGCGCGCCA TATCGCCTCG CAGCCCGCCT TTGCGCGGTT TCACCCCGAG
GAATACCGTC CGGGAGCCGA GCACGACACG CGCGAGGCGC TGGTCGCCGC CGCGGGCCGC
ATCGGTACCA CGATCTTCCA CCCGGTCGGC ACCTGCCGCA TGGGGTCGGA TCCGGCGAGC
GTCGTCGATC CGCGGCTGAA GTTCCGGGCG CTCGGCGGCC TCAGGATCGC GGATGCGTCG
ATCATGCCGG CCATCACCTC GGGAAACACC AACTCGCCCA CCCTCATGAT TGCCGAGAAG
GCGGCCGAGA TGATCCTCGA GGATGCCCGG CAGCGGGTTT GA
 
Protein sequence
MEADYIIVGA GSAGCVLANR LSKDPSNRVL LIEAGKRDNY HWVHIPVGYL YCINNPRTDW 
CFTTEPEEGL EGRSLIYPRG KVLGGCSSIN GMIYMRGQAE DYDGWRQMGC TGWGWDDVLP
LFRRQQDHHR GESEHHGAGG EWRVERARVR WAVLDAFLDA AEQAGIPRTE DFNRGSNEGG
GYFDVNQRSG IRWNTAKAFL KPALSRPNLR VVTEAQVERL IVEAGEVRGV LYRQGGTLHE
ARARRETVLA AGAIGSPHIL ELSGIGDPEV LRAAGVEPQV AVPGVGANLQ DHLQLRLVFK
VRGVPTLNEK ATSLFGRAAI GAEYLLRRSG PMSMAPSQVG IFTRSGSEKA TPDLEFHVQP
VSLDKFGDKV HPFPGMTASV CNLRPESRGS VHLKSPDPAR QPAIAPHYLS TEGDREVAVR
SIQIARHIAS QPAFARFHPE EYRPGAEHDT REALVAAAGR IGTTIFHPVG TCRMGSDPAS
VVDPRLKFRA LGGLRIADAS IMPAITSGNT NSPTLMIAEK AAEMILEDAR QRV