Gene Rsph17029_0421 details


Gene Information

Locus tag: Rsph17029_0421
Symbol:
ID: 4895973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 435458
End bp: 436636
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 71%
IMG OID: 640111005
Product: glucose sorbosone dehydrogenase
Protein accession: YP_001042309
Protein GI: 126461195
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
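
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 435458
END_BP = 436636
GENE_LENGTH_BP = 1179
PROTEIN_LENGTH_AA = 392


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))    # 1179
print(protein_length(GENE_LENGTH_BP))   # 392
```

Both results agree with the Gene Length and Protein Length fields.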


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.464809
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGCC GCAGCCCGCT CGTCCTTCTG CTCCTGCCCG CCGGCGCCAT CGCCCAGCCG 
GTCGAGCAGG GTCCCCCCGC CACCGACTAC GAACCGGCCT TCGAGACCCA GACCCGCGCG
CCGGCGCTGG AAGCAACCGG GGCCACGGCG GAGCCCTTCG TGCAGGGGCT CGAGCATCCC
TGGGGCATCG CGGCCCTGCC GGAGGGCGGA TGGCTCGTGA CCGAGCGGCC CGGTCGGCTG
CGGATGGTCT CCGAGGACGG CACGCTGTCC GACCCGATCA AGGGTCTGCC CGAGGTGGAT
GCCCGCAAGC AGGGCGGTCT CCTCGACGTC GCCGCGGGCC CGACCTTCGC CGAAGACCGG
ATGATCTACT GGACCTATGC CAAGGCGGTC GAGGGCGGCA CCATCACCGC CGCGGCACGA
GGCGTGCTGT CGGAGGACGG AACGGAGGTG AGCGCGGTCG AGGACATCTT CCGGCAGGAG
CCGCCCTCAC AGGCGCCGAT GCACTACGGC TCGCGCATCC TGTTCGACGG CGAGGGCCAT
GCGATCATCA CCACGGGCGA ACATTCGATC GAGGCCGAGC GCGATCGGGC GCAGGATCTC
GGCACCAGCT ACGGCAAGGT GATCCGGGTG GCGCTGGATG GCAGGACGCC CGAGGACAAT
CCCTTCGCCG AGAGCGAGGG CCTCGGCACC ATCTGGAGCT ACGGCCACCG CAACATCCAG
AGCGCGGCCT TCGACGCGGA GGGCCAGCTC TGGATCGTCG AGCACGGGCC CAAGGGCGGA
GACGAGCTGA ACCTGATCCA GCCGGGCGCA AACTACGGCT GGCCCGAAGT GAGCTACGGG
GTGAATTACG ACGGCTCGCC CGTGGGCACC GGAGAGCCGC GCGGCGAGGG CTTCACCGAG
CCCACCTACT ACTGGGATCC GGTCATCGCG CCGGGCGACA TGACCTTCTA CCGGGGCACC
GCGTTCGAGG GCTGGCAGGG CGACCTGCTC GTGGGCTCAA TGAAGCCCGG CGGTCTCGTC
CGGCTGACGC TCGAGGAGGG CCGCGTCGCC GGCGAGGAGC GCCTGCTGGG CGACGTGGGC
CGGGTCCGCG ATGTCGAGGA GACGGGGGAG GGTCACCTTC TCCTGCTGAT CGACGCGCCC
GACGGCGGCA TCCTGCGGGT GACGCCCGAG GCCGGCTGA
 
Protein sequence
MIRRSPLVLL LLPAGAIAQP VEQGPPATDY EPAFETQTRA PALEATGATA EPFVQGLEHP 
WGIAALPEGG WLVTERPGRL RMVSEDGTLS DPIKGLPEVD ARKQGGLLDV AAGPTFAEDR
MIYWTYAKAV EGGTITAAAR GVLSEDGTEV SAVEDIFRQE PPSQAPMHYG SRILFDGEGH
AIITTGEHSI EAERDRAQDL GTSYGKVIRV ALDGRTPEDN PFAESEGLGT IWSYGHRNIQ
SAAFDAEGQL WIVEHGPKGG DELNLIQPGA NYGWPEVSYG VNYDGSPVGT GEPRGEGFTE
PTYYWDPVIA PGDMTFYRGT AFEGWQGDLL VGSMKPGGLV RLTLEEGRVA GEERLLGDVG
RVRDVEETGE GHLLLLIDAP DGGILRVTPE AG
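
The sequence blocks above are printed in groups of ten characters, so the spacing has to be removed before any computation. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons). The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError for codons containing anything other than A, C, G, T.
    """
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_block = "ATGATCCGCC GCAGCCCGCT CGTCCTTCTG CTCCTGCCCG CCGGCGCCAT CGCCCAGCCG"
print(translate(first_block))  # MIRRSPLVLLLLPAGAIAQP
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.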