Gene Rsph17025_1771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1771 
Symbol 
ID5083677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1808503 
End bp1809759 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content66% 
IMG OID640483331 
Productglucose/sorbosone dehydrogenase-like protein 
Protein accessionYP_001167969 
Protein GI146277810 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.230445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAC TGACAGCGTC CCTCGCGGCA CTTTCCCTGA CCGCGGGTCT TGCCCATGCG 
CAGGGTGCCG ACGTGCCTGA CAACCTCGAG AAGCTCTCGA ATTTCCAGAG CACGGGCACG
ACCGACTTCA CCTTCATCGA GCAGGGCGGG GACTTCGCCG AAGGCATCAA GCGCACGCTC
GAGCGGATCA CGCTGCCGCC GGGCTTCCGC ATCGGCCTTT ACGCGGTCGT GCCCGACGCG
CGCCACATGG CGGTGGGGCC GCAGGGCATC GTGACCTTCG TCGGCACCCG CAAGGACAAG
GTCTGGGCCG TCACCGACCG CAACAAGGAC CGGGTCGCCG ACGAGGTGAA GGACTTCGCC
CCCTCGCTGC GCTTCACCAT CCCGAACGGT CCCTGCTTCT CGCGGGACGG CTTCCTCTAC
ATCGCCGAGC AAAACCGGGT GCTCCTGTTC CCCGCGGCCG AGTTCTTCTA CGAGTCGGGC
GACGTGGCGG CCTTCAACCT CGTCAAGCAG GGCGAGCTGA TCCCGGTCGA GGAGGAAAGC
TTCAACCACA CGGCGCGGGT CTGCGACATC GGGCCGGACG GGATGATCTA CATCGCACTC
GGCCAGCCCT TCAACGTGCC GGCGCCCGAA AAGCGCGAGC TTTACGACAA GTGGGGCATC
GGCGGCATCA TTCGCATGAA GACGGACGGC AGCGGCCGCG AGGTCTTTGC CCGCGGCATC
CGCAACTCGG TGGGCATGGA CATCCACCCA GAGACGGGCG AGGTCTGGTT CACCGACAAT
CAGGTGGACG GGATGGGCGA CGACATCCCT CCGGGTGAGT TGAACCGCGC GACCGGGCCG
GGGCAGAACT TCGGCTTTCC CTGGTATGGC GGCGGCAGCG TCCGCACCGT GGAATACAAG
GATGAAGAGC CGCCTGCGGA TGCGGTGATG CCTGTGGTCG AGATGGATGC CCATGCCGCC
GATCTCGGCA TGATGTTCTA CACCGGCTCG ATGTTCCCCG AGGAATATCG CGGGGCCATC
TTCTCGGCGC AGCACGGCTC GTGGAACCGG ACGACGCCGG TCGGCGCGCG CGTCATGGTC
ACCACCCTCG CCGAGGATGG CAGCGCCACG ACGAAGCCCT TTGCCGAAGG CTGGATCGAC
GAGAACGGCG AGTATCTCGG CCGGCCCGTC GATGTGGCGC AGCTGCGCGA CGGCTCGATC
CTCGTGTCGG ACGATCTGGT GGGGGCGATC TACCGCATCT GGTATCAGCC GGAATGA
 
Protein sequence
MKRLTASLAA LSLTAGLAHA QGADVPDNLE KLSNFQSTGT TDFTFIEQGG DFAEGIKRTL 
ERITLPPGFR IGLYAVVPDA RHMAVGPQGI VTFVGTRKDK VWAVTDRNKD RVADEVKDFA
PSLRFTIPNG PCFSRDGFLY IAEQNRVLLF PAAEFFYESG DVAAFNLVKQ GELIPVEEES
FNHTARVCDI GPDGMIYIAL GQPFNVPAPE KRELYDKWGI GGIIRMKTDG SGREVFARGI
RNSVGMDIHP ETGEVWFTDN QVDGMGDDIP PGELNRATGP GQNFGFPWYG GGSVRTVEYK
DEEPPADAVM PVVEMDAHAA DLGMMFYTGS MFPEEYRGAI FSAQHGSWNR TTPVGARVMV
TTLAEDGSAT TKPFAEGWID ENGEYLGRPV DVAQLRDGSI LVSDDLVGAI YRIWYQPE