Gene Rsph17025_3908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3908 
Symbol 
ID5085456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp808475 
End bp809764 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content71% 
IMG OID640485466 
Producthypothetical protein 
Protein accessionYP_001170067 
Protein GI146279909 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.724834 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.185081 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTTC TGGCGCGTGC GACCTCCATC GTCGGCAACA CGATGGTCCT GATGAGGCGG 
ATCGGCTCGC CGGGCACCCA GGCGGTGGGG GCAAGCCCCG CCATCCCCGA GGCCCGGAAG
CAGGGGATCA TGACGCTCAA GATGCCCGCT GCGCACGGCT GGGCGCCGGG GCACCTGCCC
ACGCCGGCGC CCGGGCTGAA GGTCAACGCC TTTGCCACGG GGCTCGAGCA TCCCCGCTGG
ATCGAGGTCC TGCCCAACGG GGACGTTCTC GTGGCCGAAG CGCGCCAGAT CCCGACGCCC
CCCAAGAGCC TGCTGGATCG GGCGGCACAG GCCACCATGC GGCGGGCGCG CGCCCTCGGC
GACAGCCCGA ACCGCATCAC GCTCCTGCGC GACGGCGGGC AGAGTGGCGA GGCCACACAG
CGCGAGACCT TCCTTGCGGG CCAGAGCCAG CCCTTCGGCA TGGCGCTGGT CGGCGACACC
TTCTATGTCG GCAACACCGA CGGGATCGTG GCCTTCCCCT ACAGCGAGGG CGCGACGCGC
CTCGAGGGCG AGGGCCGCAA GCTCGTGACC TTCAAGCCCG GCGGGCACTG GACGCGCAGC
CTGATCGTCT CGCCCGACGG CCGCCGGATC TATGCGGGCG TGGGATCGCT CAGCAACATC
GGTGACGACG GGATGGAGGC GGAAGAGGGC CGCGCCGCGA TCTGGGAGCT TGACCTGGCG
TCGGGCACGT CGCGCATCTA CGCCTCGGGC CTGCGCAACC CGGTGGGGCT CGCGTGGGAG
CCCGCAACCC GCGTGCTCTG GACCGTGGTG AACGAGCGCG ACGGGCTGGG CGACGAGACA
CCCCCCGACT ATCTGACTTC GGTCGAGGAA GGGGGCTTCT ATGGCTGGCC CTACTGCTAC
TGGAACCGCA TCGTGGATGA TCGCGTGTCC CAGGATCCGG CGATGGTGGC GCGCGCGATC
ACGCCCGACT ATGCACTCGG CGGGCACACG GCCTCGCTCG GCCTGTGCTG GGTGCCGCCG
GGGACGCTCC CCGGCTTTCC GGGCGGGATG GCCATCGGCC AGCACGGCTC GTGGAACCGC
TCGAAGCTGA GCGGCTATCG GCTGATCTTC GTGCCCTTCG CGAATGGCCG CCCCTCGGGT
CCGCCCCGCG ACATCCTCAC GGGCTTCCTC TCGCCTGACG AAAAGCTTGC CTATGGCCGC
CCCGTTGGCG TCGCGGTGGG CCCGGACGGC CGGTCCCTCC TGCTCGCGGA CGATGTGGGC
GACGTGATCT GGCGCGTGAC GGGAGCGTGA
 
Protein sequence
MDFLARATSI VGNTMVLMRR IGSPGTQAVG ASPAIPEARK QGIMTLKMPA AHGWAPGHLP 
TPAPGLKVNA FATGLEHPRW IEVLPNGDVL VAEARQIPTP PKSLLDRAAQ ATMRRARALG
DSPNRITLLR DGGQSGEATQ RETFLAGQSQ PFGMALVGDT FYVGNTDGIV AFPYSEGATR
LEGEGRKLVT FKPGGHWTRS LIVSPDGRRI YAGVGSLSNI GDDGMEAEEG RAAIWELDLA
SGTSRIYASG LRNPVGLAWE PATRVLWTVV NERDGLGDET PPDYLTSVEE GGFYGWPYCY
WNRIVDDRVS QDPAMVARAI TPDYALGGHT ASLGLCWVPP GTLPGFPGGM AIGQHGSWNR
SKLSGYRLIF VPFANGRPSG PPRDILTGFL SPDEKLAYGR PVGVAVGPDG RSLLLADDVG
DVIWRVTGA