Gene RSP_3072 details


Gene Information

Locus tag: RSP_3072
Symbol: sndh
ID: 3721348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007494
Strand:
Start bp: 112532
End bp: 113821
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 71%
IMG OID: 640072748
Product: putative L-sorbosone dehydrogenase
Protein accession: YP_354589
Protein GI: 77465086
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
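
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(112532, 113821)
print(length)                     # 1290
print(protein_length_aa(length))  # 429
```

Both values match the Gene Length and Protein Length fields of this record.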


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.726244
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTTCC TGGCGCGTGC CACCGCCATC GTCGGCAATA CGATGGTCCT GATGCGGCGG 
TTCGGATCGC CGGGCACCCA GGCGATCGGG CAGAGCCCGG CCATCCCCGA GGCCCAGAAG
CAGGGCATCA TGACCCTCAA GATGCCCGCG GCCAAGGGCT GGGCGCCGGG CCATCTGCCG
ACGCCCGCAC CGGGCCTCAA GGTCAATGCC TTCGCCCGCG ATCTGGAACA TCCGCGCTGG
ATCGAGGTGC TGCCCTCGGG CGACGTGCTG GTTGCCGAGG CGCGCCAGCT TCCCACCCCG
CCGAAGACCC TCCTCGACCG TGCGGCGCAG GCCACCATGC GCCGCATCCG CGCGCTCGGC
GACAGCCCGA ACCGCATCAC CCTCCTGCGC GACCCGGACG GCCGGGGCGA GGCGCAAGAG
CGCGAGACCT TCCTCGAGAA CCAGAGCCAG CCCTTCGGCA TGGCGCTGGT GGGCGACACC
TTCTACGTCG GCAACACCGA CGGCATTATG GCCTTCCCCT ACCGTCCGGG CGCCACGCGG
CTCGAGGGGC CGGGACGGCG CCTGACCACC TTCAAGCCCG GCGGGCACTG GACGCGCAGC
CTGATCGTCT CGCCCGACGG GCGCCGGATC TATGCCGGCG TGGGCTCGCT CAGCAACATC
GGCGACGACG GGATGGAGGC CGAGGAGGGC CGCGCCGCGA TCTGGGAGCT GGACCTCGCC
AGCGGTCAGG CCCGGATCTA TGCCTCGGGC CTGCGCAACC CGGTGGGCCT CGCGTGGGAG
CCCACGACGC GCGTGCTCTG GACCGTGGTC AACGAGCGCG ACGGGCTCGG CGACGAGACC
CCGCCGGACT ATCTGACCTC CGTCGAGGAG GACGGCTTCT ACGGCTGGCC CTATTGCTAC
TGGAACCGGA TCGTCGACGA CCGCGTGCCG CAGGACCCGG CGATGGTCGC CCGCGCGATC
ACGCCCGACT ATGCGCTCGG CGGACACACG GCCTCGCTCG GCCTCTGCTG GGTGCCGGCG
GGCACGCTGC CGGGCTTCGG CGACGGGATG GCCATCGGCC AGCACGGCTC GTGGAACCGT
TCGAAACTCA GCGGTTACCG GCTGATCTTC GTGCCCTTCG CGAACGGCCG GCCCTCCGGC
CCGCCGCGGG ACATCCTGAC CGGTTTCCTG TCCGACGACG AGAAGCTGGC CTACGGTCGC
CCGGTCGGCG TGGCGGTGGG CCCCGACCGC CGCTCGCTTC TGCTCGCCGA CGATGTGGGC
GACGTGATCT GGCGCGTAAC CGGCGCCTGA
 
Protein sequence
MDFLARATAI VGNTMVLMRR FGSPGTQAIG QSPAIPEAQK QGIMTLKMPA AKGWAPGHLP 
TPAPGLKVNA FARDLEHPRW IEVLPSGDVL VAEARQLPTP PKTLLDRAAQ ATMRRIRALG
DSPNRITLLR DPDGRGEAQE RETFLENQSQ PFGMALVGDT FYVGNTDGIM AFPYRPGATR
LEGPGRRLTT FKPGGHWTRS LIVSPDGRRI YAGVGSLSNI GDDGMEAEEG RAAIWELDLA
SGQARIYASG LRNPVGLAWE PTTRVLWTVV NERDGLGDET PPDYLTSVEE DGFYGWPYCY
WNRIVDDRVP QDPAMVARAI TPDYALGGHT ASLGLCWVPA GTLPGFGDGM AIGQHGSWNR
SKLSGYRLIF VPFANGRPSG PPRDILTGFL SDDEKLAYGR PVGVAVGPDR RSLLLADDVG
DVIWRVTGA
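
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed in this record.
first_line = "ATGGATTTCC TGGCGCGTGC CACCGCCATC GTCGGCAATA CGATGGTCCT GATGCGGCGG"
seq = clean_sequence(first_line)
print(len(seq))         # 60 bases on a full line
print(gc_percent(seq))  # GC percentage of this line only, not of the whole gene
```

Passing the full gene sequence through `clean_sequence` first gives a single string that can be compared against the Gene Length and GC content fields.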