Gene Rsph17029_1995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1995 
Symbol 
ID4896113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2114452 
End bp2115477 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content70% 
IMG OID640112589 
Productaldo/keto reductase 
Protein accessionYP_001043871 
Protein GI126462757 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATTGA CCATGCGTGA CTTCGACCGG ACGGGGCGCC CGCTACGCTT CACCGAGCTG 
GGCTTCGGCT CCTCGCCCCT GGGCAACCTC TACCGTGCCA TCTCGGACGA GGAGGCGCAG
GCGCTGCTCG AACGCGCCTG GGCCGGCGGC ATCCGCTATT TCGACACGGC GCCCCTCTAT
GGCTACGGGC TGGCCGAGGA ACGGCTGGGC CGCTTCCTGG CCGGGCACCC GCGGGCCGAT
TATGTGCTCT CGACCAAGGT CGGGCGGCTC CTGCGGCCGG TCGAGCCGGG CGAGGCCCGC
GACGGGTTGG GCAAGTTCTT CGAGGTGCCC GAGCGCAAGG AGCGGTTCGA CTACGGCTAC
GACGGGGTGA TGCGCTCGCT CGAGGCTTCG CTCGACCGGC TCGGCCTCGA CCGGGTGGAT
GTGCTCTATG CCCACGATCT CGACCTCTTC ACCCACGGCT CGCAGGAAGC GCTGGAGGCA
CGGCTCGCGG AATTCATGGC CGGCGGCTAC CGGGCGCTGG TCGAGCTGCG CGATCAGGGC
GTGATCTCGG CCTTCGGCGC GGGAGTGAAC GAGTGGCAGC CCTGCCAGTG GCTCGCCGAG
CGGGGCGAGT TCGACCTCTT CCTCCTGGCC GGCCGCTACA CCCTTCTGGA GCAGGAGGCG
CTCGAGAGCT TCCTGCCCCT GGCCGAAGAG CGCGGCATCG GCATCGTGAT CGGCGGCCCC
TACAATTCCG GCGTTCTCGC CACGGGTCCG AAGCCCGGCT CCTTCTACGA TTACCGGCTG
GCTCCGCAGG CCGTGCTCGA CCGGGTGGCC CAGATCCACA CGATCTGCGA GCGCTGGGGC
GTGCGGATGT TCGAGGCGGC CTTCCAGTTC CCGCTGCGCC ACCCCGCCGT GCTCTCGGTG
ATCCCCGGCC CGCAGTCGGT GGGCGAGGTG ATGGAGAACC GCATCGCGGC CGATGCCGAA
CTGCCGCCGG GTCTGTGGGA GGATCTTAAG GTGGCGGGGC TCCTCCGCCC CGACGCGCCG
GTCTGA
 
Protein sequence
MRLTMRDFDR TGRPLRFTEL GFGSSPLGNL YRAISDEEAQ ALLERAWAGG IRYFDTAPLY 
GYGLAEERLG RFLAGHPRAD YVLSTKVGRL LRPVEPGEAR DGLGKFFEVP ERKERFDYGY
DGVMRSLEAS LDRLGLDRVD VLYAHDLDLF THGSQEALEA RLAEFMAGGY RALVELRDQG
VISAFGAGVN EWQPCQWLAE RGEFDLFLLA GRYTLLEQEA LESFLPLAEE RGIGIVIGGP
YNSGVLATGP KPGSFYDYRL APQAVLDRVA QIHTICERWG VRMFEAAFQF PLRHPAVLSV
IPGPQSVGEV MENRIAADAE LPPGLWEDLK VAGLLRPDAP V