Gene Rsph17029_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2079 
Symbol 
ID4897979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2199763 
End bp2200803 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content70% 
IMG OID640112673 
Productaldo/keto reductase 
Protein accessionYP_001043954 
Protein GI126462840 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.042571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCA TCCCCCTCGG CCGCACCGAT CTGACCGTTT CCGAACTCTG CCTCGGCACG 
ATGACCTGGG GCAGCCAGAA CAGCGAGGCC GAAGCCCATG CCCAGATCGA CCTCGCCCTC
GACCACGGGG TGAATTTCCT CGACACGGCC GAGATGTATC CGACCAATCC CGTGACGGCC
GAGACGGTGG GCGGCACCGA AACCATCATC GGTCGCTGGC TTGCCGCGCG GGGCGGGCGC
GACCGGATCG TGCTGGCGAC CAAGATCACC GGCGAGGGCA GTGCGGCGGT ACGCGGCGGC
GAGCCGGTGA CGCCCGAGAG CCTGCGCCGC GCGCTCGAGG GCTCGCTCGC GCGGCTCGGC
ACCGATCATG TCGATCTCTA CCAGATCCAC TGGCCGAACC GGGGCTCCTA TCACTTCCGC
AAGATGTGGG CCTATGTGCC GCCCACGGGG GTCGAGGCGG TGCGCGACAG CATGCTCGCG
GTGCTCGAAG AGGCGCAGAA GCTGGTGGCC GAGGGCAAGG TGCGCCATTT CGGCCTCTCG
AACGAGACGG TCTGGGGCGC GGCGCAGTGG CTCTCGCTGG CCGACCGGCA CGGGCTGCCG
CGCATGGCCT CGGTCCAGAA CGAATATTCG CTGCTCTGCC GCCAGTTCGA CACCGACTGG
GCCGAGCTTT CGGCGCTGGA GGAGATGCCG CTTCTGGCCT TCTCGCCCCT CGCTGCGGGG
CTTCTGTCGG GCAAATATGC CGGAGACGTG ACGCCCGACG GCTCGCGCCG CGAACGCAAT
GCCACGCTGG GCGGGCGCGT CACGCCCACC GTCTTCGAGG CGGTGGCGGG CTATCTCGGG
ATCGCCGCGC GCCACGGGCT CGACCCCTGC CAGATGGCGC TCGCCTTCTG CCGCAAGCGT
CCCTTCCCGG TGATCCCGAT CCTCGGCGCC ACCTCGCTCG ACCAGCTGCG CACCAACCTC
GGCGCCTGTG ACCTCGAGCT GTCCCCCGAA GTCGAGGCCG AGATCGCGGC GGCCCATCGC
ACCTGGCCCG CGCCCTACTG A
 
Protein sequence
MKRIPLGRTD LTVSELCLGT MTWGSQNSEA EAHAQIDLAL DHGVNFLDTA EMYPTNPVTA 
ETVGGTETII GRWLAARGGR DRIVLATKIT GEGSAAVRGG EPVTPESLRR ALEGSLARLG
TDHVDLYQIH WPNRGSYHFR KMWAYVPPTG VEAVRDSMLA VLEEAQKLVA EGKVRHFGLS
NETVWGAAQW LSLADRHGLP RMASVQNEYS LLCRQFDTDW AELSALEEMP LLAFSPLAAG
LLSGKYAGDV TPDGSRRERN ATLGGRVTPT VFEAVAGYLG IAARHGLDPC QMALAFCRKR
PFPVIPILGA TSLDQLRTNL GACDLELSPE VEAEIAAAHR TWPAPY