Gene Rsph17029_3940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3940 
Symbol 
ID4898327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1073702 
End bp1074721 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content70% 
IMG OID640114543 
Productaldo/keto reductase 
Protein accessionYP_001045790 
Protein GI126464677 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.336434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTACC GTCGTCTCGG CCCGAGCGGC CTTTTTGTCT CCGAGCTCTG CCTCGGCACC 
ATGACCTTCG GCGGCTCGGA CGGCATCTGG GGCCAGATCG GTCAGCTCGG ACAGGACGAG
GCGGATGCGC TGGTGCGCAC CGCGCTCGAT GCGGGCATCA ATTTCATCGA CACGGCCAAT
GTCTATGCGG GCGGCGAGAG CGAGCGCATC CTCGGCCGGT CGCTGCGCAA CCTCGGGGTG
CGGCGCGAGG ATGTGGTGAT CGCGACCAAG GTGCTCGGGC CGATGGGCGC GGGCGTCAAT
GCGCGCGGGG CCTCGCGCGT CCATATCCTC GATCAGTGCA AGGCCAGCCT CGAGCGGCTG
CAGCTCGACC ATATCGACCT CTATCAGATC CACGGGTTCG ACGCCGAGAC CCCCATCGTC
GAGACGCTGG AGGCGCTCGA CACGCTCGTG CGCCACGGCC ATGTCCGCTA CATCGGCCTG
TCGAACTGGG CGGCCTGGCA GGTGATGAAG GCGGTGGGGA TCGCCGAGGC GCGCCGGCTG
GCGCCGATCC TGTCGCTTCA GGCCTATTAC ACCCTGGCCG GCCGGGATCT CGAGCGCGAG
GTGGTGCCGA TGCTGAAGGA CACGGGCATG GGCCTCATGG TCTGGAGCCC GCTGGCGGGC
GGCTTCCTGT CGGGGAAATA CGACCGCGAG GGCAAGGCCG CCGACGGGCG CCGCGCGGCC
TTCGACTTCC CGCCGGTCGA CAAGGATCGC GGCTGGACCG TGATCGAGGC GATGCGCCCC
ATCGCGGAGG CCAAGGGCTC GTCTGTCGCG CAGGTGGCGC TGGCCTGGCT CCTGCATCAG
GAGGCGGTCA CGAGCGTGAT CGTGGGGGCC AAGCGCGTGG ACCAGCTGGC CGACAACATC
GCCGCGACCG AGGTGCGCCT CGAGGCCGAG GATCTGGCGG CGCTCGACCG GGCGAGCGCG
CTGGCGCCGG AATATCCGGG CTGGATGCTC GAGCGGCAGC GGAGCTACCG CGCCCGGTAG
 
Protein sequence
MRYRRLGPSG LFVSELCLGT MTFGGSDGIW GQIGQLGQDE ADALVRTALD AGINFIDTAN 
VYAGGESERI LGRSLRNLGV RREDVVIATK VLGPMGAGVN ARGASRVHIL DQCKASLERL
QLDHIDLYQI HGFDAETPIV ETLEALDTLV RHGHVRYIGL SNWAAWQVMK AVGIAEARRL
APILSLQAYY TLAGRDLERE VVPMLKDTGM GLMVWSPLAG GFLSGKYDRE GKAADGRRAA
FDFPPVDKDR GWTVIEAMRP IAEAKGSSVA QVALAWLLHQ EAVTSVIVGA KRVDQLADNI
AATEVRLEAE DLAALDRASA LAPEYPGWML ERQRSYRAR