Gene Rsph17029_3581 details

Gene Information

Locus tag: Rsph17029_3581
Symbol:
ID: 4898299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 672334
End bp: 673695
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 71%
IMG OID: 640114190
Product: MmgE/PrpD family protein
Protein accession: YP_001045444
Protein GI: 126464331
COG category: [R] General function prediction only
COG ID: [COG2079] Uncharacterized protein involved in propionate catabolism
TIGRFAM ID:
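
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
length = gene_length_bp(672334, 673695)
print(length)                     # 1362
print(protein_length_aa(length))  # 453
```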


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.604157
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0576732
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTTG CCCAAGAACT CGCCCGGCGG GCGATGTCGA TCCGGCTCGA CAGCCTGCCC 
GAAGACGCGC TCCACATCGG CCGCCGCGCC TTCGCCGACA CGGTGGGGGT GGCCCTCGCC
GGGTCGGACG CGCCCTGCCT CGACGCGATC GAGGAGGCGC TGGCCATTGC GAACGCCCCC
GGACAGGTCA CGCTCTGGGG CCGGGGCGGC CGTCGCGCCT CGATCCTGCA TGCCTGCATG
GTGAACGGCA CCTCGGCCCA TGCGCTCGAT TTCGACGATT GCTCGACCAC GATGGGCGGC
CATCCCTCGG CGCCGGTGGT GCCGGTGGTG CTGGCGCTGG CCGAGGCCCA TGGCGCGCCC
GCGGGCAGGG CGCTCGAGGC CTGGGTCACG GGCGTCGAGG TCGAGACCCG GCTCGCCCGC
GGGCTTCTGC CCCATCATTA CGAGAAGGGA TGGCATCCGA CGGCCACGCT CGGCGTCTTC
GGAGCCACCG CGGCCGCGGC GCGGATGCTC GATCTCGACG AGGCGCAGAC CACCACCGCC
CTTGCCGTCG CCGTCTCGAT GGCCTCGGGC CTCAAGTCGA ACTTCGGCAC CCCGGTCAAG
CCCATGCATG TGGGTCAGGC CGCCCACAAC GGGCTGATGG CGGCCCTCGT CGCCCGCCGC
GGCATGACCG CGAACGCCGA GGCCTTCGAA CATACCTACG GGTTCTTCAA TCTCTTCAAC
GGCCCGGGCA CGTTCGATGC GGATGCGATC CTTGCGGACT GGGACGGACC GCTCGAGGTG
CTGAGCCCCG GCATCGCGAT CAAGCAGCAT CCCTGCTGCG GCAGCGCCCA TTCGGCCATC
GACGCGGCCC TGCGGATCGT GGCGGCCGAA GGCCTTTTGC CTGCCGAGGC CATCGCCCGC
ATCGACATCC GCACCCACGA GCGCCGGCTG GCCCATACCA ACCGCCCTGC CCCGCGCTCG
GGTCTCGACG CGAAATTCAG CGTCCAGTTC CTCACCGCGC GCGCCCTGAC CGCGGGCCGG
ATCCGGCTGG CCGATTTCGA CGATGCGCGC TTCCTCACGC CCGAGATCGC GGCCCTTCTG
CCGCGGGTCA GCGCCACGGG GCACCGCGAG GCCGACGCCT ACCGGGGCGA GGTGCGGGTC
ACGATGACGG ACGGGCGCCT CTTCGAGGCC AGCGCCTCGA CCAATTTCGG CCGCGGGCCG
CTGAACCCGA TGTCGGACGC CGAACTGACC GAGAAATTCA CCGATTGCGC GGCGGCGCGG
CTGGGCTCCG GGGCGGCCGG GATCTGTGCG GCCTTCCTTG CGCTCGCACC CGACACCCCG
CTTGCGCCGC TTCTTGCACA GCTCAGCGGA CAGGAGGACT GA
 
Protein sequence
MTFAQELARR AMSIRLDSLP EDALHIGRRA FADTVGVALA GSDAPCLDAI EEALAIANAP 
GQVTLWGRGG RRASILHACM VNGTSAHALD FDDCSTTMGG HPSAPVVPVV LALAEAHGAP
AGRALEAWVT GVEVETRLAR GLLPHHYEKG WHPTATLGVF GATAAAARML DLDEAQTTTA
LAVAVSMASG LKSNFGTPVK PMHVGQAAHN GLMAALVARR GMTANAEAFE HTYGFFNLFN
GPGTFDADAI LADWDGPLEV LSPGIAIKQH PCCGSAHSAI DAALRIVAAE GLLPAEAIAR
IDIRTHERRL AHTNRPAPRS GLDAKFSVQF LTARALTAGR IRLADFDDAR FLTPEIAALL
PRVSATGHRE ADAYRGEVRV TMTDGRLFEA SASTNFGRGP LNPMSDAELT EKFTDCAAAR
LGSGAAGICA AFLALAPDTP LAPLLAQLSG QED
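
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It assumes the standard codon assignments used by NCBI translation table 11 (the same amino acid assignments as the standard code), a sequence read in frame from its first base, and an ATG start as in this gene. The helper names `translate` and `gc_fraction` are hypothetical; this is not the pipeline that produced the record.

```python
from itertools import product

# Codon table in the NCBI ordering (bases T, C, A, G; first base varies slowest).
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):      # trailing partial codons are ignored
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two ten-base groups and the last twelve bases of the gene sequence above.
print(translate("ATGACCTTTG CCCAAGAACT"))  # MTFAQE
print(translate("CAGGAGGACT GA"))          # QED
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input gives the value behind the GC content field.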