Gene Rsph17029_1647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1647 
Symbol 
ID4895409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1739633 
End bp1741036 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content72% 
IMG OID640112240 
ProductGntR family transcriptional regulator 
Protein accessionYP_001043529 
Protein GI126462415 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.420905 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGATA CAATATGGCA TCCTGACCTC GCACAATTTC CCGGCCCCAA ATATCTCGCC 
CTGACCCGGG CGCTGCGGGA GGCGATCCGC GAGGGGGTGC TGCTGCCGGG TGCGCAGCTT
CCGACCGTGC GGGATCTGGC TTGGAGGCTG TCGGTGACGC CCGGCACCGT CTCGCGGGCC
TATCAGATGG CCACGCAGGA GGGGCTTCTG GCCGCGACCG TGGGGCGGGG CACCTTCGTC
GCGGCGGCCG AGCCTCGTCT CGGGCCGACG CAGGCCCTTT TCGTCGACCG CGAGCCGCAG
GCCGCGCCGG GCCTTCTGGA TCTGCGCTCG CCGCAACTGC CGGACGTGGG GCAGATGCCG
CTCTTCGCCG AGGCGCTGCG GCGGGTGGCG GGGCAGGTCG GCAACGACTG GCGCGATTAC
CCCACACAGC GCGAGGAGAC GGCCCTGCGC GAGGCGGTGC GCGACTGGCT CGGCGACCGG
GTGCTGGGGC CGGTCACGCC CGAGGACATT GCCCTCACCC ATGGCGGGCA GAGCGGCATC
GGCCTCGTGA TGTTCTGCTG CCTTCGCGGC GACCGGCCCG TGGTGCTGAC CGAGGAGCTG
GCCTATCCCG GTTTCCGTCA TGCGGCGCGG CTGGCGCGGG CCGAGGTGGT GGGCGTCGAG
CTCGACCAGC ACGGGATCCG GCCGGATGCG CTGGAGGCCT GCTGCCGCAA GCATCTGCCG
CAGGTGCTGT GCGTCACGAC GGAGGCGCAG AACCCGACCG CCGTGCGGAT GCCCGAGGAG
CGCCGGGCCG AGATCGTGGC CATCGCCCGC CGGCACGAGC TCCAGATCAT CGAGGACGAT
TGCTATACGG TGGCCGAAAG CACGCTGCCC TCGATGCGCG CGCTCGCGCC CGAGCGGACG
TGGTATGTGG GCAGTCTCTC GAAGACCGTC TCGGCGGCGC TGCGCTTCGG CTATATCCTC
TGCCCGACGG GCCGGGGCGA GGCGGGGCGC CTGACGGCGC AGCACGCGTT CTTCGCCCTG
GGCCGGCCGG TCTCGGATCT CTGCCTGGAC CTCTTCCGCA GCGGTCAGGC CGTCGAGATC
CGCAGTCGCG TCCAGAGCGC CTTCGCCGAC CGGCTGAAGG CCATCGTGAA CGGGCTCGGC
GCGCACGATC TGGTCTGGCA GCCGGGGCTG CCCTTCGTCT GGCTGCGGCT GCCGGTGGGT
TGGCGCACCT CCTCCTTCAC CCGCACCGCC GAAGCAGAGG GCGTGCTGCT GCGGTCGGCC
GACGAGTATG CGCTGGTGCA CGGACGCTCG CCCAACGCCG TGCGGCTCGC CATCGCAGGC
CAGGTGCCGC GCGCCCGGCT CGAGGCGGCG GTGGACCGGC TGTCGCGGCT GCTGGTCTCG
CCACCGTCGG AACTGCCTGT GTGA
 
Protein sequence
MTDTIWHPDL AQFPGPKYLA LTRALREAIR EGVLLPGAQL PTVRDLAWRL SVTPGTVSRA 
YQMATQEGLL AATVGRGTFV AAAEPRLGPT QALFVDREPQ AAPGLLDLRS PQLPDVGQMP
LFAEALRRVA GQVGNDWRDY PTQREETALR EAVRDWLGDR VLGPVTPEDI ALTHGGQSGI
GLVMFCCLRG DRPVVLTEEL AYPGFRHAAR LARAEVVGVE LDQHGIRPDA LEACCRKHLP
QVLCVTTEAQ NPTAVRMPEE RRAEIVAIAR RHELQIIEDD CYTVAESTLP SMRALAPERT
WYVGSLSKTV SAALRFGYIL CPTGRGEAGR LTAQHAFFAL GRPVSDLCLD LFRSGQAVEI
RSRVQSAFAD RLKAIVNGLG AHDLVWQPGL PFVWLRLPVG WRTSSFTRTA EAEGVLLRSA
DEYALVHGRS PNAVRLAIAG QVPRARLEAA VDRLSRLLVS PPSELPV