Gene Rsph17029_2179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2179 
Symbol 
ID4895952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2309501 
End bp2310805 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content72% 
IMG OID640112773 
ProductRNA polymerase, sigma 54 subunit, RpoN 
Protein accessionYP_001044054 
Protein GI126462940 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.570526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0685307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATGA TGCAGTTCCA GCGTCAGACG ACCCAGCTGG CCATGACCCA GCGGATGCAG 
GAGTCGCTGC GGATCCTGCA GATGAGCAAC GCCGATCTCG CCGACTATCT GACGGCGCAG
GCGCTGGAAA ATCCCTGCCT CGAGGTGCGC GTGCCCGAAG GGACGTCGGT CGCCCCCGCG
CTGCCCTCGC GCGGGATCCA GGCGGGGCTC GACCGCGATG CCTTCGCCAC CGTCGAGGGC
CAGCCGCCGA GCCTTCTGGC CCATGTCGAG GCGCAGATCG ATCTGGCCTT CTTCGATCCG
GGCGACCGGC GCACGGCCCT GGCCTTCGCC GAGGCGCTGG AGCCCTCGGG CTGGCTCGGC
CAGCCCGTCT CCGAGATCGC CGCCGCGGCC GAGGTGGAGG AGGAGGAGGC GCTGGTCATC
CTCGAGCGGT TGCAGGCCTT GGAGCCCGCG GGCCTCTTCG CCCGGTCGCT GGCCGAATGC
CTCGCGCTGC AGCTCGAGGA TCTGGGGCTG CTGACCTGGG AGCTGCGCAC GATGCTCGAC
CATCTGCCGC TTCTCGCCGA GGGGCGGATC GCCGATCTCG CCCGCCGCTG CGACTGCGAG
CCCGAGCATA TCCGCGAGAA TCTGGCGCTG ATCCGCAGCC TGAGCCCCAA GCCCGGCGAG
GCCTTCGCGG CCGACCGCAC GCCGATCCAG CCGCCCGACG TGCGCGTGCT GCGCGGCCCG
GAGGGCTGGG AGGTCGAGCT CACCCGGGCG CAGCTGCCCC GCATCCGGGT CAGCGAGGCA
GGAGACACCG GCGACCGGCA GGCCGACGCC TGGCTCGCCC GCGCCCGCTC GCAGGCGCGC
TGGCTGGAGC GGGCGGTCGA GCGGCGGCAG GCCACGCTCC TGCGCACCGC CGTCTGCCTC
GTGCGCCATC AGGCCGACTT TCTCGATCAG GGGCCGCGCG CGCTCCGGCC GCTGTCGATG
GAGGAGGTGG CGCTGGAACT CGACCTCCAT CCCTCGACCA TCAGTCGCGC CACCGCCACC
CGGCTGATCG AGACGCCGCG CGGGCTGATC CCGCTGCGCG CCTTCTTCAG CCGGTCGGTC
TCCTCGGACG GGCCCGAGGC GCCGCAGTCG CAGGATGCGC TGATGGCGCT CGTGCGCGAC
ATCATCGCGC GCGAGGATCG CACGAAACCC TTCTCGGACG ATGCGATCGT GAAGCAGGCG
AAGCTCGCGG GCGCGGTTCT GGCCCGGCGC ACCGTCACCA AATATCGCGA GACGCTGGGG
ATCCCCTCGT CCTACGACCG CAAGCGCGCC GCCGCCGCGG CCTGA
 
Protein sequence
MDMMQFQRQT TQLAMTQRMQ ESLRILQMSN ADLADYLTAQ ALENPCLEVR VPEGTSVAPA 
LPSRGIQAGL DRDAFATVEG QPPSLLAHVE AQIDLAFFDP GDRRTALAFA EALEPSGWLG
QPVSEIAAAA EVEEEEALVI LERLQALEPA GLFARSLAEC LALQLEDLGL LTWELRTMLD
HLPLLAEGRI ADLARRCDCE PEHIRENLAL IRSLSPKPGE AFAADRTPIQ PPDVRVLRGP
EGWEVELTRA QLPRIRVSEA GDTGDRQADA WLARARSQAR WLERAVERRQ ATLLRTAVCL
VRHQADFLDQ GPRALRPLSM EEVALELDLH PSTISRATAT RLIETPRGLI PLRAFFSRSV
SSDGPEAPQS QDALMALVRD IIAREDRTKP FSDDAIVKQA KLAGAVLARR TVTKYRETLG
IPSSYDRKRA AAAA