Gene Rsph17025_1657 details


Gene Information

Locus tag: Rsph17025_1657
Symbol:
ID: 5082735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 1697725
End bp: 1699050
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 72%
IMG OID: 640483215
Product: RNA polymerase, sigma 54 subunit, RpoN
Protein accession: YP_001167855
Protein GI: 146277696
COG category: [K] Transcription
COG ID: [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
TIGRFAM ID: [TIGR02395] RNA polymerase sigma-54 factor
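
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a single terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the stop codon is not translated
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1697725, 1699050)
print(gene_len, protein_len)  # 1326 441
```

Under those assumptions the result agrees with the Gene Length (1326 bp) and Protein Length (441 aa) fields.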


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.108649
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.937056
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCTTT ATACCGCTCA GAGCTTCGCG CAGCGTCAGT CGCTCGTCGT GACCGCGCAG 
TTGCAGCAGG CGATCTGCCT GCTCCAGATG CCCAACGCGG AACTGTCCTC GTTCATCGAG
ACACAGTCCG AGGAAAACCC GTTCCTCGAG CTGCGGCTGC CCCCGGCGCC CGCGCCGTCC
TCGGCGTTGC CCCGCAGCCA GGCGGCGGCG GGCGACGACT GGGACCGTGT GGCGGGCCTT
GCCGCCGATC CGGGGCCGTC GCTCTATGTC CATGTCACCG CCGAGATCGC CCGGCTGGGT
CTCACGGCCG AGGAGAGCGC CGCCGCCTCG GTCTTTCTGG ATGCGCTCGA ACCCTGGGGA
TGGCTTGGCC AGCCGCTCGA GCTGCTGGCC CCGCGCGCGG GTCTCTCGCT CGAGGCGGCC
GAGCGGCTGC TGGCGAAGCT GCACAGGATC GAACCGGCCG GTCTCTTCGC CCGCTCGCTC
GCCGAGTGCC TGCGGCTGCA GGCCAGCGAG CAGGGGCTTC TGACGCCGCT CTTTGCCGCC
GTGCTCGACC ATCTGTCGCT GCTGGCCGCC GCCGACCTGC GCGGGCTCTG CCGCGCCTGC
GGCTGCGGCA TGGAGGAGCT GAAGGCCGTC CTGCGCCAGC TGCGCGGGCT CAATCCCAAG
CCGGGTGCGC TGTTCGATGC CGCCCCCTCG CCGCAGCGCC CGCCGGATCT GCTGGTCAGC
CCCGGCCCCG ATGGCTGGCG CGTGGATCTC AACCGCTCGA CACTGCCCAC GGTTGTGGTG
CGCGCCGACA CGGCGCAGGA CTTCGCCGGA AGCGCCGCGC CCTATGTCGG CGAGCGGCTC
TCGGTGGCCC GCTGGCTGGC CCGCGCGGTC GAGCACCGGA ACCAGACGAC GCTCAAGATC
GGTGCGGAAG TGGTGCGCCG GCAGCGCGGC TTCCTCGAGG AGGGCCCGGC GCGGATGGAG
CCGATGACGC TGCGCGAGGT GGCCGATGCG GTGGGCGTGC ACGAAAGCAC GGTGAGCCGC
GTGAGCTCCG GTCTGATGAT CGCCACGCCG CAGGGCACCT TTCCGCTGAA GTCGTTTTTC
ACGGCCGCTC TCTCGGCGCG CGAGGGGGAC ACGGCCGGTT CGGCCGCGGC CGTCCGCCAT
CGCGTGCGCC AGCTGGTCCA GGCGGAGTCG CCGGATGATC CCCTGAGCGA CGATGCCATC
GCCCGCATCA TCTCGGACGA GGGCGTGACG CTGGCCCGCC GCACGGTGGC CAAATACCGC
GAGCAGCTCA ACATTCCGTC CTCGGTCCAG CGCCGGCGGC AGGCGCTGGT GACGGGCGCG
CTCTAG
 
Protein sequence
MQLYTAQSFA QRQSLVVTAQ LQQAICLLQM PNAELSSFIE TQSEENPFLE LRLPPAPAPS 
SALPRSQAAA GDDWDRVAGL AADPGPSLYV HVTAEIARLG LTAEESAAAS VFLDALEPWG
WLGQPLELLA PRAGLSLEAA ERLLAKLHRI EPAGLFARSL AECLRLQASE QGLLTPLFAA
VLDHLSLLAA ADLRGLCRAC GCGMEELKAV LRQLRGLNPK PGALFDAAPS PQRPPDLLVS
PGPDGWRVDL NRSTLPTVVV RADTAQDFAG SAAPYVGERL SVARWLARAV EHRNQTTLKI
GAEVVRRQRG FLEEGPARME PMTLREVADA VGVHESTVSR VSSGLMIATP QGTFPLKSFF
TAALSAREGD TAGSAAAVRH RVRQLVQAES PDDPLSDDAI ARIISDEGVT LARRTVAKYR
EQLNIPSSVQ RRRQALVTGA L
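
The gene and protein sequences above can be cross-checked with a small sketch that strips the 10-base block spacing, computes GC content, and translates codons. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is built by hand rather than taken from a library, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, as laid out in the record.
first_row = "ATGCAGCTTT ATACCGCTCA GAGCTTCGCG CAGCGTCAGT CGCTCGTCGT GACCGCGCAG"
print(translate(first_row))  # MQLYTAQSFAQRQSLVVTAQ
```

The translated first row matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein Length and GC content fields.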