Gene Nmul_A0084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0084 
Symbol 
ID3785809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp87281 
End bp88747 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content55% 
IMG OID637810154 
Productsigma-54 (RpoN) 
Protein accessionYP_410785 
Protein GI82701219 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAA CTCTTCAAGT CAAGCTGTCG CAACATCTGG CGCTTACGCC GCAATTGCAG 
CAGTCCATCC GGTTATTGCA ACTCTCCACG ATAGAGTTGA ACCAGGAAAT AGAGCGCATC
GTACAGGAGA ATCCCCTTCT GGAACTTGAT GACAGCCTCG GCAACGACAG TACCGATTAT
CATAGCCTCA TACCGGAGGA GTTGGCCGCA CCGTTGCCAA ACGAGCTGGA GGTTGCCGAG
GAAAGCCTCC CTATTCTCTC CGTCGAAGCA GAAACCGATG TGAAGCAACC GCTTGACGAG
CGGGAGTGGC CGCCGGATGA CAGCGTTTTT CGGCCCTACC GGGACGATGA GGACGAACGT
GATATTCCAC AGCAAGCGGT GGAACCTCCG AATCTCCGTG CCCACCTCAA CTCGCAACTC
AGCCTGAGCC AGATCTCCCA GCGGGATAGA AAGATCACCG GATTACTGAT CGACAGCCTG
GATGATGATG GCTATCTTGT CCAGGACCTG GAAGAATTGG TCGATCTGTT GCCCGCCGAG
CTGTCAATAG ATATCGACGA CCTGCACATT GCCCTCGAAC ATTTGCAGCA CTTGGATCCC
CCCGGCATCG GTGCGCGCAA CTTGAGAGAA TGCCTCGTCA TGCAACTGCA GGCGCTCCCA
CCCGATACGC CCTATCTGGA GCAGGCATTG GCGCTGGTGA ACAACCATCT TGAAAGTCTG
GCATCCCGGG ACTTTGGCGC CATCAAGCGG GTGCTGCACT GCAACGACGA CTGTCTTCGC
TCTGTCCAGC AGATGATAAC GCGACTGAAT CCGAGACCGG CCACGGCCTT CAGCTCCACA
GTCGCCTGCT ACATCGTGCC CGACGTCATC GTGACCAAAG TCGGCGGTTC CTGGGTAGCC
AGCCTTAATC CGGAAGCCAT GCCGCGTCTC AGGATCAACC GGCTCTATGC GGAGATCCTG
AAAGGATGCA ATGACGACTC TACCCGTCGG CTGATCAGCC AATTGCAGGA AGCAAAATGG
CTGGTAAAGA ACGTACAACA GCGTTTCAAC ACCATCCTCA AGGCATCGGC AGCCATTGTG
GAACGCCAGC AGCAGTTTTT CGAGCATGGA GCAGTTGCGA TGCGTCCGAT GATACTGCGG
GAAATCGCCG ACGTGCTGAA TCTGCACGAG TCCACCATTT CGCGGGTTAC CACGCAGAAA
TTCATGCGCA CGCCGCGCGG CATTTTCGAA TTGAAGTATT TTTTTGGGAG CCACGTATCA
ACGGATAGTG GTGGCGCTTG CTCCGCCACT GCCATTCGCG CGCTGATCAA GCAAATGATC
AGTGAGGAAA ACTCGAGGAA GCCGCTCAGC GACAGTCAGA TCTCGGAGGT CCTGGGGCAG
CAGGGTATCA TGGTTGCTCG CCGTACCGTA GCAAAATATC GGGAGTCCTT GCAAATTCCT
TCCGTCAATC TTCGGAAATC GTTTTAA
 
Protein sequence
MKPTLQVKLS QHLALTPQLQ QSIRLLQLST IELNQEIERI VQENPLLELD DSLGNDSTDY 
HSLIPEELAA PLPNELEVAE ESLPILSVEA ETDVKQPLDE REWPPDDSVF RPYRDDEDER
DIPQQAVEPP NLRAHLNSQL SLSQISQRDR KITGLLIDSL DDDGYLVQDL EELVDLLPAE
LSIDIDDLHI ALEHLQHLDP PGIGARNLRE CLVMQLQALP PDTPYLEQAL ALVNNHLESL
ASRDFGAIKR VLHCNDDCLR SVQQMITRLN PRPATAFSST VACYIVPDVI VTKVGGSWVA
SLNPEAMPRL RINRLYAEIL KGCNDDSTRR LISQLQEAKW LVKNVQQRFN TILKASAAIV
ERQQQFFEHG AVAMRPMILR EIADVLNLHE STISRVTTQK FMRTPRGIFE LKYFFGSHVS
TDSGGACSAT AIRALIKQMI SEENSRKPLS DSQISEVLGQ QGIMVARRTV AKYRESLQIP
SVNLRKSF