Gene Nther_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2020 
Symbol 
ID6315875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2130614 
End bp2132053 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content34% 
IMG OID642644408 
ProductRNA polymerase, sigma 54 subunit, RpoN 
Protein accessionYP_001918175 
Protein GI188586630 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAG GCTTTGATTT GAATCTGGAG CAAAAGCAAC AGTTGGTTAT TACCCCGGAA 
CTTAGAAAAG CTATCAAACT TTTGCAGTTG TCCAGTATAG AATTAGATAA TTATATTGAA
AAAGAATTGG TTGAAAACCC CATGTTGGAA ATTTCCTCAA CTGAAGAAAG TGAAAGTGGG
GATGATAACG GAAAAGAAGA CAGTGTGGAA AAATTAGAGA ACAGTCAGGA GTCAGCACAA
AGTGAAGACC GCTTTGATAT TGATTGGCAA AAATATTTTG AAGATTCCAG TGATGTAGGT
AAATATAATA ACTTGCCTAA AGAGAATTAT GACGAGGGTA AGGATGGTTT CGAAACCTTC
ACCCCTTCTA CAATAAGCTT AACAGAACAT CTTTACTTCC AATTGTCTCT TTACTCAGAT
CAAGAAGAAC ACATTAAACC ACTGGCTGAA TTTTTGATAG GAAATTTGGA TGCCAATGGA
TATTTAAATG GGACTTTACA AGAAATTGCA GAATTTTTAG ATGTGGAAGA AAAGGAGTTA
GAAAAGGCCT TAGAATTGGT GCAAAGCCTT GATCCTCCTG GTGTAGGAGC AAGGAGCCTT
AAAGAATGCT TGCTGTTACA AGTGGAAACT GATTCTCATA GCCCCGATTC TGCTTATGAA
CTAATTGAAA ACTTTCTATC GGAAATCGCG GAAAATCGCC TTGACAAAAT TGCAAAAGAG
TTAAAATTAC CTATTAAAGA GGTGCAAGAG ATAGTTGATT ATATCAGGTC TTTAACTCCT
AAGCCTGCAA GTCCTTTCTC TGAAGAAGGC CTTCCCTCTT ATATAACACC TGATATTGTT
ATTAAAAGAG TTGAAGATGA ATATGAAATT ATTTTAAATG ATTCCATGAG TCCTCGTCTT
AAAATTAATT CAAAATATAG ACAATTATTA AAAACAGAAA AGGGTTCGGG AGTAGCTAAA
TTTCTTAATT CCAGGTTAGA CTCGGCTATG TGGTTAATTA AAAGTATAGA GCAAAGGCGA
ATCACTCTTT ATAATATTAT GCAAAAACTT GTAGAAATGC AGAGGCCTTT CTTAGATAAT
GGAGTGAGAT ATCTCAAACC ATTAACTCTA AAAGAAGTAG CTGACGAAAT AGATGTGCAT
GAATCTACCG TAAGTAGAGC CACAGCAAAT AAATATGTGC AAACTCCCCA GGGAGTATAT
CCACTAAGAT TTTTCTTTTC CAGTAAACTA GATAATAATC AAGATGATTA TAACTCTTCT
ACTAGTATTA AACAAAAGAT CAAAGAGTTA GTTGAAGAAG AGGATAAGAA AAAACCTCTT
AGTGATCAAA AAATTGCTGA AATTTTACAG GAATCATCTA TAAATATATC AAGAAGAACT
GTAGCAAAAT ATAGAAAAGA ACTTAATTTA CCTTCTTCTT CAAAGAGGAA AAGGTATTAA
 
Protein sequence
MKLGFDLNLE QKQQLVITPE LRKAIKLLQL SSIELDNYIE KELVENPMLE ISSTEESESG 
DDNGKEDSVE KLENSQESAQ SEDRFDIDWQ KYFEDSSDVG KYNNLPKENY DEGKDGFETF
TPSTISLTEH LYFQLSLYSD QEEHIKPLAE FLIGNLDANG YLNGTLQEIA EFLDVEEKEL
EKALELVQSL DPPGVGARSL KECLLLQVET DSHSPDSAYE LIENFLSEIA ENRLDKIAKE
LKLPIKEVQE IVDYIRSLTP KPASPFSEEG LPSYITPDIV IKRVEDEYEI ILNDSMSPRL
KINSKYRQLL KTEKGSGVAK FLNSRLDSAM WLIKSIEQRR ITLYNIMQKL VEMQRPFLDN
GVRYLKPLTL KEVADEIDVH ESTVSRATAN KYVQTPQGVY PLRFFFSSKL DNNQDDYNSS
TSIKQKIKEL VEEEDKKKPL SDQKIAEILQ ESSINISRRT VAKYRKELNL PSSSKRKRY