Gene Nmar_0347 details

Gene Information

Locus tag: Nmar_0347
Symbol:
ID: 5773966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 309504
End bp: 312851
Gene Length: 3348 bp
Protein Length: 1115 aa
Translation table: 11
GC content: 37%
IMG OID: 641315975
Product: DNA-directed RNA polymerase subunit B
Protein accession: YP_001581681
Protein GI: 161527855
COG category: [K] Transcription
COG ID: [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit
TIGRFAM ID: [TIGR03670] DNA-directed RNA polymerase subunit B
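
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(309504, 312851)
protein = expected_protein_length_aa(length)
print(length, protein)  # 3348 1115
```

Both results agree with the listed Gene Length (3348 bp) and Protein Length (1115 aa).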


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGATT CATCCACAAA ACGTTGGCCA GTAATTCAAG ATATTTTGAA AAGAGAGGGT 
ATTGCACGTC AGCACCTAAA CTCATTTGAT GAATTTTTAG AAAGAGGATT ACAAAGTATT
ATCAACGAGG TTGGTCAGAT CGATATTGAA AATGCTGAGT ACCCTTACAA AATTCAACTA
GGTAAAGTCA AATTACAACA ACCAAGAATG ATGGAACTTG ATGGTTCTAT TACTCATATC
ACTCCTGCAG AAGCTAGATT GAGAAACGTA TCTTATTCTG CACCTGTAAT GATGGAAGCA
AGTGTTGTTG AAGATGGAAA AATCCTTGAA TCAAGATTTG TCCACATTGG TGATGTTCCC
GTTATGGCAA AATCAAACGC TTGTATCTTA CACAATTTTT CTACTCAAAA ACTAGTTGAA
CATGGTGAGG ATCCAAACGA TCCTGGTGGT TACTTTATCA TTAATGGTTC TGAGAGAGTA
ATTGTTGGAT TAGAAGATCT TTCTTACAAC AAAATCATTG TTGATAGAGA AACAGTAGGT
GGAAACATTG TACACAAAGC CAAAGTTTAT TCTTCAATTG TTGGCTATCG TGCAAAATTA
GAACTTGTCA TGAAAAATGA TGGACTAATT GTTGCCAGAA TTCCTGGTTC TCCAGTTGAT
ATTCCAGTAG TTACTTTGAT GAGAGCTCTA GGTTTAGAAT CTGATAGAGA GATTGCAGCA
GCAGTTTCAT TGGTAGATGA ACTCCAAGAT GAACTAGAAG CATCTTTTGA AAAAGCAGGA
GATGTTCCAA CTGCAAAAGA TGCCATTGTT TACATCAGTA AGAGAATTGC CCCTGGAATG
CTTGAAGAAT TTCAGATTAA ACGTGCTGAG ACTTTACTTG ATTGGGGTTT GTTGCCTCAC
TTGGGCAAAC ACCCTGAAAA TAGAAAAGAA AAAGCGCAAT TCTTGGGAGA AGCAGCTTGT
AAATTATTAG AACTAAAACT TGACTGGATT AGACCTGATG ACAAAGACCA CTATGGAAAC
AAAGTCATTA AATTTGCAGG ACAGATGCTT GCAGACTTGT TTAGAACTGC ATTTAGAAAT
CTTGTCCGTG ATATGAAATA TCAATTAGAA CGTTCTGGAC AAAAACGTGG AATTAATGCA
GTAGCTGCTG CAATTCGTCC AGGAATTATT ACTGATAAAC TAAACAACGC AATTGCCACT
GGAAACTGGG GTCGAGGCAG AGTAGGTGTT ACTCAATTAC TTGATAGAAC AAACTATCTT
TCTACAATTA GTCACCTTAG AAGAATTCAG TCTCCACTAA GTAGAACTCA GCCAAACTTT
GAAGCAAGAG ACCTGCATGC AACACACTTT GGAAGAATTT GTCCAAGTGA AACTCCTGAA
GGCTCTAACT GTGGTCTTGT AAAGAATCTT GCATTATCTG GAATAATTTC TGTAAACGTA
CCATCTGAAG AAATTGTAGA GAAACTCTAT GATCTTGGAA CTGTCCACTT CTTTGATGCA
AAAGAAGATT TGAAGAAAGA CGGAACTAGA ATCTTTGTTG ATGGTAGACT AATTGGATAT
TACAAAGATG GTGAACAACT AGCAGAATCT CTTAGAGACT TAAGAAGAAA CTCAAAGATT
CATCCACATG TTGGTGTATC CTTCCACAAA TCTGAAATTG AAGGTTCAAC CCGAAGACTT
TACGTAAACT GTAATGCAGG ACGTGTTTTA AGACCACTAA TCATCATTAA AGATAACAAA
CCATTACTTA CTGCTGATTT ACTAGACAAA ATTTCAAAGA AACTCATCTC GTGGACTGAT
CTCTTGAGAA TGGGTGTTTT GGAAATGATT GATGCAAATG AAGAAGAGAA CTGTTATGTC
ACACTAGATG AAAAGGATAC AAAGAAACAC ACTCACCTTG AAGTCTTCCC ACCAGCAATC
CTTGGTGCAG GTGCTTCAAT CATTCCATAT CCTGAACACA ACCAATCTCC AAGAAACACA
TACGAGTCTG CAATGGCAAA ACAGAGTTTA GGATTCTCAA CCCCTATGAT GAATACAAGT
ACATATGTTA GACAACACTT TATGCTATAT CCTCAGGTTC CAATAGTTAA CACAAAAGCA
ATGAAACTTT TGGGATTAGA AGATAGACCT GCAGGTCAGA ACTGTGTGGT AGCAGTGTTA
CCATTTGATG GTTACAACAT TGAGGATGCC ATCGTTCTTA GTAAAGCATC AGTTGATAGA
GGATTAGGAA GAACATTCTT CTTTAGAATC TATGATGCTG AAGCAAAACA ATATCCTGGT
GGAATGCGTG ACGCATTTGA AATTCCAAAT GCTGAAGATA ACATCAGAGG TTACAAGGGA
GAACGTGCAT ACAGACTCTT AGAAGAAGAT GGTGTTGTTG CAACAGAAGC TCCAGTAAAA
GGTGGAGATA TTTTAATTGG AAAAACTAGT CCTCCAAGAT TTATGGAAGA ATACCGAGAG
TTTGAATCAT CTGGTCCTTA CAGAAGAGAT ACATCAATTG GTGTTAGACC TTCAGAAACT
GGTGTTGTTG ATACTGTAGT TATGACCCAA TCAAATGAAG GTGGAAAGAT GTACAAGATT
AGAGCAAGAG ATATGAGAAT TCCCGAAATT GGTGACAAGT TTGCATCAAG ACATGGACAA
AAAGGTGTCT TAGGAATTTT AGCTAAAGCA GAAGATTTGC CATATACTGC AAGTGGAATG
TCTCCTGATG TTTTGATTAA TCCTCATGCA TTCCCATCTA GAATGACTGT TGGTATGATG
ATGGAATCCA TATGTGGAAA ATCTGCCGCA TTACGTGGAA AGAGATTTGA TGGTTCTGCA
TTTGTTGGAG AAAAAATGGA TGAAGTAAGA GAAGTAATGG ATGCACATGG CTTTGAATAT
TCTGGTAAAG AAATAATGTA TGATGGAAGA ACTGGAAAAT CATTCCCAGT CGAAGTTTTC
ATCGGAGTTG TATATTATCA AAAATTACAC CACATGGTTG CAGACAAGAT TCATGCAAGA
GCACGTGGAC AAGTTCAAAT GTTAACAAAA CAACCAACAG AAGGAAGAGC AAGAGGTGGT
GGCCTTAGAT TCGGTGAAAT GGAGAGAGAT TGTCTTATTG CTTATGGCGC TTCTATGATT
CTTAAAGACA GATTACTTGA CGAGTCTGAT AAATCTGATA TTTTTGTATG TGAGAGGTGT
GGTCTTGTTG CTTATCATGA TGTTAAACAA AGAAAATATG TTTGCAGAGT TTGTGGTGAT
AAAGCCAAAG TCTCATCAGT TTCAGTTGCT TATGCATTCA AACTACTCTT ACAAGAGATG
CAGAGCCTCA ACGTCGCACC AAGATTGTTA ATCAAGGAGA AACTATAA
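
The sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how a GC percentage such as the one listed in the record could be recomputed from such a block. The helper names are hypothetical, and the exact rounding used by the source database is not known.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(block: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short demonstration;
# pass the full block to compare against the GC content in the record.
first_line = "ATGGCAGATT CATCCACAAA ACGTTGGCCA GTAATTCAAG ATATTTTGAA AAGAGAGGGT"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```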
 
Protein sequence
MADSSTKRWP VIQDILKREG IARQHLNSFD EFLERGLQSI INEVGQIDIE NAEYPYKIQL 
GKVKLQQPRM MELDGSITHI TPAEARLRNV SYSAPVMMEA SVVEDGKILE SRFVHIGDVP
VMAKSNACIL HNFSTQKLVE HGEDPNDPGG YFIINGSERV IVGLEDLSYN KIIVDRETVG
GNIVHKAKVY SSIVGYRAKL ELVMKNDGLI VARIPGSPVD IPVVTLMRAL GLESDREIAA
AVSLVDELQD ELEASFEKAG DVPTAKDAIV YISKRIAPGM LEEFQIKRAE TLLDWGLLPH
LGKHPENRKE KAQFLGEAAC KLLELKLDWI RPDDKDHYGN KVIKFAGQML ADLFRTAFRN
LVRDMKYQLE RSGQKRGINA VAAAIRPGII TDKLNNAIAT GNWGRGRVGV TQLLDRTNYL
STISHLRRIQ SPLSRTQPNF EARDLHATHF GRICPSETPE GSNCGLVKNL ALSGIISVNV
PSEEIVEKLY DLGTVHFFDA KEDLKKDGTR IFVDGRLIGY YKDGEQLAES LRDLRRNSKI
HPHVGVSFHK SEIEGSTRRL YVNCNAGRVL RPLIIIKDNK PLLTADLLDK ISKKLISWTD
LLRMGVLEMI DANEEENCYV TLDEKDTKKH THLEVFPPAI LGAGASIIPY PEHNQSPRNT
YESAMAKQSL GFSTPMMNTS TYVRQHFMLY PQVPIVNTKA MKLLGLEDRP AGQNCVVAVL
PFDGYNIEDA IVLSKASVDR GLGRTFFFRI YDAEAKQYPG GMRDAFEIPN AEDNIRGYKG
ERAYRLLEED GVVATEAPVK GGDILIGKTS PPRFMEEYRE FESSGPYRRD TSIGVRPSET
GVVDTVVMTQ SNEGGKMYKI RARDMRIPEI GDKFASRHGQ KGVLGILAKA EDLPYTASGM
SPDVLINPHA FPSRMTVGMM MESICGKSAA LRGKRFDGSA FVGEKMDEVR EVMDAHGFEY
SGKEIMYDGR TGKSFPVEVF IGVVYYQKLH HMVADKIHAR ARGQVQMLTK QPTEGRARGG
GLRFGEMERD CLIAYGASMI LKDRLLDESD KSDIFVCERC GLVAYHDVKQ RKYVCRVCGD
KAKVSSVSVA YAFKLLLQEM QSLNVAPRLL IKEKL