Gene Nmar_1628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1628 
Symbol 
ID5774668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1484428 
End bp1486599 
Gene Length2172 bp 
Protein Length723 aa 
Translation table11 
GC content35% 
IMG OID641317282 
Productregulatory protein ArsR 
Protein accessionYP_001582962 
Protein GI161529136 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1328] Oxygen-sensitive ribonucleoside-triphosphate reductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGT CTGATGAAGA ATTAGAAATG GATCTAGAAG GTGATCTAGA TGCAGACATG 
GATGACGATG ATCTAGATAC AGACTTAGAT GATGATTCAG ATTCACCAAA ACGCGGTGGA
ATTTTACAAT CTACATCAAA ACGTGTCAGA ATGATCTTTT CTGTCATGGC AAGTCCTAAC
AGAATCGATA TCCTGAGAAT TCTAAATTCA AAAGGCCCTC TAACTTATTC AGAATTAAAA
TCTCTGGCTG GATTCAAATC AAAAAAAGAG AGTGGAAAAT TTGCATACCA CTTGAGAAAA
TTACTTAGAC AATCACTTGT TGCATTAAAC AAATCTGAAA GACGTTACAC AATTACAAAT
CTTGGAAAAC TAGTTTTGAG CTTAGCAAGA CAAATTGAAG AAAGATCAAT TATTGAAAGT
GGAAAGATGT ATGTTAGAAC ATCACACGAA TCAATTGAGG AATTTAATTC TCACAAAATC
ATACAATCAT TGGTACGTGA AGGTAGCCTC CCACTTGAAT TAGCACAAAA AATTACTGAA
GAAGTTGAAA ATAGAATCTA CAAATATCAA ACTACCTATC TTACAGGCTC ACTTATCAGG
GAAATGGTAA ACTCTGTCTT GCTTGAGCAT GGTCATGAGG AATACCGCAA CAAGCTTGCA
CGCCTAGGCT TGCCTGTGTT TGACGTTCAA GAGATGATTA CTAATCTAGA CAATGTAGAC
AATGGCGCTG AAGGTCTCTT GTTTAAAACA GGACAAACAG TATTTGCTGA ACATCTTCTG
ACAAACATCT TACCAAAAGA TGTAGCTGAC TCTCATCTTT CTGGTGATCT ACACATTACA
AATCCTGGAA TCTGGTCAAT GATTCCTGAT ACAATGTTTG TTAACATTAA AGAATTAATT
GAAGATGGAA TTGATCTTGG TGGAAAGTAT CTTGATGTTT CAAGAATTCC TGCATCAAAA
CAACTTGATG AAATTACAAG TGCACTGTCT ATTGTAGTCT CTCTTCTTTC AAAAGAAGCT
TCACAAGAAA TTATACTTGA TGGATTTGTT TCACTATTCT CAAAACATTC AAAATCACTT
CCAGAACTAG AACAAAAATT AACTACAGCA TTTGCAACTG CATCTACAAC TTCCAAATAC
AATAAATCAA GTACTAACGT TTCAATTAGA TTACAATTAG GATCTGATAC AAAAATAATT
AATTCAATTA TTAATGCATA CAAAAACTAC ACAAAGCTTA CCCCAATTCC AAAAATTGGA
TTAATCATAG ATAATGAAAA AGGAAAGATC ACAGATGTTT CAGCAGCAGT ATCAGAAATA
ATTTCACTTG GTGGAAAAGT AATGTTTGCC AAAGGACAAA CATCAAGTCA TGGTATTACT
AATGGATCAA CTAAAAGTTC TGGTCCACTT TCAATAATGC TTGAATCAGT ATCAATCAAT
CTTCCAAGAT TAGCATTTGA ATCAAACAAA GATGAAACTT ACTTTAGAGC AAGATTGGCA
TTACTCATGA AACCAGCTTT ATCTTCCATG GCCTTGAGAA AGAAAGACAT TTCTGATTTG
ACTAGACGCG GATTGAATCC AATTTTGGCC AAAAATACTC AGTATATGCA AAAAAGTAAT
GTTTCACTTG TAATCAATTT GGTAGGCCTC AAAGAGGCAG TCTTTAACAT CCTTGGATTC
CAAGATAATA AAGAAGGTCG TGAAATCCTT CACAAGGTAA TTGAAACTGC AGTAGACGTT
GGCGCCAAAA AAGGCAAAGA ATTAGGTGAT CCTGTGGCTA TTTGCATGAC TGAAACTGAA
AGTGCAACCA GATTTGCTAC TCTTGATGGT GAGAAATATG GCAAAAATTC ATCCTTAAAC
TCTATGGAAG GTGATTCTTA TTCTGAGGGA ATCATTATTG ATGCCTCTGA AGTCTCTGAT
TATACTGCCA AGAGCGAGCC AATCTCTGAG TGTAACAAAC TCTCAAAGTT GCTAAACGGG
GGATTGTTTG TAACACTAAA TATCGGCAAA GATGCAAAGC CTGCAGAAAT TAAAAAAGCA
ATTGAGAAGA CATCAGAACT TACAACATCC TTCAAGCCTG TTCAAGACAT TGCAATCTGT
GGTGAGTGTG GTTTTAAAGA TGAACCGTTT GAAGATAAGT GTCCAAAGTG CAAGTCTCCA
TATGTTGTCT GA
 
Protein sequence
MKMSDEELEM DLEGDLDADM DDDDLDTDLD DDSDSPKRGG ILQSTSKRVR MIFSVMASPN 
RIDILRILNS KGPLTYSELK SLAGFKSKKE SGKFAYHLRK LLRQSLVALN KSERRYTITN
LGKLVLSLAR QIEERSIIES GKMYVRTSHE SIEEFNSHKI IQSLVREGSL PLELAQKITE
EVENRIYKYQ TTYLTGSLIR EMVNSVLLEH GHEEYRNKLA RLGLPVFDVQ EMITNLDNVD
NGAEGLLFKT GQTVFAEHLL TNILPKDVAD SHLSGDLHIT NPGIWSMIPD TMFVNIKELI
EDGIDLGGKY LDVSRIPASK QLDEITSALS IVVSLLSKEA SQEIILDGFV SLFSKHSKSL
PELEQKLTTA FATASTTSKY NKSSTNVSIR LQLGSDTKII NSIINAYKNY TKLTPIPKIG
LIIDNEKGKI TDVSAAVSEI ISLGGKVMFA KGQTSSHGIT NGSTKSSGPL SIMLESVSIN
LPRLAFESNK DETYFRARLA LLMKPALSSM ALRKKDISDL TRRGLNPILA KNTQYMQKSN
VSLVINLVGL KEAVFNILGF QDNKEGREIL HKVIETAVDV GAKKGKELGD PVAICMTETE
SATRFATLDG EKYGKNSSLN SMEGDSYSEG IIIDASEVSD YTAKSEPISE CNKLSKLLNG
GLFVTLNIGK DAKPAEIKKA IEKTSELTTS FKPVQDIAIC GECGFKDEPF EDKCPKCKSP
YVV