Gene Nmar_0944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0944 
Symbol 
ID5773424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp821861 
End bp823369 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content34% 
IMG OID641316583 
ProductDEAD/DEAH box helicase domain-containing protein 
Protein accessionYP_001582278 
Protein GI161528452 
COG category[L] Replication, recombination and repair 
COG ID[COG1111] ERCC4-like helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.219014 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTGAAT TTATTGAAAA AAGATATGTC AAAAAAGACT CTATTGAAAA GCGAGATTAC 
CAAGTAAACC TTGCAAATCA GGCTATCTCT GAAAATTGTA TTGTAGTTTT GCCAACTGGT
CTTGGAAAAA CTGCAATTGC ATTACAAGTT ATTGCGGAAT ATCTCTCCAA AGGTACTGGC
GGTGCATTGT TTTTAGCACC TACAAGAGTA CTTGTAAACC AACATTATGA ATTTCTAAAA
GAGAATCTAA CATTAGATGA CATTTCTCTA ATAACTGGTG AGGATACTAT TCAAAAAAGA
ACCAAACTTT GGAATAACAG TGTAATTTGC GCCACTCCTG AAATTGCAAA AAATGATTTG
GATAGGGGAA TTGTTTCGTC TAACCAATTT AATCTCATAA TATTTGATGA AGTACATAGA
ACAGTTGGAG ATTATGCATA TTCTGGAATC GCAGAACGTT TTGTTAATTC TGATGGAAGA
ATTGTTGGAA TGACTGCAAC ATTGCCTAGT GAAAAAGACA AGGCAACTGA AATTTTAACT
AAATTAAAAA TTGCAAGTGT TGCTGAAAGA ACAGAAAATA GTCCTGATGT CAAACCATAT
ACTCAGGAGA CCAACACTGA ATGGATAAAT GTTGAACTCC CACCAGAACT AAAAACAATT
CAAACATTAT TGAAATTAGC CCTTGATCAA AGATATCAAA CATTACGTGA TAATGGAATC
AAACTAGCTG AACAACAATC ACTTTCTGCT CTATTAAGAA TTAGACAATT TGTATTAAAT
CAGAATAGAC GTTCTGCAAA ACCACTATTC ACTGCAATTA GAATTCACTA TGCACTAAAC
ATCTTAGAGG CACATGGAAT TACGCCGTTT TTGAAGTTCT GTGAGCGTGC CAAGGCAAAA
AAAGGTGCAG GAGTAAAGGA ACTCTTTGAG GTTGACCCAA ATTTTACTCG TGCAGTACAT
CTTGCAAAAG AAGCTCAATC TAGGGGAATT GAACATTCTA AGATTCCTAA ACTAAAAGAT
ATCATAGAAT CTGTACCTGG AAAGGCTTTG ATTTTTACAA GCTATCGTGA TTCTGTTGAT
TTAATTCATA GTAAATTGAC TGAACTTGGA GTTTCTGCGG GAATTCTGAT TGGTAAAGCA
GGTGAAACCG GCCTAAAACA AAAAAAGCAA ATTGAAATTG TACAAAAGTT TCGTGATGGT
ATCTTTGATG TTTTAATTGC AACTCGTGTT GGTGAAGAAG GGTTGGATAT TGCTGAAGTA
AACCAAGTTA TCTTTTATGA TAATGTTCCT AGCTCTGTTA GATTTATTCA AAGACGAGGT
AGAACTGGAA GAAAAGATAC TGGAAAACTA GTTGTTCTAA TTGCAAAAAA TACTATTGAT
GAGACATACT ATTGGATTGG TAAACGAAAA ATGTCTGCAT CAAAAGCAAT GGGTGATAAA
ATGACTAAGG TATTGGAAAA AAATCAAGAA GTTGTTTCTA AAAAGACAGG ATTAGATGCG
TTTATCTAA
 
Protein sequence
MTEFIEKRYV KKDSIEKRDY QVNLANQAIS ENCIVVLPTG LGKTAIALQV IAEYLSKGTG 
GALFLAPTRV LVNQHYEFLK ENLTLDDISL ITGEDTIQKR TKLWNNSVIC ATPEIAKNDL
DRGIVSSNQF NLIIFDEVHR TVGDYAYSGI AERFVNSDGR IVGMTATLPS EKDKATEILT
KLKIASVAER TENSPDVKPY TQETNTEWIN VELPPELKTI QTLLKLALDQ RYQTLRDNGI
KLAEQQSLSA LLRIRQFVLN QNRRSAKPLF TAIRIHYALN ILEAHGITPF LKFCERAKAK
KGAGVKELFE VDPNFTRAVH LAKEAQSRGI EHSKIPKLKD IIESVPGKAL IFTSYRDSVD
LIHSKLTELG VSAGILIGKA GETGLKQKKQ IEIVQKFRDG IFDVLIATRV GEEGLDIAEV
NQVIFYDNVP SSVRFIQRRG RTGRKDTGKL VVLIAKNTID ETYYWIGKRK MSASKAMGDK
MTKVLEKNQE VVSKKTGLDA FI