Gene Nmar_0888 details

Gene Information

Locus tag: Nmar_0888
Symbol:
ID: 5774377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 778828
End bp: 780228
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 33%
IMG OID: 641316527
Product: helicase c2
Protein accession: YP_001582222
Protein GI: 161528396
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1199] Rad3-related DNA helicases
TIGRFAM ID:
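
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record. It assumes that the start and end values are 1-based inclusive coordinates (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 778828
END_BP = 780228


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1401
print(protein_length(length_bp))  # 466
```

Both results agree with the reported gene length (1401 bp) and protein length (466 aa).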


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCTG AAAAAGTAGA CAATTCTAGA AGAGCTATGA GATGGGGACT AACCTGTGAA 
AAAGGAGAAT GTATTGAAAG AACCAGTAAA AACGGTAAAG AAGTAATTGA AACCTGCAAA
TTCAAACCTA CAATAAAACA AGTTGAAGAA AACACTCAGG ATTCCCAGTC ATGTCATTAT
TATCTTCAAA AATATGAGGC ACTAGTCTCC AAACACTCTC TTTGGAATTA TCATGCATTT
TTTCAAATTA TGAAATTTAA CAAGAAACTG TTTGAAGATT ATTTGGATAG AAAGGTCTCT
GTTTTTGATG AGGCTCATAA AATTGAAGAT CAAATTATTC AGTTTGTAGG ATTTGATATC
TTTGCTGGAC AAGTTGATGA ATGTAATCTT AATGCAGACA AGTATGATTT TGCTGATTTA
GATTCTATGA TATCTTTAAC TGACGACATT GCATATTCTT ATGCAAAAAA AATCAAAGAT
ATCAAAGAAA GTCCTGGTTT TGAAAATGAT CCTGATTATG AATTGATATC AAGATTAGAA
AGAAGATATG ATAGAGCGGC ACAGGCAAAA ATCGACATTT CTTCAGACAA AAATAATTTT
GTTGTAAATG ATCCTGTACG TGATTTGAAT GGAAATTTTC GAACTATTTC TGTAAAACCA
ATTGATGTAT CAAAATTTGC AAACGAGTTT TTTGAAACAG AATATCAAAT TTTCATGTCT
GCTACAATTG ACAAACAAAG TTTTTGTGAA AATATGGGTT TGAAAAAAGA TGAAGTTGCA
TTTGTTGACA CTCCAAAATC TCCATTCCCA ATTGAAAACA GAAATGTTGA TCTGTTGAAC
ATTAGGAGAT TAAGTTATGG TTCAACTGAA GAAGATGAGT TAGAAGTAAT CAAAACAATT
GATCGAATAC TAGATGAACA TTCTACTGAA CGTGGGTTAA TTTTGACATC ATCTATTCCA
AGATGTCAAA AAATTCTAAG ATACCTTTCT CCAAAAAACA CTCGTAGAAT CAGAATATGT
CACAGTAAAA ACAAGGATGG TAAAACTCAA GATGAAGTAA TATCAGAACA CGCCTCTGAT
CCTACTGGGG TTTTACTCTC ATCTTCTCTT TGGGAAGGAG TTGATCTAAA AGATGACCTT
TCTCGTTTTC AGATTATTGC AAAGGTTCCC TACCCTAACT ATAAGGAAAA AAGAATCAAG
GCCAAGATGG ACAAATTTCC CCTTTGGTAT ACCTCTCAAA CATTGACAAA ACTACTACAG
GGATTTGGAC GTTCTATTAG GAGTGAGGAT GATTGGGCTA GGACCTATGT TCTTGATGCT
GCTGCAAATA ATGTGTTTTT CAAGGCACAA CAGATGATTC CAAAATCATA CTATGATGTT
CTAGGCATAG AAGACCTGTA G
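
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation such as the GC content reported in the Gene Information section. The sketch below is illustrative only. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs on just the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGGCTTCTG AAAAAGTAGA")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 35.0
```

Passing the complete listing through `clean_sequence` and then `gc_percent` gives a figure that can be compared with the GC content stated in the record.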
 
Protein sequence
MASEKVDNSR RAMRWGLTCE KGECIERTSK NGKEVIETCK FKPTIKQVEE NTQDSQSCHY 
YLQKYEALVS KHSLWNYHAF FQIMKFNKKL FEDYLDRKVS VFDEAHKIED QIIQFVGFDI
FAGQVDECNL NADKYDFADL DSMISLTDDI AYSYAKKIKD IKESPGFEND PDYELISRLE
RRYDRAAQAK IDISSDKNNF VVNDPVRDLN GNFRTISVKP IDVSKFANEF FETEYQIFMS
ATIDKQSFCE NMGLKKDEVA FVDTPKSPFP IENRNVDLLN IRRLSYGSTE EDELEVIKTI
DRILDEHSTE RGLILTSSIP RCQKILRYLS PKNTRRIRIC HSKNKDGKTQ DEVISEHASD
PTGVLLSSSL WEGVDLKDDL SRFQIIAKVP YPNYKEKRIK AKMDKFPLWY TSQTLTKLLQ
GFGRSIRSED DWARTYVLDA AANNVFFKAQ QMIPKSYYDV LGIEDL
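
The protein sequence corresponds to the gene sequence translated with translation table 11. As a hedged sketch of that relationship, the code below builds a codon table from the standard compact encoding and translates blocked DNA text. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, which this sketch does not model. The function name `translate` is hypothetical, and translation simply stops at the first stop codon.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate blocked DNA text in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and final 21 bases of the gene sequence in this record.
print(translate("ATGGCTTCTG AAAAAGTAGA CAATTCTAGA"))  # MASEKVDNSR
print(translate("CTAGGCATAG AAGACCTGTA G"))           # LGIEDL
```

The two fragments reproduce the first ten and last six residues of the protein sequence listed above.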