Gene Nmar_1709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1709 
Symbol 
ID5773272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1567013 
End bp1568557 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content34% 
IMG OID641317363 
Producthypothetical protein 
Protein accessionYP_001583043 
Protein GI161529217 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0661252 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCTG GAAGAACAAT CTATGTGTTG ATCAAACTCT TGCCATCAAT TTTGGCTCTA 
CGAAAAGATA GAAAAAAATG GACACAACAA GAAAAAAATA CAATTGATTC TGAACAATTC
AGAAAAAATG CAAGAAAAGC TCTTGACACT TTTGTCTCTT TGGGACCTGT ATACATCAAA
CTAGGACAAT GGTTGTCATC TAGGGCAGAT ATTCTACCTC AACCATATCT TGAAGAACTA
TCTAAACTAC AAGACAGCGT TCCATCTGCA CCATTTGAAC AAGTAAAACC AATTATTGAA
AAAGATCTTG GACCAATTAA CGAAAAATTT GATGATATTG ATACAAACTC CATATCTGGC
GCATCATTAG GTCAAGTTTA TCGTGGTTCT ATTTCTGGAC AACAAATTGT CATCAAAGTA
AAAAGACCTG GCATTGAACA AGTTGTAAAA GAGGATCTAA AAGTTCTAAA GAAAATCCTG
CCACTTGCTT TGAGATTTGT TGATCCTAAT CTTAGATACT CTGCAAAAGC AATGCTTTCT
CAATTTATTG AAACCATTAG AGAAGAGATG GATTACACAA ACGAATCTGA GAATCTCAAA
AAAATCAAAC AAGATATGGA AAACAATGAC AAAGTTATTG TTCCCTCAGT TTATGATGAT
TATTCCTCAA AAAATGTCCT GACTATGGAA TATTTACCAG GAATTAAGGT TACAAATGTA
CAGGCACTTG ATGAGAAAGG CGTTGATAGA GAACAACTAG TAATTGATGT TCACAAGGTC
TTTTTCACCA TGCTTCTCAA ACATTCTCTT TTCCATGCAG ATCCTCACCC AGGAAATATC
TCGGTAACTG ATGATGGAAA ACTAATTCTC TATGATTATG GTATGGTGGG AAGAATAAAC
AATGAAACAC GATACAAACT CATACGACTA TACCTTGCAC TAGTTGAAAA AAATCCTCCC
AGAGTAGTCA ACGCTATGGC AGAATTGGGA ATGCTTACTC CTGGATATAA CAGACAAGTT
ATCGAAAAAG GAATTGAACT ATCTATTCGT GCAATGCATG GAAACAAACC TGATGAGATG
GAAGTTCAAA GTTTGATGGA GCTTGCAAAT CAAACAATGA GTAAGTTTCC ATTTGTTTTA
CCAAAGAATT TGGCTCTTTA CATGAGAATG GCATCAATAA TTGAAGGAAT TTACAAAACA
CATGATGTTG ATTTCAAATT TGTCAAAGTT CTAAGAAATA TTTTACAAGA AGAAAACATG
ATTCCAAAAG CATACGTTGA AGAGCTAAAG ATATCATTTG ACAAGTTTGC AAAATCAATT
GATTCAGCAT TACGATTAGG ACCTGAAATG GAAAAGTTCA TGGATGAAGC TAAATTTTCT
ATGCAAAAAA GAAAGCCAAT GGTATTTCTT GGTGGAAGTA TTTTTGCATC TGCAATTTTT
ATTGGTTCAG TATTACTATA TCAAACAAAT GAAATATTTG GCATTTCTGG AATGATTGCT
TCTGGTCTAA TAATGATCGG TTCTGCAATT TTTAGAAAAC ACTAG
 
Protein sequence
MSAGRTIYVL IKLLPSILAL RKDRKKWTQQ EKNTIDSEQF RKNARKALDT FVSLGPVYIK 
LGQWLSSRAD ILPQPYLEEL SKLQDSVPSA PFEQVKPIIE KDLGPINEKF DDIDTNSISG
ASLGQVYRGS ISGQQIVIKV KRPGIEQVVK EDLKVLKKIL PLALRFVDPN LRYSAKAMLS
QFIETIREEM DYTNESENLK KIKQDMENND KVIVPSVYDD YSSKNVLTME YLPGIKVTNV
QALDEKGVDR EQLVIDVHKV FFTMLLKHSL FHADPHPGNI SVTDDGKLIL YDYGMVGRIN
NETRYKLIRL YLALVEKNPP RVVNAMAELG MLTPGYNRQV IEKGIELSIR AMHGNKPDEM
EVQSLMELAN QTMSKFPFVL PKNLALYMRM ASIIEGIYKT HDVDFKFVKV LRNILQEENM
IPKAYVEELK ISFDKFAKSI DSALRLGPEM EKFMDEAKFS MQKRKPMVFL GGSIFASAIF
IGSVLLYQTN EIFGISGMIA SGLIMIGSAI FRKH