Gene Nmar_0089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0089 
Symbol 
ID5773003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp77330 
End bp78730 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content33% 
IMG OID641315707 
Productnucleic acid binding OB-fold tRNA/helicase-type 
Protein accessionYP_001581427 
Protein GI161527601 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.235418 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGGAAT TTGATAGTCT AGTTGAAAAG TTAATTGAAC AAAAACCAGA GTTAACTAAA 
GAAATTATTG AAGAGCAAAT TAAATTAAAA AAAGAGAAAA TTGGTGCAGG GTATCTAACT
GATCAAGGAG CTTTATTCTT GATTGCATCA GATTATGGGG TTACATTATC AGGGCCACTA
AAAGTAGAAA TGAGTTTGAA AGATCTCTAT GCAGGAGCAA AAGAAATTTC ATTAGAAACT
AGAGTATTGA ATTTATCACC TGCAAAACAA TTTTCAAGAA AAGATGGTTC TCCATTTTAT
CTCAGAACTA TGACAGTATA TGATGATGCA AATTCTACAG CAAGTGTAAA GTTATGGGAT
GATAAAGCAA ATCTTCCTGG AATTGAAAAT CTGAAACCAG GAGACTTGAT TAAAATTATC
AAAGCTTATG TTAAATCTGA TCTTGATGGT TCACCAACAA TTAACATTGG TTCAGGTTCA
AATGTAGAAG CTACTGATTC TACAAGTGAA ATTCCAACAA TAGACACAAT AACAAAAGAT
GTGAGTGAGT TGCAAGAAGG TCAAAAAGAT CTTGTAGTTT TAGGAGAAAT TGATGGAGTA
ATTAGCGGTA TGGAATTTAC AAACTCTAGA GGTATGCCTG GAAAAGCCTT GAGAATGAGG
CTAAAAGGAA AAGACGGCAG TGGAATGAGA GTAGTGTTAT GGGGAAAAGA TGAATCATCA
ATTCCAAATA TGATTTCACA GTCAGCTAAA GTGAGACTAC TTGGTGTCAA AGTAAAATCT
GGAAATCAAG GATTAGAAAT TCATGGAAAT GATGCAACAA TAATTGAGAT TGAAGGAGGT
AAAGAAGCAG AACCAGTAAT TGCAAGAATT CTTTCAATGT CACCAACAGA AAATGGAAGA
AACATGATTT TAGCGGTAGA TAATCAAAGG AATCTTTACA ACATTAATGA TTCATCAAAT
TCAACCAGTA TTTGTGTAGA AGGAGACGTC ATAGAATGCA TGCCATCCAA AGTTTATGGA
AATTCAATTA CACTTGATGA AAATTCTTTT GTAAGAAAGT TGGATAGTGA TGAAAGTATT
CCTTCATTAT CTCAAATTAG AACAAAGATT AATGATGTTA AAGTTGATGG TAATTACTGT
ATTGAAGCAA TAATTTTGAA AGTGCCAGAA AGACGCGAAG TACAAACAAA AACTGGAGAA
TCAATTGCAC TTTCAGAAAT GTTTGTCGAA GATGACACTG GACAAATTTG GGTAAAAGGA
TGGAGAAATC AAGCTAGACT AATTGACAAA TGTGAATTAG GGGAGATTGT TTCAATAACA
GGTCTTAATG CAAAAGCTGG ACTAGAAGGC AGAATAGAGA TGTTCTTAAC AGCATTTTCT
AAAATTACAA AAAAGAATTA G
 
Protein sequence
MSEFDSLVEK LIEQKPELTK EIIEEQIKLK KEKIGAGYLT DQGALFLIAS DYGVTLSGPL 
KVEMSLKDLY AGAKEISLET RVLNLSPAKQ FSRKDGSPFY LRTMTVYDDA NSTASVKLWD
DKANLPGIEN LKPGDLIKII KAYVKSDLDG SPTINIGSGS NVEATDSTSE IPTIDTITKD
VSELQEGQKD LVVLGEIDGV ISGMEFTNSR GMPGKALRMR LKGKDGSGMR VVLWGKDESS
IPNMISQSAK VRLLGVKVKS GNQGLEIHGN DATIIEIEGG KEAEPVIARI LSMSPTENGR
NMILAVDNQR NLYNINDSSN STSICVEGDV IECMPSKVYG NSITLDENSF VRKLDSDESI
PSLSQIRTKI NDVKVDGNYC IEAIILKVPE RREVQTKTGE SIALSEMFVE DDTGQIWVKG
WRNQARLIDK CELGEIVSIT GLNAKAGLEG RIEMFLTAFS KITKKN