Gene Nmar_1613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1613 
Symbol 
ID5773987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1470803 
End bp1471993 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content33% 
IMG OID641317266 
Productamidohydrolase 
Protein accessionYP_001582947 
Protein GI161529121 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAATCA AGAACATTAG TCTCCTTTTA GGAAAAGAAT TAGAATTTGT TTCAAAAACA 
AATGTACAAA TTCAAGATGG TAGATTCAAA CGAATTCAAC CTAACATTAA ACCAAGTGGT
AAAGAGGATT CTATTGACTG TGAAGATCTT TTATTGATCC CAGGATTTAT CAATGCACAT
ACACACATAG GTGATTCAAT TGGAAAAGAT GTCACTCTAG AAAGTTCTGT AGATAAAAAA
ATACATCCTG TTTTTGGGGC AAAGTCAAAA ATTCTAAAAA ACACTCCTCC TGAAAATTTG
TCTAATTTTA TGAAAAATAC ATGTCATTCT ATGATTAGAA AAGGAATAAC CACCTTTGTT
GATTTTAGAG AAGGGGGTTT AGATGGTGTT ATCTTGTTGA AAAAAACATT ATCTGAAATT
CCAATCCGAT CAATTATTTT GGGTAGGGTT AATTTCTACC AAAATTCAAC TGAAATCAAA
AAAAATCTCC CCATTCCTAA AGAAAAAGCC AAGGAATTGC CTCTAATTCT TCAAAAATGT
GATGGTATTG GAGTTAGTGG TGCAAATGAG AACAGTACTT CAACATTGAA TCATTACTCA
AAGACATCAA AGATTCGAGC AATTCATTCT GCTGAAACAA AACAGAGCGT TTCAAGATCT
AAAAAGATGA CTAGAAAATC TGAAGTGATT CGTGCATTGT CCATGAAACC TCATTTTCTT
ATTCATATGA CTCATGCATC AAACAGTGAT CTTCATCTAG CTGCAAAAAA AACTCGAGGA
ATAGTAGTTT GCCCAAGAGC AAATTCTTCT TTGGCTGAAG GAATTCCTGA CATTACTTTG
ATGCAAAAGG CCGGTTGTAC GCTTGGATTA GGCACAGATA ATGTTATGAT AAACTCTCCT
GACATGTTCA GAGAAATGGA TTATCTTTGG AAAGTCACAA TGGGCATTCA TAAAAAAAGA
ATCAATCCTA AAGAAATTTT GAAAATGGCT ACCGTAAATG GAGGAAAAAT ACTAAAAAAA
GACATTGGAG TAATTGAAAC CAAAAAAATT GCTGATTGCA TATTTCTAAA CAAACATGCA
TTAGATTTAG AACCAATGCA TGAACCATAT GCATCTATTG TACATAGAGC ATCTGAATCT
GCAATCCAAG CAGTAATGAT TGGAGGTAAA ATAGTTCATG GAAAAATCTA G
 
Protein sequence
MLIKNISLLL GKELEFVSKT NVQIQDGRFK RIQPNIKPSG KEDSIDCEDL LLIPGFINAH 
THIGDSIGKD VTLESSVDKK IHPVFGAKSK ILKNTPPENL SNFMKNTCHS MIRKGITTFV
DFREGGLDGV ILLKKTLSEI PIRSIILGRV NFYQNSTEIK KNLPIPKEKA KELPLILQKC
DGIGVSGANE NSTSTLNHYS KTSKIRAIHS AETKQSVSRS KKMTRKSEVI RALSMKPHFL
IHMTHASNSD LHLAAKKTRG IVVCPRANSS LAEGIPDITL MQKAGCTLGL GTDNVMINSP
DMFREMDYLW KVTMGIHKKR INPKEILKMA TVNGGKILKK DIGVIETKKI ADCIFLNKHA
LDLEPMHEPY ASIVHRASES AIQAVMIGGK IVHGKI