Gene Nmar_0584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0584 
Symbol 
ID5773013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp520425 
End bp522080 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content32% 
IMG OID641316218 
Producthypothetical protein 
Protein accessionYP_001581918 
Protein GI161528092 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGTAA TGGGCGTATT TTTCACGATT TTTCTAAATT CCGCACATGC TCAAACTGTG 
GGGGATCAGA CAACTTTGTC AGGAGATCTA CAAAACAATC CTATTGCCCA AGACATTCTT
AAAAAAATTG AACAAAGTAA AAAATGGATT GCAAAAATTG AACAAAGAAA TTTTGAAGAT
TCTCAACGAC AAGCAGAATT AGAACAAAAG CGTGCTGAAA TTTTACAAAG TTTGGAAGAC
GATTTAAGAA AATGGGAAGA ACTTTGGGGT TACTATACAT TTGATAATAT ACTTGAAAGA
GCATTAGAAA ATAGTCCTGC AAAGGACACT TCTAGCATTT ATGATCATCC TCTAAAATTT
ACTGCTTCTA AAATTAATGC TGGAAAAGAA GCTTTGCAAA AAGTAATCTT AGAAGGGGGA
AATTCTGAAC AAGCAAGAGA CGCATTTGTC AAAGCTGCAA AAATAACAAG AGCTGAAATG
GTGTCTGTTA ATGCATTTTA TAATATTTTG AACAATAATG CTTACTACAA TCAACAAGTA
CTCTTTGAAT CTGATGGTAG ATTCAACTAT GATTTGTCTG GGGAAGAATT GAGAAAATAC
TATCAAGATT TTAGAACAAA TCCTGCATAT TTTGAAGCAA ATCCTCTTGA TGAAGTTTCT
TGGTCTGATC TTGGCAAAAC TAATTTTGAT ACTGAATGTA GAACAGGACA TGTTTTAGTT
TACAGAACTC ATGCAGATGA TTATGTTTGC ACTACTGAAT ATACTGCTGA AATGTGGGTA
CGACATGATA TGGGAAAACT TGCTAATGGA ATTAATGAAG AACGACATAA TCTGCTAAAT
GAACAGAAAT TCAACAAAGA CAGAATTTTA CAAAAGGCAG ATAGTTTAAA CTCTAAAATC
AAAACCATAC AAACACACTA TGAAGCAGAA ATCTCAGAAA TACTTGCAAA ATATGACTCC
CTTATGACTG ATATCGAACT AGACAAACGT GCTGAAGAAA AACAAATTCT TGAAAATTCT
GACTCTGATT CAAAAAAGAC AATCAGTCAA CAAATTGCCA ATATTCGAGA AAAATTTGAT
GAACTTGAAA AAAACACTCT TGATGAAAAA GACGACGTTT TGAAGATTCT AGCAAATCAA
CATATTACCT CAATAGAAGA ATTTGCATCT CTCTATGAAC TTGATGATGA AATCAAAATT
GAATGGAATG CTGATTCTCT GACTTTTTAC CCATCTGCAT ATTATTTTCC ACAACAATCT
GAATCTAGTT TAATCGTAAA GACTAGTTCT GAAAATACTA TTTCTGATTT TCTTGTTGAT
GACACAAGTT TCAAAAATGC ATTTGGTGAA AAAATACATT CACTAAAACC AGGTCAATTA
GTTCAGATTG CTTCTGATGT TACAAATAAT GATAATTTTT CAAAAAAATT TGTTTATCTA
GTTGAAATTA AAGATGAACA AAATCAAATA GTTCAACCTC TAAAGTGGAT AACAGGCCAA
CTTGATTCAG ATCAAGTCCT TAATTTGGGA TTGTCTTGGA TTCCACAAAC TCCTGGTAAT
TTTTATGCAG ATGTTTTTGT TGGAACTAGC TTAGACTTTG TGTCTCACAC AGAAACCATT
TCTATTTCTG TAACTCCACA AGATCATTTG TCCTAA
 
Protein sequence
MLVMGVFFTI FLNSAHAQTV GDQTTLSGDL QNNPIAQDIL KKIEQSKKWI AKIEQRNFED 
SQRQAELEQK RAEILQSLED DLRKWEELWG YYTFDNILER ALENSPAKDT SSIYDHPLKF
TASKINAGKE ALQKVILEGG NSEQARDAFV KAAKITRAEM VSVNAFYNIL NNNAYYNQQV
LFESDGRFNY DLSGEELRKY YQDFRTNPAY FEANPLDEVS WSDLGKTNFD TECRTGHVLV
YRTHADDYVC TTEYTAEMWV RHDMGKLANG INEERHNLLN EQKFNKDRIL QKADSLNSKI
KTIQTHYEAE ISEILAKYDS LMTDIELDKR AEEKQILENS DSDSKKTISQ QIANIREKFD
ELEKNTLDEK DDVLKILANQ HITSIEEFAS LYELDDEIKI EWNADSLTFY PSAYYFPQQS
ESSLIVKTSS ENTISDFLVD DTSFKNAFGE KIHSLKPGQL VQIASDVTNN DNFSKKFVYL
VEIKDEQNQI VQPLKWITGQ LDSDQVLNLG LSWIPQTPGN FYADVFVGTS LDFVSHTETI
SISVTPQDHL S