Gene Nmar_0657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0657 
Symbol 
ID5773694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp596491 
End bp597801 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content34% 
IMG OID641316293 
Producthypothetical protein 
Protein accessionYP_001581991 
Protein GI161528165 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03440] conserved hypothetical protein TIGR03440 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.568956 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACAA ATCCAGAACT AACAGAGAGA AAACCTCTAC TTGAGCAATT TAGAGAAACT 
AGAAGTAGAA CTTTAGAGTT AGTAAAAACT CTAGAAAAGG ATGATTTTGT GGTACAAACT
GCATTTTTCA TGAGTCCACC AAAATGGCAC GTCGGACACG TTAGTTGGAT TTATGAAGCA
ATTATGAGTA AACTAGACAA AAATTATGAA TTTTATTCAA AAGAATTTTC AGAATATCTT
AATTCCTATT ATCAACAATT TGGCGTTCCA CATGACAAAG GATTGCGAGG TGTAATTTCT
AGACCAACTG TTGATCAGAT TTTTCAATAT TTCAATACAA TCAATCAAAG AGTAGAGAAA
TTTATTGAAA CTCAGTCTTT AGACAAAAAT GCAGTTCGAT TAATCACAAT GGGATTTCAT
CATGAATGTC AACATCAAGA ATTGTTAGTG TATGATTTAC AACATCTTCT TGCTGAACAA
TATAGACCAG TAAAGAAAAA TCAAATCCAA AAACAAAACA GTGTTGAACA AAGTTCTGTT
CAGATAAACG GGGGATTATA CACTATGGGA TTCAATGGAA AAAAATTCTG CTACGATATT
GAACTTCCAG AACATAAAGT ATACCTGAAT AATTACAAAA TTGATGTGTT TCCAATTACA
AACAAGCAAT ATCTAGAATT TATCGAAGAT GGAGGATATG AGACATACAA GTATTGGTTA
TCAGATGGTT GGGAGAAAGT AAAAGATAAC AAATGGAATG CACCAATGTA TTGGGAAAAA
GTTAATGGTA AATGGAATGT TAGAGACTTT TTAGGAATCC GAGAAATTAA CCCAAATGAA
CCAGTATGCC ATGTCAGTTT CTATGAGGCT GATGCATATT GTAAATGGGC AGGAAAAAGA
TTACCAACAG AAGCAGAATG GGAAAAAGCA GCATGTTGGA ATGAATCAAA ACAAGAAAAA
ACCATTTTCC CATGGGGAAA TGAACAACCA ACAGAGAACA AAGCCAATCT TCTTGAATCA
TATCACTGGG GATGCACAGA GATTGGAACA TATCCAGAAG GAATCAGTTC ATCAGGATGT
CAGCAAATGA TCGGAGACAT ATGGGAATGG ACATCATCAG AATTTACAGG ATATCCAGGT
TTCAAAACAG GATTTGATGA ATATAATGAC AAATGGTTTA CAAATCAAAA AGTTTTGAGA
GGAGGTTCGT TTGGCACACC AAAAATGTCA ATACGTGGAA GTTATAGGAA TTTCTTTAGA
TTAGATGAAA GATGGTTGTT TTCAGGTTTT AGATGTGCTG AAGATATTTA G
 
Protein sequence
MTTNPELTER KPLLEQFRET RSRTLELVKT LEKDDFVVQT AFFMSPPKWH VGHVSWIYEA 
IMSKLDKNYE FYSKEFSEYL NSYYQQFGVP HDKGLRGVIS RPTVDQIFQY FNTINQRVEK
FIETQSLDKN AVRLITMGFH HECQHQELLV YDLQHLLAEQ YRPVKKNQIQ KQNSVEQSSV
QINGGLYTMG FNGKKFCYDI ELPEHKVYLN NYKIDVFPIT NKQYLEFIED GGYETYKYWL
SDGWEKVKDN KWNAPMYWEK VNGKWNVRDF LGIREINPNE PVCHVSFYEA DAYCKWAGKR
LPTEAEWEKA ACWNESKQEK TIFPWGNEQP TENKANLLES YHWGCTEIGT YPEGISSSGC
QQMIGDIWEW TSSEFTGYPG FKTGFDEYND KWFTNQKVLR GGSFGTPKMS IRGSYRNFFR
LDERWLFSGF RCAEDI