Gene Nmar_1647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1647 
Symbol 
ID5774743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1504301 
End bp1505896 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content33% 
IMG OID641317301 
Producthypothetical protein 
Protein accessionYP_001582981 
Protein GI161529155 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA AATTCGTAAA AAAGATCCAA ACAAGTGGAC ATAATTTGAA AAATCTATTC 
CTGTTACTTT TACTGCCATT GTTGTTTGTA TTTACTTTTG ATAGTGCCTT TGGTCATGGT
GTAGGAAGTG AGACATTTCC TCCTGTAAAC CTTGATGGAA AACAAGTAAC TGTAGAGGTT
TCTTCATCAC AAAGTGATCC TGAAGCAGAT GACGATCAAC AAATCTCTAT TTCTCTGATT
GACTTTGATT CAAAAATTAC ACTACGTGAT GTTACATTTC ACATAAAATC TGAGAAAGGA
AACCAATTTC TCTTTGAGCA AGAATTCAAA ACAGATAATG GTTTTCTAGT ATTCAATTTT
GTTTCAGAAG AAACTGATTC TATTGTAATA GAAGAAGAAA CTGGTGCTAA TTTCTTTGGT
TCCATATTGG GATTAGATAG TAGACTAATT CATGTCAAAG GCCCAAAACT CAGTGAGGGT
GGACTGTACA AATTTGATAT CAGTATACTT ACTGCTGATG GATATTCAAA AACACTTGAA
AAACCACTAG TGTTTAATGC AGGAATTTCT ATTGCCCAAA CCTCTAGCCA TGATTTTGTT
GATCCAAACT TTGGAGAACA AAGCATTGAC GTAATTACTT ACTATGATGA AATATCTGAC
TTTGAATATG ATACTGATTC AAAAGAGATT AGATTTTCAA TGCCTTTTGA ATGGAGTCAT
ACAAACATCA ATCAAACATC AGTTGTCCAT GAAGAGTTAG ATATTCCAAA AACCTATGGT
GATTTGCTTG TATCTGGATT CACCATGTAC ATTAATGGAG TAGAACTCTC TGATGACATA
TCTACAATAG ATGACTTTTT TTCTGACGGT CGTGTGGTGC ATTTTATTAT TTATCAACAA
GAATTACTCC GAGTTCTTGA AAATGGCTCA AATGAAAATG GCATGAATTT CCTAATTACT
CCTGATAGAG ATTATCCGCA TATGAGCTCA GTTACTGAAA ATGGACAATT TCGAATTTTT
GCATCTTGGG AACCTGAAAA CTTGCAATCT GGTTCTGATG CAAAAATATT ATTTGATGTA
ACTGATGTTT TCTTGAAAAA CAAACCTATA GCAACAAATT ATGATTTCTC TATTACACAA
AATAACAAAG TCATTTACCA ACAAAGTGGA ACAAGTACTG ATTCAAGAGA AGAACATAAT
GTAGTAGAGT TTACAATTCC ACAAGATGTT ACAGGAATTG TTAATCTAAA TTTTAATAAT
TTAGATAATA ATGATCTTGC AAGAACAACT ATTCCAATTG TAATTGATAG AGTTACATCT
CAAAAAGAAA TTACAATTCC TGATTGGATT AGAAACAATG CATTGTGGTG GTCTGAAGAA
CAAATTGATG ATAATACATT TGTTCAAGGA ATTGAATATC TCATCAAAAA CAAAATAATT
GTAATTCCAT CAACACAACA ACAAGATTCT TCATCCCAAG AAATTCCATC ATGGATTAGA
AACAATGCTG CATGGTGGGC TGCAAAACAA ATAGACGATC AGACATTTGT CCAAGGACTG
GAATATTTGA TTCAAAAGGG AATCATTCGT GTCTGA
 
Protein sequence
MEKKFVKKIQ TSGHNLKNLF LLLLLPLLFV FTFDSAFGHG VGSETFPPVN LDGKQVTVEV 
SSSQSDPEAD DDQQISISLI DFDSKITLRD VTFHIKSEKG NQFLFEQEFK TDNGFLVFNF
VSEETDSIVI EEETGANFFG SILGLDSRLI HVKGPKLSEG GLYKFDISIL TADGYSKTLE
KPLVFNAGIS IAQTSSHDFV DPNFGEQSID VITYYDEISD FEYDTDSKEI RFSMPFEWSH
TNINQTSVVH EELDIPKTYG DLLVSGFTMY INGVELSDDI STIDDFFSDG RVVHFIIYQQ
ELLRVLENGS NENGMNFLIT PDRDYPHMSS VTENGQFRIF ASWEPENLQS GSDAKILFDV
TDVFLKNKPI ATNYDFSITQ NNKVIYQQSG TSTDSREEHN VVEFTIPQDV TGIVNLNFNN
LDNNDLARTT IPIVIDRVTS QKEITIPDWI RNNALWWSEE QIDDNTFVQG IEYLIKNKII
VIPSTQQQDS SSQEIPSWIR NNAAWWAAKQ IDDQTFVQGL EYLIQKGIIR V