Gene Nmar_0339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0339 
Symbol 
ID5774421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp302289 
End bp303509 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content31% 
IMG OID641315967 
Producthypothetical protein 
Protein accessionYP_001581673 
Protein GI161527847 
COG category[S] Function unknown 
COG ID[COG1641] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00299] conserved hypothetical protein TIGR00299 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTTGG TAATTGATCC TCAAATTGCA GGAATATCTG GGGATATGCT TCTTTCTTCA 
TTAATTGATT TGGGCGCAGA TAAGGGAAAG ATAATTGATG GAATTAAAAA ATCTGAACAA
TTTTTTTCAG ATTCTACTAT TACAAAAATC GATTTTCAAA AAACCAAAAA AAGAGGAATC
GAAGCTATTC AACTCGTTTT AGAAATAGAT GAACATTCTC ATGAAAAAAA AGGCTCTGAA
ATAAAAAAAG CAATTAATGA CTCTACATCA AATTTAGATC TATCAGATAA AGCAAAGATA
TTTGCTGAAT CATGTATCAA TTCACTCATT TCTTCAGAAT CTAAAATTCA TGGTGTTCCA
GAGGATTCTG TGCATTTTCA TGAGGCCTCT AGCATTGATA CCCTAGTTGA CATTGTCGGA
ATTACAATTG CCTTAGATGA TTTGGGATTA TTTGATGAAA AAATTATTTG CATGCCTGTT
TCTGTAGGTG GTGGAAGCGT AACTTTTTCC CATGGCACTA TGTCTAATCC TGCCAGTGCA
ATTTTAGAGA TTTTCAAAGA TTCTTATCTG AAAATTAAAG GTAATGATGC AAATGCGGAA
TTGACCACTC CAACGGGGGC GTGTATTTTG GCTAATCTGA CAAATACTTG TATGGATTAT
TATCCTGCAA TGAAAATTGA TTCAATTGGT TATGGTGCAG GGCAAAAAGA TTTTCAAAAT
TTTTCAAACG TGCTAAAACT AGTTAGAGGC TCTACAAATA ACTTGGAAAG TGACTCAGTA
AAAATTCTTG AAACTAACGT TGATGATATT TCAGGAGAAA TACTTGGAAA TCTAATTGAA
AAGATCATGC AAAAAGGTGC TAGAGATGTT TCAATTTATC ATGGAATTAC AAAAAAAGGA
AGACCTACAA ATTTGGTATC TGTAATATGT GATGATCAAA ATATTGATGA AATTGTTGAT
ACATTGGTAT TAGAAACTGG TACTTTGGGA ATTAGGATAT CTGAATCAAA TAGATTTGTT
GTACCAAGAA CAAATGAAAA CATTTCATTA ACAATTGATG GAAATTCCTT TGATGTGAGA
TACAAAAAAT CAACATTTAA GGGAAAAACT GATTTCAAAA TAGAATTTGA TGATCTTAAG
GATATTTCAA ACACCGTTGA AAAATCAATT AAAGAAATAG AATCATTACT TCGAAAAGAA
ATTGAAAAGT TGGAGAACTA A
 
Protein sequence
MVLVIDPQIA GISGDMLLSS LIDLGADKGK IIDGIKKSEQ FFSDSTITKI DFQKTKKRGI 
EAIQLVLEID EHSHEKKGSE IKKAINDSTS NLDLSDKAKI FAESCINSLI SSESKIHGVP
EDSVHFHEAS SIDTLVDIVG ITIALDDLGL FDEKIICMPV SVGGGSVTFS HGTMSNPASA
ILEIFKDSYL KIKGNDANAE LTTPTGACIL ANLTNTCMDY YPAMKIDSIG YGAGQKDFQN
FSNVLKLVRG STNNLESDSV KILETNVDDI SGEILGNLIE KIMQKGARDV SIYHGITKKG
RPTNLVSVIC DDQNIDEIVD TLVLETGTLG IRISESNRFV VPRTNENISL TIDGNSFDVR
YKKSTFKGKT DFKIEFDDLK DISNTVEKSI KEIESLLRKE IEKLEN