Gene Nmar_1453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1453 
Symbol 
ID5774094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1321778 
End bp1323226 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content39% 
IMG OID641317101 
Producthypothetical protein 
Protein accessionYP_001582787 
Protein GI161528961 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGACA TTACTCCAAA GAAAATTGGT GAGAATCAAT ACCAAATTGA TGCTGATTCT 
AATTTAGGAA TGAAAGTTCC AGTAAAGATT TACGCAAATC AAGGATTACT TGACAAAATG
CTTACAGATA GAACTATCAT GCAAGCAAGA AATGTCTCAT CTATTCCTGG AATCGTAGGA
CACAGTGTAG TTTTACCTGA TGGACATGAA GGGTATGGTT TTCCAGTAGG TGGAGTTGCT
GCTATGGATG CTGAAGAAGG AATGATCAGT CCTGGTGGTG TCGGTTATGA CATTAACTGT
GGAGTGAGAT TGCTCCGCTC TAATCTAACT GAAGAAGTAG TTCGCTCAAA ACTAAAGGAC
TTGGTAACTG ATTTGTTTAG TTCAATTCCT TCAGGAGTTG GCTCTAAAGG TGCAGTAAAA
CTTAGTCACT CAGAACTAGA CGAGGTTCTA GTTAATGGTG TAAACTGGGC AATTGATCAT
GGTTATGGTT CTACAAATGA TTCAGATGTT TGTGAAGAGA ATGGTCAGAT AAAAAATGCA
GACCCTAACA AAGTTTCAGA TAAAGCAAGA AAGAGAGGAG CTCCACAACT TGGAAGTTTA
GGCTCTGGAA ATCACTTTTT AGAAATTCAA AAGGTTGCAG AAGTTCATGA TGAAGAAGCA
GCTGAAAAGA TGGGAATCAA AGAAGGAACA ATTACAGTTC TAGTTCATTG TGGTTCAAGA
GGATTTGGTC ACCAAGTTTG TAGTGATTAT TTGAGAGTAT CAGAACAAGC AATGTCAAAG
TATGACATCA CTCTACCAGA CAGAGAACTT GCATGTGTTC CAAATACTTC TGAAGAAGGA
GAGTCTTACA GAAAAGCAAT GTTTGCAGCT TTAAACTTTG CATGGAGTAA CAGACAGATG
ATCACTCATT GGACAAGAAA ATCTTTTGAA CGCGTATTCA ACCAATCTGA ATCTGATCTT
GACATGAAAC TAGTGTACGA CGTTGCACAC AATATAGCTA AAGTTGAAAA ACACAAAGTA
AACGGAGAAG AAAGAAAACT AGTTGTCCAC AGAAAAGGTG CAACTAGAGC ATTTCCTGCA
AACAGAGATG AGGTTCCAAC AAAATATCGT CATTTGGGGC AACCCGTATT GGTTCCAGGT
TCAATGGGTA CTGCAAGCTG GATACTTTTA GGACAACCAA ATTCTATGGA CTTGAGCTTT
GGTTCTACTG CACATGGTGC AGGAAGAACA ATGTCACGTT CCAAAGCAAG ACGAAATTAC
ACTGAAGATG ATGTTAAAAA ATCCCTAAAT GACAAGGGCA TATTTATCAA GGCATTAACC
CGAGATGGAG TTGTGGAAGA GACACCTCAA GCCTACAAGG ACGTTAATTC TGTAGTTGAT
GTATCTCACA ATCTAGGAAT TGCCACCAAA GTAGCAAAAT TGGTGCCTAT AGGTGTGATT
AAAGGTTGA
 
Protein sequence
MGDITPKKIG ENQYQIDADS NLGMKVPVKI YANQGLLDKM LTDRTIMQAR NVSSIPGIVG 
HSVVLPDGHE GYGFPVGGVA AMDAEEGMIS PGGVGYDINC GVRLLRSNLT EEVVRSKLKD
LVTDLFSSIP SGVGSKGAVK LSHSELDEVL VNGVNWAIDH GYGSTNDSDV CEENGQIKNA
DPNKVSDKAR KRGAPQLGSL GSGNHFLEIQ KVAEVHDEEA AEKMGIKEGT ITVLVHCGSR
GFGHQVCSDY LRVSEQAMSK YDITLPDREL ACVPNTSEEG ESYRKAMFAA LNFAWSNRQM
ITHWTRKSFE RVFNQSESDL DMKLVYDVAH NIAKVEKHKV NGEERKLVVH RKGATRAFPA
NRDEVPTKYR HLGQPVLVPG SMGTASWILL GQPNSMDLSF GSTAHGAGRT MSRSKARRNY
TEDDVKKSLN DKGIFIKALT RDGVVEETPQ AYKDVNSVVD VSHNLGIATK VAKLVPIGVI
KG