Gene Nmar_1473 details

Gene Information

Locus tag: Nmar_1473
Symbol:
ID: 5773289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1340205
End bp: 1341287
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 31%
IMG OID: 641317121
Product: DNA-cytosine methyltransferase
Protein accession: YP_001582807
Protein GI: 161528981
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
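
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1340205
end_bp = 1341287
gene_length_bp = 1083
protein_length_aa = 360

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS encodes one residue per codon; the terminal stop codon adds none.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

All three checks agree with the record: 1341287 - 1340205 + 1 = 1083 bp, and 1083 / 3 - 1 = 360 residues.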


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGAAAG AAGAAATTGG CGTTATCGAT CTATTTGCAG GTTCAGGTGG ATTAAGTTTA 
GGATTTAAGA ATGCAGGGTT CAAAGTCATT GCAGCAGTGG AATTTGATAA GAGTGCTGCT
GAAACATATT CAAAAAATTT CAAGGAGACT AAATTAATTG TAGATGATAT AAAAAACATA
AAATCAAACG AACTAAAAAA AATCACATCA AAAGAGAGAT TTTGTGTAAT AGGAGGACCT
CCTTGTCAAC CTTATTCAAA TGCAAATAAA CAAAATAATG GCAAAAATCA TCCCTTTGCA
AATGCTATAA ATCATTATTT CAGAATAATA TCTGAATTAA AACCTCAGGC ATTTTTATTT
GAAAATGTTA CCAATTTTAG AAATCTCCCA GGTTGGAAAA AATTCCTCAA TGATTTTAAA
AAATTAGGAT ACATTCTGAG CGTATCTGTG ATTGATTGTG AAAAAGCAGG ACTTCCACAA
AAACGAAAAC GACTCTTTGT AACAGGTTTT TTGAATGGAC ATGAATGTAA TTTGAATTTG
ATCAAAATTC CTGTTTCATG CGATAATCTT TTTGATGTCA TTTCAGGATT ACCTTCTATA
AAACAAGGTA CATCTGGAAA CGAAATTATG AAACATCCTA AAAAATTTAA TTCCCCATAT
GGAAAAAAAT TAGGAGGAAA CATCAATAAC TTGTACAATC ATTGGTGTAC GAAACACGGA
GAAGATGTAA TCAATACAAT TTCTGAAATT AACGAAGGAA GAAGTTTACT TGATTCATGG
AAAACACTAT CTAAAAAAAC AAAATCTAGA TTCAAAAATA AAAATTCTCT TCATGGTAAT
ATCTATAGAC GTTTATCTTA TCAACATACA ACTCCTACAA TTGTTCATGC AAGACGAGCT
ATGTTGTTAC ATCCTAAAGA AAATAGAATA ATATCTGTAA GAGAGGCAGC TAGAATACAG
AGTTTTCCAG ACTCATTCAG ATTTTTTGGC ACAAATAATT CACAATATCA GCAAATTGCA
GATGCAGTTC CTCCTTTGGT GGCAGAATCA TTAGCACATG AAATAAGAAA AAATCTCAAA
TAA
 
Protein sequence
MRKEEIGVID LFAGSGGLSL GFKNAGFKVI AAVEFDKSAA ETYSKNFKET KLIVDDIKNI 
KSNELKKITS KERFCVIGGP PCQPYSNANK QNNGKNHPFA NAINHYFRII SELKPQAFLF
ENVTNFRNLP GWKKFLNDFK KLGYILSVSV IDCEKAGLPQ KRKRLFVTGF LNGHECNLNL
IKIPVSCDNL FDVISGLPSI KQGTSGNEIM KHPKKFNSPY GKKLGGNINN LYNHWCTKHG
EDVINTISEI NEGRSLLDSW KTLSKKTKSR FKNKNSLHGN IYRRLSYQHT TPTIVHARRA
MLLHPKENRI ISVREAARIQ SFPDSFRFFG TNNSQYQQIA DAVPPLVAES LAHEIRKNLK
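
The gene and protein sequences above are related by codon translation. The following minimal sketch, using only the Python standard library, translates the first row of the gene sequence and computes its GC fraction. It is not the method the database used. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits). The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Codon table in NCBI ordering: bases T, C, A, G, with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row (60 nt) of the gene sequence from this record.
first_row = "ATGAGGAAAG AAGAAATTGG CGTTATCGAT CTATTTGCAG GTTCAGGTGG ATTAAGTTTA"

print(translate(first_row))  # MRKEEIGVIDLFAGSGGLSL
print(round(gc_fraction(first_row), 3))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would give the complete protein and a whole-gene GC fraction to compare against the GC content field.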