Gene Nmul_A2179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2179 
Symbol 
ID3785989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2473854 
End bp2475875 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content50% 
IMG OID637812266 
Productsite-specific DNA-methyltransferase (adenine-specific) 
Protein accessionYP_412863 
Protein GI82703297 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.199223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACCTG AACAACGACA ACGCATCATC GAAATCCTAC TTTCCGGTGG AGAAGTAGCG 
CCGGAATGGT CGCGCATCCT GTTCCCGCCT GAGAAGCGTG AATACGAGCT GGTCTATCAA
GGCAAGGAGC GCGAGGAAGA CATCATCGCC AACACGCTGG CAGTACCCTT ACAGCCAGTA
CGAACCTTCA ATAAAAACGG GGTGACGTGG CACAACAAAT TGATCTTTGG GGACAACCTG
CAAGCCATGA AGACGTTGCT GGAAATGAAG CGGCGTGGGG AACTATGCAA CGCCGACGGA
ACGTCTGGTA TCCGCTTGGT ATATATCGAC CCGCCATTTG CAACTCGACA AGAATTTCAG
GGCGCGCAAG ACCAAAAAGC GTATCAGGAT AAGATTTATG GCGCTACATT TATTGAGTTT
TTGCGAAAAA GGCTAATCCT CATTCGTGAC CTTCTTTCTG ATAATGGCCT TCTATACGTT
CACCTGGATT ACCGTAAGTC TCATTACATA AAAGTTATCC TTGATGAGAT TTTTGGCGAG
CAAAACTTTA TGAACGAAGT TGCTTGGTGC TACGGAGAAC GTGAGTTAGC AACACGTCAT
TGGAACAGGA AGCATGACAA TATCTTGGTG TATGCAAAAA ACTTCAAATC AGATCAACAT
GTATTTAACT GGAAAGAGGC GGCTGGTCAG TATTCACAAG GTACTTTGGC AAAATATGAG
CACATCGATG AAGATGGAAG GAAATTTCAA TTAAGGGGAA GAAACGTCAA GGGAAGTCCA
TGGCGCGGTA AGCATGGTAT TCCTTTGGAT GTAGAGGCAG CAAATCCGGA ATGGGTCTAT
AGGGATTACT TCGACACAAA AGAGGGTATT AGACCTCGTG ATTGGTGGAG CGACATTCCA
TTTCTTAACA GAGCATCAAG TGACAGATAC GACTATCCAA GCGCGAAAAA TCCTGCACTT
TTAAATAGAA TCATCAAGGT CTCCTCCAAT ATTGGAGATC TTGTTATGGA TGCATTTGCC
GGATCAGGAA CCACCTGTGC GGTTGCAGAA AAACTTAATC GTCGCTGGAT CGGCATTGAC
TGCGGGAAAC TTGCGATCTA CACCATCCAG AAGCGGATGC TGAATCTCAG CGAAAAGGGC
AAGGCGCTCA AAGCCAAGCC CTTCACCCTT TACAACGCCG GCCTCTACGA CTTCGCCCGT
CTTAAGGAAT TGCCTTGGAG TGACTGGCGG CGGTTTGCCC TCACCCTCTT TAGTTGCCAG
AGTGCCCCCC AACGAATCGG TGGCATCCAA TTTGATGGCA CACTGAAAGC CAGCCCCGTC
ATTATTTTCG ATCACCGCAA GGGGAATGGC GCGACAGTCA GCGAGGAGAC GTTACGCTCT
ATCCATGAAG CGGCGGGCTC CAAGGTTGGC ACACGTATAT TCATTATCGC GCCGGCAATG
GCCTTCGACT TCCAACAGGA TTACATCCAG ATCGGGGAGG TACGCTATTA CGCTCTGCGG
GTGCCGTATT CCATCATTCA CGAGCTGCAT CAACGCGACT TTCTTGCCTT AAAACAGCCG
ACTGACGAGA TGGCGGTGAA CGACACGGTA GATGCCGTAG GCTTCGATTT CATTAAGACC
CCTGAGTTGG AGTACGCCGT GGGTCGAGGC AACCCGGAGG GTGAATTGCT TGAGCAAGCA
TTCATTCGCA TCGACACTTT TTTCAGCGAG GCCGCCGTGC GGGAGCCGAT GCGAAAACGT
GGCAACCGGG AAACATTGTC CATGGTAATG TTGGATTATG ACTACGATGC CGAGAATGAT
GTGTTCGATC TGGACGAGGT GTTTTATGCC GAAGCTATCG AGGGGGCGGC TTGGGAGGTG
CGTTTTCCTG CCAACCGCCC CGGCGAGAAG GTCATGGCGG TTTTCCTAGA CATCTATGGC
AATGAGGCGC GTGTGGTGAT ACCTGCGGCT CATTTTGCGC TAGAGAAACT TGAAAAGCGC
AACGGCTCTC CCATTGCAAA GACAGAGGCA AAAAGAGCAT GA
 
Protein sequence
MTPEQRQRII EILLSGGEVA PEWSRILFPP EKREYELVYQ GKEREEDIIA NTLAVPLQPV 
RTFNKNGVTW HNKLIFGDNL QAMKTLLEMK RRGELCNADG TSGIRLVYID PPFATRQEFQ
GAQDQKAYQD KIYGATFIEF LRKRLILIRD LLSDNGLLYV HLDYRKSHYI KVILDEIFGE
QNFMNEVAWC YGERELATRH WNRKHDNILV YAKNFKSDQH VFNWKEAAGQ YSQGTLAKYE
HIDEDGRKFQ LRGRNVKGSP WRGKHGIPLD VEAANPEWVY RDYFDTKEGI RPRDWWSDIP
FLNRASSDRY DYPSAKNPAL LNRIIKVSSN IGDLVMDAFA GSGTTCAVAE KLNRRWIGID
CGKLAIYTIQ KRMLNLSEKG KALKAKPFTL YNAGLYDFAR LKELPWSDWR RFALTLFSCQ
SAPQRIGGIQ FDGTLKASPV IIFDHRKGNG ATVSEETLRS IHEAAGSKVG TRIFIIAPAM
AFDFQQDYIQ IGEVRYYALR VPYSIIHELH QRDFLALKQP TDEMAVNDTV DAVGFDFIKT
PELEYAVGRG NPEGELLEQA FIRIDTFFSE AAVREPMRKR GNRETLSMVM LDYDYDAEND
VFDLDEVFYA EAIEGAAWEV RFPANRPGEK VMAVFLDIYG NEARVVIPAA HFALEKLEKR
NGSPIAKTEA KRA