Gene Nmul_A2226 details


Gene Information

Locus tag: Nmul_A2226
Symbol:
ID: 3784927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2526538
End bp: 2527941
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 44%
IMG OID: 637812314
Product: hypothetical protein
Protein accession: YP_412910
Protein GI: 82703344
COG category: [L] Replication, recombination and repair
COG ID: [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon
TIGRFAM ID:
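
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2526538
end_bp = 2527941
gene_length_bp = 1404
protein_length_aa = 467

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Each residue is encoded by one codon; the final codon is a stop and is not translated.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print(span_bp, codons, coding_codons)  # prints: 1404 468 467
```

Under those assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.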


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.113996
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATCTAT CTGGATTAGA AGAAGTGTAT TCTCAAAGGA TGCCAGAAAA GATAGACACT 
TCGGAAATGC TAAATCTGCT CAAAACGGCA TATATTACTA CCGGCGAGCC GGTTGAAGTT
GATTTTCGCT CGCTTGTCGA TTGGGTAAAG ATTGGAGACC AATACTCTCA CCTACTTCAT
CCGTACCCAG CGAAACTTTT ACCACACATA GCCAACTTTT TTTTACGTTG TAGTTCATTG
GTTGGAAAGG AGGCGATGGT TTTGGATCCA TTTTGCGGAT CGGGAACAGT CGCTCTTGAA
GCGTTTCTTG CTGGCCATAC ACCGCTAATA GCAGATGCAA ATCCCTTGGC ACTGTTGATC
GCTAAAGTTA AGACAACTAC ATTTAATGAG ACGGTCTTGG TACAACTTGC TAAGAGCATA
TGTTGTAGGG CAAAGTCTTT TCGAACCGCC CCAACGATAC ATGTTATAAA CGATCATCTC
TGGTATTCGC CGAGAATTAA AATTGCCCTC GAAAAGCTTG TGAGGGCAAT TAGGGAATTG
CCGAGAAATA TCGAGCGGGA ATTTTTTGAG CTTTGCTTAT CAGTGACCGC ACGCCGTCTC
AGTTTTGCTG ATCCACGCAT TTCAGTGCCA GTGCGACTAC GGATAAAACC TAGTCTTGGA
AACGTAGCTT CTAAAATAAT TTCCGAACGC TTGGAATGGC TTCAAGACGT TAACGTGATT
GCAGAGTTCC AAAAAACTGT AGACACAAAT ATCCAACGAA TCCAGCAGAC TAATCGAGCT
GCTAACGGTT GCCGGGCACG AACCAAAATC GTCGGTAGTG ATGCTCGTGA TCTCAGAGAG
TCTGTCTCAG GCACAAAGCT TCAAGACAAT TGCGTGGATT TGGTGATTAC GTCGCCGCCT
TATGGAAGTG CGCAAAAATA TGTTCGTGCC AGCAGTCTTT CATTGAACTG GTTGGGCTAT
GCATCTCCAG ACACACTAAA ACATCTCGAA CGTATCTCTA TTGGGAGAGA GCATGTGCCG
GCATCTTGGC AAATATCGAA TGGAAGTCTT TCAAGTTCTT TCGAGAATCT TTTAGATCGG
GTTGGGCAAA AAAATAGAAC AAGGGAACGA ATAACCAGGA CGTACTTGAT AGAGTTGCGG
CAAGCAATGG TGGAAGTCGC GCGCGTAACC AAAGACGGAG GAACAATCAT TGTAGTAGTA
GGAAATAATC AGGTCTGTGG TGAAACCTTG AGGAATGATG AATACTTGAA GGAAGTGTTA
ACTGATCTTC GATTTGAGAT AACCCTGCAT TTGATTGATC ACATAAAATC CCGTGGCTTG
ATGACAAAGA GAAACAAGAC TGCTTCTATT ATTTCACGGG AAAGTGTTCT CGTTTTCACG
AAGAGTTTGT ATAGAGGGAG CTAA
 
Protein sequence
MNLSGLEEVY SQRMPEKIDT SEMLNLLKTA YITTGEPVEV DFRSLVDWVK IGDQYSHLLH 
PYPAKLLPHI ANFFLRCSSL VGKEAMVLDP FCGSGTVALE AFLAGHTPLI ADANPLALLI
AKVKTTTFNE TVLVQLAKSI CCRAKSFRTA PTIHVINDHL WYSPRIKIAL EKLVRAIREL
PRNIEREFFE LCLSVTARRL SFADPRISVP VRLRIKPSLG NVASKIISER LEWLQDVNVI
AEFQKTVDTN IQRIQQTNRA ANGCRARTKI VGSDARDLRE SVSGTKLQDN CVDLVITSPP
YGSAQKYVRA SSLSLNWLGY ASPDTLKHLE RISIGREHVP ASWQISNGSL SSSFENLLDR
VGQKNRTRER ITRTYLIELR QAMVEVARVT KDGGTIIVVV GNNQVCGETL RNDEYLKEVL
TDLRFEITLH LIDHIKSRGL MTKRNKTASI ISRESVLVFT KSLYRGS
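
The protein sequence can be reproduced from the gene sequence by translating it with translation table 11, where an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal sketch, not the database's own pipeline; `translate_cds` and the constant names are hypothetical, and the codon table is built from the standard amino acid assignments that table 11 shares with the standard code.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Start codons permitted by translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    # The initiator codon is always translated as methionine.
    protein = ["M"]
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above, with the block spacing left in.
print(translate_cds("GTGAATCTAT CTGGATTAGA AGAAGTGTAT"))  # prints: MNLSGLEEVY
```

Passing the full gene sequence, with or without the spaces between 10-base blocks, to `translate_cds` allows the result to be compared against the listed protein sequence.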