Gene Information |
Locus tag | Nmul_A0078 |
Symbol | |
ID | 3785803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 82322 |
End bp | 83473 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810148 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_410779 |
Protein GI | 82701213 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
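The coordinate and length fields above are tied together by simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though the listed values are consistent with it) and that the CDS is unspliced and ends in a stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(82322, 83473)
print(length)                  # 1152
print(protein_length(length))  # 383
```

Both results agree with the Gene Length and Protein Length fields of this record.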
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.99857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGG GAGGAGATAT TTCGGATTCT ATTGCGAATT CTTTCGCCAC CAGACTCATC CGCTGGCAGC GCGAGCATGG CCGACACCAT CTGCCTTGGC AGAACACGCG TGACGCCTAT TCCATCTGGC TCTCGGAAAT CATGCTGCAG CAAACGCAGG TGGGGACAGT TATTCCCTAT TACCGGCGAT TTCTGCAATG CTTCCCCGAC ATACAAAGCC TTGCCTCTGC ACCGCTGGAT GAGGTAATGG TACAGTGGAG CGGGTTGGGT TACTACTCGC GTGCGAGGAA TCTGCATAAA GCGGCGCAGC GGATTGTGGG AGAGCATGGC GGGATTTTTC CCGAGGAGGT TGCCATCATT CGTCAGCTTC CGGGAATCGG CCGCTCCACC GCTGCGGCAA TTGCGGTGTT TGCATTCGGA AAGCGAGCCG CGATCCTCGA CGGCAATGTA AAACGCATTC TTTCGCGTTG CTTCGGAATC GAGGGTTACC CGGGTGAAAA GCAGGTGGAA GCGCAGCTAT GGCAAAAAGC GGAAGCATTG CTGCCAAAGG GAGATGAGAG CCCGATTGAA CGCGATATCG AGGGCTATAC CCAGGCGCTG ATGGACCTGG GTGCGACTAT CTGCATCCGC GCTCGTCCCA TGTGCGGCTC ATGCCCGCTT CGGCTGGAGT GTGTTGCATT CAGGGATAAC CGCGCCGGCA GCCTGCCCAC CCCTCGGCCG AGGAAGATAT TGCCGGAAAG GGAAGCGGTG CTGCTGCTGG CGGTGGCACA GGGCAAAATC CTCTTGGAAA AACGGCCGAG CACGGGGATC TGGGGCGCCC TATGGAGCTT GCCGGAGATG GGGATGAATG AGAATGTGAT TGAATACTGC CTGCGTTTCG GGATAAATGT GCGGCCGATG TCACAGATGG AAGCGCTCAC TCACACCTTT ACCCACTTCA GATTGCGGAT TTATCCGCTC ATCCTGCAGG TCATTTCCCG CCCGCCGGAT CATTTGACAC CGGAGGTCTT ATCGCAGCCC CGGCGTCCCT GTGTATGGAG GATGCCGGAG GATGCGCTGA AAGCTGCCAT TCCCTCCCCC GTGAGGAAAG TGCTTCTACA ATATGCATCT CAGACAGAGC CGCTTGAAAT TCCGGCTATG ACAGATAACT GA
|
Protein sequence | MSAGGDISDS IANSFATRLI RWQREHGRHH LPWQNTRDAY SIWLSEIMLQ QTQVGTVIPY YRRFLQCFPD IQSLASAPLD EVMVQWSGLG YYSRARNLHK AAQRIVGEHG GIFPEEVAII RQLPGIGRST AAAIAVFAFG KRAAILDGNV KRILSRCFGI EGYPGEKQVE AQLWQKAEAL LPKGDESPIE RDIEGYTQAL MDLGATICIR ARPMCGSCPL RLECVAFRDN RAGSLPTPRP RKILPEREAV LLLAVAQGKI LLEKRPSTGI WGALWSLPEM GMNENVIEYC LRFGINVRPM SQMEALTHTF THFRLRIYPL ILQVISRPPD HLTPEVLSQP RRPCVWRMPE DALKAAIPSP VRKVLLQYAS QTEPLEIPAM TDN
|
| |
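The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline: it builds a codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it strips the 10-base grouping spaces used in this record. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence in this record.
print(translate("ATGAGCGCGG GAGGAGATAT T"))  # MSAGGDI
# Last 12 bases of the gene sequence, ending in the TGA stop codon.
print(translate("ACAGATAACT GA"))            # TDN
```

The two fragments reproduce the first and last residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the GC content field to be checked the same way.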