Gene Nwi_2605 details

Gene Information

Locus tag: Nwi_2605
Symbol:
ID: 3675059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 2830554
End bp: 2831801
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 67%
IMG OID: 637714171
Product: A/G-specific adenine glycosylase MutY
Protein accession: YP_319210
Protein GI: 75676789
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
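
The Start bp, End bp, Gene Length and Protein Length values in this record are consistent with one another. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not translated; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record.
start_bp = 2830554
end_bp = 2831801

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1248 415
```

Both results match the Gene Length (1248 bp) and Protein Length (415 aa) listed in the record.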


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.443886
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.21931
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCGAG CGGCAACGGC GATACGTCAC ACGCACAAGA TGCAAGGCGG AGACCGCAAC 
CGCCCGGTGT TGCTGCTGGA GTGGTACGAC CGCAACCGCC GCCTTCTGCC GTGGCGCGCG
CTGCCCGGCG AGCCGGTCGA TCCCTATCGG GTCTGGCTGT CCGAGATCAT GCTGCAGCAG
ACCACCGTGA AGACGGTCGG GCCTTACTTC GAAAAGTTCC TGGCGCGCTG GCCGGACGTC
GCGGCGATGG CGCGCGCCTC GCTCGACGAT ATCTTACGGA TGTGGGCGGG GCTCGGCTAC
TATTCGCGCG CGCGCAATCT GCATGCCTGC GCCGTCAAGG TGCTGCGCGA TCACGGCGGC
CGGTTTCCCG ACACGGAAGA GGATCTGCGT GCGCTGCCGG GGATCGGTCC CTATACCGCG
GCGGCGATCG CCGCCATCGC CTTCAATCGA CGCACCATGC CGGTCGACGG CAATATCGAA
CGTGTGGTGT CGCGGCTTTT CGCCGTGGAC GAGCCGCTGC CGAAGGCGAA GCCGCGCATT
CACACGCTGG CCGCGACGTT GCTCGGGCCA TCACGATCCG GGCGCGACGG CAAGAGCCGT
GCTGGCGACG TGAAGACACG CGCTGGTCGC GACGGCAAGA GCCGCGCTAG CGACGTGAAG
ACACGCGCTG GGGATATCGC GCAGGCCTTG ATGGACCTTG GCGCCGCCAT TTGTACACCA
AAGAAGCCTT CTTGCGTGCT GTGTCCGCTG AGCGATGATT GCGCCGCGCG GGCGCGCGGC
GATCAGGAGA CCTTTCCTCG CAAGACCCCG AAAAAGGCAG GCGAGTTGCG GCGCGGCGCG
GCCTTCGTCG TGAGGCGCGG CAGCGAGGTG CTGGTTCGTA CACGGCCGGC AAAAGGCCTG
CTCGGCGGCA TGACCGAAGT TCCGACATCG ACGTGGCTCG CCGCGCAGGA TGATACGGCC
GCCCTGAAGC AGGCGCCGTG CCTTGAAAGC GCGCCGCGCT GGCGGCGCAA GGCTGGCGCG
GTCACGCATG TTTTCACGCA TTTCCCGCTG GAGCTTGCTG TCTATACCGC AGCCGTCGCC
AGGCGGACCG CCGCGCCGGA GGGGATGCGC TGGGTCCCGA TCGCCCGGCT GAACGATGAA
GCGCTGCCCA ACCTGATGCG CAAGGTCATT GCGCACGGCT TGGGTGATGA GATGGAAATC
CATGCGCTCG CAACCCGTCG CAAGCAACAA GAAGGCCCTC CGGCGTAA
 
Protein sequence
MARAATAIRH THKMQGGDRN RPVLLLEWYD RNRRLLPWRA LPGEPVDPYR VWLSEIMLQQ 
TTVKTVGPYF EKFLARWPDV AAMARASLDD ILRMWAGLGY YSRARNLHAC AVKVLRDHGG
RFPDTEEDLR ALPGIGPYTA AAIAAIAFNR RTMPVDGNIE RVVSRLFAVD EPLPKAKPRI
HTLAATLLGP SRSGRDGKSR AGDVKTRAGR DGKSRASDVK TRAGDIAQAL MDLGAAICTP
KKPSCVLCPL SDDCAARARG DQETFPRKTP KKAGELRRGA AFVVRRGSEV LVRTRPAKGL
LGGMTEVPTS TWLAAQDDTA ALKQAPCLES APRWRRKAGA VTHVFTHFPL ELAVYTAAVA
RRTAAPEGMR WVPIARLNDE ALPNLMRKVI AHGLGDEMEI HALATRRKQQ EGPPA