Gene Nwi_0203 details


Gene Information

Locus tag: Nwi_0203
Symbol:
ID: 3676659
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 233410
End bp: 234354
Gene Length: 945 bp
Protein Length: 314 aa
Translation table: 11
GC content: 66%
IMG OID: 637711741
Product: methylated-DNA-(protein)-cysteine S-methyltransferase
Protein accession: YP_316823
Protein GI: 75674402
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0350] Methylated DNA-protein cysteine methyltransferase; [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID: [TIGR00589] O-6-methylguanine DNA methyltransferase
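
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 233410
END_BP = 234354
GENE_LENGTH_BP = 945
PROTEIN_LENGTH_AA = 314


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def residues_from_cds_length(length_bp: int) -> int:
    """Number of amino acids, assuming the CDS length includes one stop codon."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(residues_from_cds_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 945 bp span corresponds to 315 codons, of which 314 encode residues and one is the stop codon.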


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.661415
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.947991
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCGGCA GACGGCGAAC GTCCTACATC CTGGCCATGA TGACGCTTGC GAAAATTCCT 
GCTCCGGCTG ATTCGCGCAT GACGCAGCCG GACGCGCCTG GCGCAGCATT GCGCGATTAT
GACGCGGTCC GCCGAGCGAT CGCCTTCATC TCCGAAAACT GGCGCTCGCA GCCCGCCATC
GCAGCGACGG CTGACGCCGC CGGCGTGACG CCGGACGAGT TGCACCATCT GTTCCGGCGC
TGGGCGGGTC TAACCCCGAA ATTGTTCATG CAGGCGCTGA CGCTCGACCA CGCCAAGCGG
TTGTTGCGCA AATCCGCCAG CGTGCTCGAT GCAGCCCTCG ACTCCGGCCT CTCGGGTCCG
GGACGCCTGC ACGACCTGTT CGTCACGCAT GAAGCGATGT CACCCGGCGA ATGGAAGAAT
GGCGGCTCCG GCATGAAGCT CGCTTTCGGT TTTCATCCCT CGCCCTTCGG CATCGCGATT
GTGATCGCCA GCGACCGTGG CCTCGCGGGA CTGGCTTTCG CCGACGGCGG CGACGAGCAG
GCCGCGCTCG CCGACATGAA GCGGCGATGG CCCAATGCAG CTTACGTCGA GGATGCAGCT
CGCACCGGGG CGCTGGCGCA GCGCGTGTTC GATACGAGGC TTTGGCGAGC CGACCAGCCG
CTGCGCGTGG TTCTGATCGG GACGGATTTC GAGGTCCGGG TCTGGCAGAC CCTGCTCAGG
ATTCCCATGG GAAAGGTCAC GACCTACTCA ACCATCGCCG CCAGTATCGA TCGCCCGACC
GCTTCGCGCG CCGTCGGCGC CGCTGTCGGC AAGAACCCGG TGTCATTCGT CGTGCCCTGC
CATCGCGTGC TCGGCAAAAG CGGCGCGCTG ACGGGGTATC ACTGGGGAAT CACCCGCAAG
CACGCGATGC TGGGGTGGGA GGCCGGGCGG ATTGGCCTGG AATGA
 
Protein sequence
MAGRRRTSYI LAMMTLAKIP APADSRMTQP DAPGAALRDY DAVRRAIAFI SENWRSQPAI 
AATADAAGVT PDELHHLFRR WAGLTPKLFM QALTLDHAKR LLRKSASVLD AALDSGLSGP
GRLHDLFVTH EAMSPGEWKN GGSGMKLAFG FHPSPFGIAI VIASDRGLAG LAFADGGDEQ
AALADMKRRW PNAAYVEDAA RTGALAQRVF DTRLWRADQP LRVVLIGTDF EVRVWQTLLR
IPMGKVTTYS TIAASIDRPT ASRAVGAAVG KNPVSFVVPC HRVLGKSGAL TGYHWGITRK
HAMLGWEAGR IGLE
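
The gene and protein sequences above can be checked against each other by translating the gene with translation table 11, the table named in the Gene Information fields. The following is a minimal sketch using only the Python standard library; it assumes the standard NCBI table 11 codon assignments and start-codon set, and the helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical, not part of any database tool.

```python
from itertools import product

# Sequences copied from the record above; whitespace is ignored.
GENE = """
GTGGCCGGCA GACGGCGAAC GTCCTACATC CTGGCCATGA TGACGCTTGC GAAAATTCCT
GCTCCGGCTG ATTCGCGCAT GACGCAGCCG GACGCGCCTG GCGCAGCATT GCGCGATTAT
GACGCGGTCC GCCGAGCGAT CGCCTTCATC TCCGAAAACT GGCGCTCGCA GCCCGCCATC
GCAGCGACGG CTGACGCCGC CGGCGTGACG CCGGACGAGT TGCACCATCT GTTCCGGCGC
TGGGCGGGTC TAACCCCGAA ATTGTTCATG CAGGCGCTGA CGCTCGACCA CGCCAAGCGG
TTGTTGCGCA AATCCGCCAG CGTGCTCGAT GCAGCCCTCG ACTCCGGCCT CTCGGGTCCG
GGACGCCTGC ACGACCTGTT CGTCACGCAT GAAGCGATGT CACCCGGCGA ATGGAAGAAT
GGCGGCTCCG GCATGAAGCT CGCTTTCGGT TTTCATCCCT CGCCCTTCGG CATCGCGATT
GTGATCGCCA GCGACCGTGG CCTCGCGGGA CTGGCTTTCG CCGACGGCGG CGACGAGCAG
GCCGCGCTCG CCGACATGAA GCGGCGATGG CCCAATGCAG CTTACGTCGA GGATGCAGCT
CGCACCGGGG CGCTGGCGCA GCGCGTGTTC GATACGAGGC TTTGGCGAGC CGACCAGCCG
CTGCGCGTGG TTCTGATCGG GACGGATTTC GAGGTCCGGG TCTGGCAGAC CCTGCTCAGG
ATTCCCATGG GAAAGGTCAC GACCTACTCA ACCATCGCCG CCAGTATCGA TCGCCCGACC
GCTTCGCGCG CCGTCGGCGC CGCTGTCGGC AAGAACCCGG TGTCATTCGT CGTGCCCTGC
CATCGCGTGC TCGGCAAAAG CGGCGCGCTG ACGGGGTATC ACTGGGGAAT CACCCGCAAG
CACGCGATGC TGGGGTGGGA GGCCGGGCGG ATTGGCCTGG AATGA
"""
PROTEIN = """
MAGRRRTSYI LAMMTLAKIP APADSRMTQP DAPGAALRDY DAVRRAIAFI SENWRSQPAI
AATADAAGVT PDELHHLFRR WAGLTPKLFM QALTLDHAKR LLRKSASVLD AALDSGLSGP
GRLHDLFVTH EAMSPGEWKN GGSGMKLAFG FHPSPFGIAI VIASDRGLAG LAFADGGDEQ
AALADMKRRW PNAAYVEDAA RTGALAQRVF DTRLWRADQP LRVVLIGTDF EVRVWQTLLR
IPMGKVTTYS TIAASIDRPT ASRAVGAAVG KNPVSFVVPC HRVLGKSGAL TGYHWGITRK
HAMLGWEAGR IGLE
"""

# Codon assignments for NCBI translation table 11 (assumed here), with codons
# ordered TTT, TTC, TTA, TTG, TCT, ... (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11 (assumed); each is read as Met when initiating.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete CDS (start codon through stop codon) to protein."""
    dna = clean(dna)
    if len(dna) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"unexpected start codon {codons[0]}")
    if CODON_TABLE[codons[-1]] != "*":
        raise ValueError(f"CDS does not end in a stop codon: {codons[-1]}")
    # The initiating codon is read as methionine whatever its usual meaning.
    return "M" + "".join(CODON_TABLE[c] for c in codons[1:-1])


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    print("translation matches record:", translate_cds(GENE) == clean(PROTEIN))
    print(f"GC content: {gc_fraction(GENE):.1%}")
```

Note that the gene begins with GTG rather than ATG. GTG normally encodes valine, but as an initiating codon it is read as methionine, which is why the protein sequence starts with M.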