Gene Mvan_0034 details


Gene Information

Locus tag: Mvan_0034
Symbol:
ID: 4644815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 41676
End bp: 43310
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 64%
IMG OID: 639803545
Product: N-6 DNA methylase
Protein accession: YP_950891
Protein GI: 120401062
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
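
The start, end, gene length, and protein length fields above are mutually consistent. The following is a minimal Python sketch of that check. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(41676, 43310)
print(length_bp)                  # 1635
print(protein_length(length_bp))  # 544
```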


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.966218
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCCGA GAAGGGGCAC GAAAAAGGCT GAGCCACTGC TTCCTTCAAC GATGAAGGAG 
CTGAAGGACA CACTGTGGAA AGCCGCCGAC AAACTGCGCG GATCGCTGTC AGCCAGCCAG
TACAAGGATG TGATCCTGGG CCTGGTGTTC CTCAAGTACG TCTCCGACGC CTACGACGAA
CGCCGCGAGG CGATCCGGAC GGAGCTGGAG GCAGACGGCC TCGACGCCGA ACAGATCGAA
GACCTCATCG AGGATCCCGA GGAGTACCAG GGCTACGGCG TGTTTGTCGT CCCGCCCGGC
GCGCGGTGGA AGTTCCTGGC GGAGAATGCG AAGGGTTTAC CGGCCGCCGG TGGCGAGCCC
GCCAAGAACA TCGGTCAGCT GATCGACGAG GCGATGGACG CCGTGATGAA GGCCAACCCG
ACCCTGCAGG GCACCCTGCC GCGGCTGTAC AACAAGGACA ACATCGACCA GCGCCGGTTG
GGTGAATTGA TCGACCTGTT CAACAGCGCC CGGTTCAGCC GCCAGGGTGA CGGCCGAGCG
CGCGACCTGA TGGGCGAGGT CTACGAGTAC TTCCTCGGCA ACTTCGCGCG GGCGGAAGGG
AAGCGGGGTG GAGAGTTCTT CACCCCGCCG AGCGTGGTGA AGGTGATCGT CGAGGTGCTG
GAACCGTCGC GCGGGCGGGT GTATGACCCG TGCTGCGGAT CGGGCGGCAT GTTCGTGCAG
ACCGAGAAGT TCATCTATGA GCACGACGGC GACCCGAAGG AGATCGCCGT CTACGGTCAG
GAGTCCATCG AGGAGACCTG GCGGATGGCC AAGATGAACC TGGCCATCCA CGGCATCGAC
AACAAGGGCC TGGGTGCGCG CTGGGGCGAT ACCTTTGCCC GTGACCAGCA TCCCGATGTC
CAGATGGATT ACGTGCTGGC CAATCCGCCG TTCAACATCA AGGACTGGGC CCGCAACGAG
GAGGACGCCC GCTGGCGGTT CGGCGTACCG CCGGCCAACA ACGCCAACTA CGCCTGGATC
CAGCACATCC TGTACAAGCT GGCGTCCGGC GGTAAGGCCG GTGTGGTGAT GGCCAATGGG
TCGATGTCGT CGAACTCCAA CGGCGAGGGC GATATCCGGG CCCAGATCGT CGAAGCCGAT
CTGGTGTCAT GCATGATCGC GCTGCCCACC CAGTTGTTTC GCAGCACCGG AATCCCGGTG
TGCGTGTGGT TCTTCGCCAA GGACAAAACC GCAGGTAAGC AGGGCTCGGT CGACCGGTCG
GGGCAGGTGC TGTTCATTGA CGCCCGCGAG ATGGGCTACA TGGTCGACCG CGCTGAGCGC
GCCCTCTCCG ACGACGACAT CGTCAAGATC GGCGACACCT TCCATGCCTG GCGCGGATCG
GCGTCGGCGG CAGCGAAGGG CGTTGTCTAC CAGGATGTCC CAGGCTTTTG TAAGTCGGCG
ACCCTAGCCG AAATCAAGGC TGCCGACTAC GCACTGACAC CGGGACGGTA CGTCGGCGCT
GCGGCCGTCG AGGACGACGG CGAACCGATC GACGAGAAAA TCGCCCGGTT GAAGACGGAA
CTGCTTGCGG CGTTTGATGA GTCGGCGCGG CTGGAGAAGG TGGTTCGAGA GCAGTTGGAG
CGGATCGATG CGTGA
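
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be compared with the Gene Information fields. A minimal Python sketch follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demonstration uses only the first line of the sequence, so its GC value is not expected to equal the whole-gene figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-letter blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGCCGCCGA GAAGGGGCAC GAAAAAGGCT GAGCCACTGC TTCCTTCAAC GATGAAGGAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full blocked sequence through the same two functions should give values that can be compared with the Gene Length and GC content fields.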
 
Protein sequence
MPPRRGTKKA EPLLPSTMKE LKDTLWKAAD KLRGSLSASQ YKDVILGLVF LKYVSDAYDE 
RREAIRTELE ADGLDAEQIE DLIEDPEEYQ GYGVFVVPPG ARWKFLAENA KGLPAAGGEP
AKNIGQLIDE AMDAVMKANP TLQGTLPRLY NKDNIDQRRL GELIDLFNSA RFSRQGDGRA
RDLMGEVYEY FLGNFARAEG KRGGEFFTPP SVVKVIVEVL EPSRGRVYDP CCGSGGMFVQ
TEKFIYEHDG DPKEIAVYGQ ESIEETWRMA KMNLAIHGID NKGLGARWGD TFARDQHPDV
QMDYVLANPP FNIKDWARNE EDARWRFGVP PANNANYAWI QHILYKLASG GKAGVVMANG
SMSSNSNGEG DIRAQIVEAD LVSCMIALPT QLFRSTGIPV CVWFFAKDKT AGKQGSVDRS
GQVLFIDARE MGYMVDRAER ALSDDDIVKI GDTFHAWRGS ASAAAKGVVY QDVPGFCKSA
TLAEIKAADY ALTPGRYVGA AAVEDDGEPI DEKIARLKTE LLAAFDESAR LEKVVREQLE
RIDA
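
The protein sequence above can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch in plain Python. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; since this gene begins with ATG, alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acid assignments shared by the standard code and NCBI translation
# table 11, listed in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGCCGCCGA GAAGGGGCAC GAAAAAGGCT"))  # MPPRRGTKKA
print(translate("CGGATCGATG CGTGA"))                  # RIDA
```

The two outputs match the first and last residues of the protein sequence listed above.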