Gene Mvan_5445 details


Gene Information

Locus tag: Mvan_5445
Symbol:
ID: 4644558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5824202
End bp: 5825860
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 72%
IMG OID: 639808921
Product: methyltransferase small
Protein accession: YP_956221
Protein GI: 120406392
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
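
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5824202
end_bp = 5825860
gene_length_bp = 1659
protein_length_aa = 552

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded
# from the reported protein length.
codons = span // 3
coding_codons = codons - 1

print(span == gene_length_bp)              # True
print(span % 3 == 0)                       # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1659 bases form 553 codons, of which 552 encode amino acids.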


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATGC TTCCCGGTGA CAACGCTGAC CTGCGCAAGG CGCGCGGCGC GTTCTTCACC 
CCGACGCCGG TGGCGCGGTT CCTGACCGAC TGGGCAGTGC GCCACCCCGA GCACTCCGTG
CTCGAACCGT CCTGCGGCGA AGCCGTTTTC CTGCACGAGA TCGGGCAGCG GGGCGGCCAC
ACCGGTCCGC TCGTCGGGGT CGAGATCCAC CCCGGCTCCG CCGAGGAGTC CCAGCGCAGC
CTGCGGGCGC GCGGCATCGA CGCGACCATC CACGCGCGGG ATTTCTTCTC CCACGCCGAG
TTCGGCAGCT ACGACGCGGT CGTCGGCAAC CCGCCGTACG TCCGCTATCA GGATTTCGCC
GGGGAGGCCC GCGCCCGGGC CCGGCAGGCG GCGCTGCGCG CGGGCGTCGC CCTGTCCAAC
CTCGCCTCGT CGTGGGCGGC GTTCACCGTG CATTCCGCGC TGCACCTGCG CCCGGGCGGT
CGGCTGGGCC TGGTGCTGCC CGCCGAACTG CTGAGCGTCA ACTACGCCGC CGGGGTGCGC
CAGTACCTGA TGGACCACTT CGGACAGGTC GGCCTGGTGC TGTTCGACGA GCGCGTGTTC
CCCGGCGTGC TGGAGGAGGT GGTGCTCCTG CTGGCCGACG GCTACCAGCC CGACGGTGGC
CGGGGCGCCA CCCACATGCG GCTGTCCCAG GTCCGCGACG CCGCCGGCCT CGCCCGGCTC
ACCGGGTCGC GGCGCTGGAG CCCGCCCGGC AACGGCGCCA AGTGGTCGGC CGGTCTGATG
TCGGCGGCTG GCCTGTCGGC GTTCGACTCG GTGGTGCGCG GCGACGCGTT CACCTTCCTG
GAGACCTGGG GCGACACCAC TCTGGGCATG GTCACCGGCA GCAACCGCTT CTTCACGCTC
TCGCCGGACA AGGTGGATGC GCTGGGGCTG GCGCCGTCCG ATCTGATCCC GCTGTCCCCG
CCGGGGTCGC GGCATCTGCG CGGTCTGGCC CTGACACCGA AGGCGCTGCG CGCACTCGGC
GAGGAAGGCC GGTCGACGTA CCTGTTCCGC CCGGCCGACG AGCCGTCCCC CGCCGGTTGG
CACTACATCG CCTCCGGTGA GGACCTCGAC GTGCACCGCG CCTACAAATG CCGGGTCCGC
ACTCCGTGGT GGCGGGTGCC GTATCTGCGG CCCGCGGACC TGTTCCTGAC CTACATGAAT
GCCGACACGC CACGGCTGAC CAGCAACCGC GCCCGCACGC ATCACCTCAA TTCGGTGCAC
GGCGTGTATC TGCGCGACGA GTTCCGCCAG GACGGCATGA ACCTGTTGCC GCTGGCGTCG
CTGAACTCGG TGACCCTGCT TGGCGCCGAG ACGGTGGGCC GCGCCTACGG TGGTGGGATG
CTGAAGATCG AGCCACGCGA AGCCGACGTT CTCCCGGTAC CGTCGCCGGA CCTGGTGCGC
CGCAACGCCG ATCAGCTCCG CCGTATCCGC CCCGCGGTCG CCACCTGCCT GCGCGACGCC
CGCCTGCTCG ACGCCGTCGC CCTGGTCGAC GAGGTGCTGC TACTCGGCAC GTCGACCCTG
TCCGAGCGCG CGCTGCGCGC GGTCCGCCGC GACCACGCCG GCCTCACCGC ACGTCGCATC
ACCCGAGGAA AAGTCGGAAA AGCCCACGGT GGCGAGTAA
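
The gene sequence is printed in blocks of 10 bases, so it needs to be joined before any computation such as GC content. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its GC content figure was computed or rounded.

```python
def clean_sequence(text):
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used as a short example input.
first_line = "ATGCGCATGC TTCCCGGTGA CAACGCTGAC CTGCGCAAGG CGCGCGGCGC GTTCTTCACC"
seq = clean_sequence(first_line)

print(len(seq))                        # 60
print(round(100 * gc_fraction(seq)))   # GC percentage of this line only
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.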
 
Protein sequence
MRMLPGDNAD LRKARGAFFT PTPVARFLTD WAVRHPEHSV LEPSCGEAVF LHEIGQRGGH 
TGPLVGVEIH PGSAEESQRS LRARGIDATI HARDFFSHAE FGSYDAVVGN PPYVRYQDFA
GEARARARQA ALRAGVALSN LASSWAAFTV HSALHLRPGG RLGLVLPAEL LSVNYAAGVR
QYLMDHFGQV GLVLFDERVF PGVLEEVVLL LADGYQPDGG RGATHMRLSQ VRDAAGLARL
TGSRRWSPPG NGAKWSAGLM SAAGLSAFDS VVRGDAFTFL ETWGDTTLGM VTGSNRFFTL
SPDKVDALGL APSDLIPLSP PGSRHLRGLA LTPKALRALG EEGRSTYLFR PADEPSPAGW
HYIASGEDLD VHRAYKCRVR TPWWRVPYLR PADLFLTYMN ADTPRLTSNR ARTHHLNSVH
GVYLRDEFRQ DGMNLLPLAS LNSVTLLGAE TVGRAYGGGM LKIEPREADV LPVPSPDLVR
RNADQLRRIR PAVATCLRDA RLLDAVALVD EVLLLGTSTL SERALRAVRR DHAGLTARRI
TRGKVGKAHG GE
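
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used by the database: it assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, and it does not handle alternative start codons, which is acceptable here only because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, then third base
# in T, C, A, G order. Assumption: table 11 shares these assignments with
# the standard genetic code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string (spaces allowed) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCGCATGC TTCCCGGTGA CAACGCTGAC"))  # MRMLPGDNAD
```

The output matches the first block of the protein sequence, and translating the last line of the gene sequence in the same way yields its final residues, TRGKVGKAHGGE.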