Gene Mvan_3789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3789 
Symbol 
ID4645477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4031094 
End bp4033826 
Gene Length2733 bp 
Protein Length910 aa 
Translation table11 
GC content68% 
IMG OID639807254 
Productpeptidase M4, thermolysin 
Protein accessionYP_954577 
Protein GI120404748 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACCAGT GCCCGAGATC GCGCACGCCG AACGGTTCCT GGCGTGGTCT CTACTCCGGA 
GTCGTGCTCG CCGGCATCGG GGTGGGCATG GTCGTCGGGC AGGGTGTCGC GGTAGCGAGC
CCGTCGGAGG CATCGGAACC ATCGAGTAGC TCGCAGTCCG GTGATACGGC CGACGCCGGG
CCCTCGACGG CGTCTCAGAC GCCGACCACC GCGGACGCCG ATCCCGAGGC CGAGGCCGAG
GCCGAGCCCG ACCCGGATCC CGGGCCCGAC GCAGAGCCGG ATCTCGCGGA CGTGGCAGAC
GAAGCCGAAG AAGCCGACGA CATCGAAGTG GCTGCGTCAA CCGACGAGGA TCCGACCCGT
CAGTCACGCA CGCGCACCGA GGAGGCGCCC TCCGCCGACG CCGACGACGC CGACGCCGCC
GACGCCGCCG CCGCCGCCGA CGGCGAAGCC GACGCCGACG CGGCCGAAGA CGGCGAAGCC
GCCCTCGCCA CCTCGGTCGC CGACACCCCG GACCCGGCCG CCGCCGTCGC TGTGGTGTCC
GCAGGCCCGG CGCTACCCGA CGTGGCCACC GCGCCCGCGG CTGCGCTGCG CACCCTGGTC
AGCGCGCGAC CCGTCACCGT GCAGAACATG GTCACCGACG TGCTGACCTG GGTCGGATTG
CGGCCACAGG CCGATGATGG GCCCACCCCG CCGACGCCGG TCTCGGCGCT GGTGCAGTCA
CTGTGGCTGG CGGTACGGCA GACGCAGTAC ACCTGGAACA ATCAGCGGCC CACCGCCGAC
GCAACGACAT CGGGCCCCGG ACTCGACGGC GTCGTGACCG GCAACCTCAA CGCCGTCGAC
TACGACGACT CTGCGCTCAC ATTCACCCTC ACCGGCGGGC CGGCCTACGG CCGGGTCACC
ATCGACGCTT CGGGCGGCTT CACCTACACG CCGGGCGCCG CGGCCGCCGG GCGCGCCGAC
AAGTTCACCG TCCGGATCGA CGACACCTCG GGCAACCCGT TCCACGTGCA CGGGCTACTC
GGGCTGCTGG GAATCACCCG TCCCACCGAG GTGACGGTGG TCATCGCCGC GAGTTCGCCT
GTCCTGCAGA GTGTTCCGTC GGGTCTGACG ATGAGTGAGC TGACGTCGCG GGATGGCGTG
GAGGTCACGC CGGGCCGCAA CGGCGCGGTC GGCGTCATCG ACGGTCGACT CACCGACCAA
CTGGTCGTCA ACGCTGACGA TGCCGCCGCG GTATTGAACT CGCTGGCGTT CGCCTTGGGC
GCCGTCACCG GATTCGCCGA ACCGTCTGCG ATCACCGCCA CCAGCGTCGG CAAGGGAGCC
GACGCCGAGC ACTTCTACCG CTACACCGAA AAGATCGCTG GGGTACCGGT TCTCGGCAGC
GAGGTGATCC TGGTGACCAA CGCCGCCGGC GAGGTCACCA GCGTGTTCAA CTACTACCGC
GGGCTGGGTG AGGGCTTCGG CATCACCCCG GACGCCGCGG TCGACGAGGA TTCCGAGGTG
TGGTCGATCG CCGGCACCGC CTATCTCGGC CCCGACGTAG ACCCTCGCGT GCTCGAAAGC
TTTCTCTCCA CAACCACATT CACCAGGGAA CTGGTGGTCT ACACGCACGA CGACACGACG
TCCGACCTGG CGTGGCGGGT GGTCGTCGTG GTCCCCGACA CCGGTGAGAT GTCGCCTTCA
GGGGCGACGT ACGTCATCGA CGCCGACGGC GCGTCGGCGG GCGATGTCAT CCTGGGCACC
TCCAACGCGC AGGCCGCGAC GTCCATCAGC ATCGCCAAGG ATTGGCTCGG TGAGTCGCGA
GTCATGACCA TCGAAACCCG AACGGTGTTG TGGTTCAGGA CCTACCAACT GATCGACGCG
CCCAGGAACA TCACGACCTA CCGAACGTCC TACTCGTTCT TCGGGCTGGG CAGTCCGAGC
CTGCCGGGAA CCGTCGTCAA GCGCGGCTTC TTCGGTTGGG ATGCGGCCGC GGTGTCGGCG
CACGCCAACA TGGCCGTGGT CTACGACTAC TTCCAGGACG TGCTGGGGCG CACATCCTTC
GACGACGAGG GCGCCCTGGT TTCGGTCAGC ATCCGGTACA ACCCGCGCAC CTCGACCGTA
GGCTATGCGA ATGCGTTCTG GGATCCGAGC CGACAGCTCT TCGCTTTCGG CGACGCCGGC
TACTTCCAGG CGAGCGTCGA CGTCGTCGCG CATGAGTTCA CCCACGCAGT GGTGTCCTAC
GTCGTCGGCG ACGGCGGTTC GGTGCTCGAC AACGGCGAGT CCGGAGCCCT CAACGAGGCC
TACAGCGATA TTTTCGGTGT GCTGGTCGAG GGCAAGACCG GCGACGGAAG ATGGCTGATC
GGCGAGGACT CCGACCACGG TGTGATTCGC AACCTCGCCG ACCCGGAATC CATCAGAACG
GCCTACGGCC CGTACCGGGC CCGCGTTAGC GACATGTACT CCGGCGAGGG CGACGACCGC
GGTGAGCACG TCAACAGCAC CGTGTTCAGC CACGCGGCGT ATCTGATGAT GACCGACGCC
GACACCGCCG GTGTCTCGGA CGAGACGTGG GCCAAGGTTT TCTACCACTC GCTGGGGCTC
GGCACCAGTG CCAGGTTCGT CGACGGCCGG GCCGCTGTGC TCAGCAGCGC CGGCGCGCAG
GGGTTGACGG CCACTCAGCT CGCCGCAATC GCGCGGGCAT TCGACACCGT CGAGATCTAC
GGTGCGGCAC CGTCATCCGT CATCGCCGTC TGA
 
Protein sequence
MDQCPRSRTP NGSWRGLYSG VVLAGIGVGM VVGQGVAVAS PSEASEPSSS SQSGDTADAG 
PSTASQTPTT ADADPEAEAE AEPDPDPGPD AEPDLADVAD EAEEADDIEV AASTDEDPTR
QSRTRTEEAP SADADDADAA DAAAAADGEA DADAAEDGEA ALATSVADTP DPAAAVAVVS
AGPALPDVAT APAAALRTLV SARPVTVQNM VTDVLTWVGL RPQADDGPTP PTPVSALVQS
LWLAVRQTQY TWNNQRPTAD ATTSGPGLDG VVTGNLNAVD YDDSALTFTL TGGPAYGRVT
IDASGGFTYT PGAAAAGRAD KFTVRIDDTS GNPFHVHGLL GLLGITRPTE VTVVIAASSP
VLQSVPSGLT MSELTSRDGV EVTPGRNGAV GVIDGRLTDQ LVVNADDAAA VLNSLAFALG
AVTGFAEPSA ITATSVGKGA DAEHFYRYTE KIAGVPVLGS EVILVTNAAG EVTSVFNYYR
GLGEGFGITP DAAVDEDSEV WSIAGTAYLG PDVDPRVLES FLSTTTFTRE LVVYTHDDTT
SDLAWRVVVV VPDTGEMSPS GATYVIDADG ASAGDVILGT SNAQAATSIS IAKDWLGESR
VMTIETRTVL WFRTYQLIDA PRNITTYRTS YSFFGLGSPS LPGTVVKRGF FGWDAAAVSA
HANMAVVYDY FQDVLGRTSF DDEGALVSVS IRYNPRTSTV GYANAFWDPS RQLFAFGDAG
YFQASVDVVA HEFTHAVVSY VVGDGGSVLD NGESGALNEA YSDIFGVLVE GKTGDGRWLI
GEDSDHGVIR NLADPESIRT AYGPYRARVS DMYSGEGDDR GEHVNSTVFS HAAYLMMTDA
DTAGVSDETW AKVFYHSLGL GTSARFVDGR AAVLSSAGAQ GLTATQLAAI ARAFDTVEIY
GAAPSSVIAV