Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_3789 |
Symbol | |
ID | 4645477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 4031094 |
End bp | 4033826 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639807254 |
Product | peptidase M4, thermolysin |
Protein accession | YP_954577 |
Protein GI | 120404748 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACCAGT GCCCGAGATC GCGCACGCCG AACGGTTCCT GGCGTGGTCT CTACTCCGGA GTCGTGCTCG CCGGCATCGG GGTGGGCATG GTCGTCGGGC AGGGTGTCGC GGTAGCGAGC CCGTCGGAGG CATCGGAACC ATCGAGTAGC TCGCAGTCCG GTGATACGGC CGACGCCGGG CCCTCGACGG CGTCTCAGAC GCCGACCACC GCGGACGCCG ATCCCGAGGC CGAGGCCGAG GCCGAGCCCG ACCCGGATCC CGGGCCCGAC GCAGAGCCGG ATCTCGCGGA CGTGGCAGAC GAAGCCGAAG AAGCCGACGA CATCGAAGTG GCTGCGTCAA CCGACGAGGA TCCGACCCGT CAGTCACGCA CGCGCACCGA GGAGGCGCCC TCCGCCGACG CCGACGACGC CGACGCCGCC GACGCCGCCG CCGCCGCCGA CGGCGAAGCC GACGCCGACG CGGCCGAAGA CGGCGAAGCC GCCCTCGCCA CCTCGGTCGC CGACACCCCG GACCCGGCCG CCGCCGTCGC TGTGGTGTCC GCAGGCCCGG CGCTACCCGA CGTGGCCACC GCGCCCGCGG CTGCGCTGCG CACCCTGGTC AGCGCGCGAC CCGTCACCGT GCAGAACATG GTCACCGACG TGCTGACCTG GGTCGGATTG CGGCCACAGG CCGATGATGG GCCCACCCCG CCGACGCCGG TCTCGGCGCT GGTGCAGTCA CTGTGGCTGG CGGTACGGCA GACGCAGTAC ACCTGGAACA ATCAGCGGCC CACCGCCGAC GCAACGACAT CGGGCCCCGG ACTCGACGGC GTCGTGACCG GCAACCTCAA CGCCGTCGAC TACGACGACT CTGCGCTCAC ATTCACCCTC ACCGGCGGGC CGGCCTACGG CCGGGTCACC ATCGACGCTT CGGGCGGCTT CACCTACACG CCGGGCGCCG CGGCCGCCGG GCGCGCCGAC AAGTTCACCG TCCGGATCGA CGACACCTCG GGCAACCCGT TCCACGTGCA CGGGCTACTC GGGCTGCTGG GAATCACCCG TCCCACCGAG GTGACGGTGG TCATCGCCGC GAGTTCGCCT GTCCTGCAGA GTGTTCCGTC GGGTCTGACG ATGAGTGAGC TGACGTCGCG GGATGGCGTG GAGGTCACGC CGGGCCGCAA CGGCGCGGTC GGCGTCATCG ACGGTCGACT CACCGACCAA CTGGTCGTCA ACGCTGACGA TGCCGCCGCG GTATTGAACT CGCTGGCGTT CGCCTTGGGC GCCGTCACCG GATTCGCCGA ACCGTCTGCG ATCACCGCCA CCAGCGTCGG CAAGGGAGCC GACGCCGAGC ACTTCTACCG CTACACCGAA AAGATCGCTG GGGTACCGGT TCTCGGCAGC GAGGTGATCC TGGTGACCAA CGCCGCCGGC GAGGTCACCA GCGTGTTCAA CTACTACCGC GGGCTGGGTG AGGGCTTCGG CATCACCCCG GACGCCGCGG TCGACGAGGA TTCCGAGGTG TGGTCGATCG CCGGCACCGC CTATCTCGGC CCCGACGTAG ACCCTCGCGT GCTCGAAAGC TTTCTCTCCA CAACCACATT CACCAGGGAA CTGGTGGTCT ACACGCACGA CGACACGACG TCCGACCTGG CGTGGCGGGT GGTCGTCGTG GTCCCCGACA CCGGTGAGAT GTCGCCTTCA GGGGCGACGT ACGTCATCGA CGCCGACGGC GCGTCGGCGG GCGATGTCAT CCTGGGCACC TCCAACGCGC AGGCCGCGAC GTCCATCAGC ATCGCCAAGG ATTGGCTCGG TGAGTCGCGA GTCATGACCA TCGAAACCCG AACGGTGTTG TGGTTCAGGA CCTACCAACT GATCGACGCG CCCAGGAACA TCACGACCTA CCGAACGTCC TACTCGTTCT TCGGGCTGGG CAGTCCGAGC CTGCCGGGAA CCGTCGTCAA GCGCGGCTTC TTCGGTTGGG ATGCGGCCGC GGTGTCGGCG CACGCCAACA TGGCCGTGGT CTACGACTAC TTCCAGGACG TGCTGGGGCG CACATCCTTC GACGACGAGG GCGCCCTGGT TTCGGTCAGC ATCCGGTACA ACCCGCGCAC CTCGACCGTA GGCTATGCGA ATGCGTTCTG GGATCCGAGC CGACAGCTCT TCGCTTTCGG CGACGCCGGC TACTTCCAGG CGAGCGTCGA CGTCGTCGCG CATGAGTTCA CCCACGCAGT GGTGTCCTAC GTCGTCGGCG ACGGCGGTTC GGTGCTCGAC AACGGCGAGT CCGGAGCCCT CAACGAGGCC TACAGCGATA TTTTCGGTGT GCTGGTCGAG GGCAAGACCG GCGACGGAAG ATGGCTGATC GGCGAGGACT CCGACCACGG TGTGATTCGC AACCTCGCCG ACCCGGAATC CATCAGAACG GCCTACGGCC CGTACCGGGC CCGCGTTAGC GACATGTACT CCGGCGAGGG CGACGACCGC GGTGAGCACG TCAACAGCAC CGTGTTCAGC CACGCGGCGT ATCTGATGAT GACCGACGCC GACACCGCCG GTGTCTCGGA CGAGACGTGG GCCAAGGTTT TCTACCACTC GCTGGGGCTC GGCACCAGTG CCAGGTTCGT CGACGGCCGG GCCGCTGTGC TCAGCAGCGC CGGCGCGCAG GGGTTGACGG CCACTCAGCT CGCCGCAATC GCGCGGGCAT TCGACACCGT CGAGATCTAC GGTGCGGCAC CGTCATCCGT CATCGCCGTC TGA
|
Protein sequence | MDQCPRSRTP NGSWRGLYSG VVLAGIGVGM VVGQGVAVAS PSEASEPSSS SQSGDTADAG PSTASQTPTT ADADPEAEAE AEPDPDPGPD AEPDLADVAD EAEEADDIEV AASTDEDPTR QSRTRTEEAP SADADDADAA DAAAAADGEA DADAAEDGEA ALATSVADTP DPAAAVAVVS AGPALPDVAT APAAALRTLV SARPVTVQNM VTDVLTWVGL RPQADDGPTP PTPVSALVQS LWLAVRQTQY TWNNQRPTAD ATTSGPGLDG VVTGNLNAVD YDDSALTFTL TGGPAYGRVT IDASGGFTYT PGAAAAGRAD KFTVRIDDTS GNPFHVHGLL GLLGITRPTE VTVVIAASSP VLQSVPSGLT MSELTSRDGV EVTPGRNGAV GVIDGRLTDQ LVVNADDAAA VLNSLAFALG AVTGFAEPSA ITATSVGKGA DAEHFYRYTE KIAGVPVLGS EVILVTNAAG EVTSVFNYYR GLGEGFGITP DAAVDEDSEV WSIAGTAYLG PDVDPRVLES FLSTTTFTRE LVVYTHDDTT SDLAWRVVVV VPDTGEMSPS GATYVIDADG ASAGDVILGT SNAQAATSIS IAKDWLGESR VMTIETRTVL WFRTYQLIDA PRNITTYRTS YSFFGLGSPS LPGTVVKRGF FGWDAAAVSA HANMAVVYDY FQDVLGRTSF DDEGALVSVS IRYNPRTSTV GYANAFWDPS RQLFAFGDAG YFQASVDVVA HEFTHAVVSY VVGDGGSVLD NGESGALNEA YSDIFGVLVE GKTGDGRWLI GEDSDHGVIR NLADPESIRT AYGPYRARVS DMYSGEGDDR GEHVNSTVFS HAAYLMMTDA DTAGVSDETW AKVFYHSLGL GTSARFVDGR AAVLSSAGAQ GLTATQLAAI ARAFDTVEIY GAAPSSVIAV
|
| |