Gene Mmcs_4770 details


Gene Information

Locus tag: Mmcs_4770
Symbol:
ID: 4113599
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 5045389
End bp: 5047740
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 68%
IMG OID: 638033921
Product: Mername-AA223 peptidase
Protein accession: YP_641930
Protein GI: 108801733
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
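
The coordinate and length fields above are consistent with one another, which a little arithmetic confirms. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length. The function name `lengths_from_coordinates` is illustrative and not part of the record.

```python
def lengths_from_coordinates(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes inclusive 1-based coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = gene_length // 3
    protein_length = codons - 1           # the stop codon encodes no residue
    return gene_length, protein_length


gene_length, protein_length = lengths_from_coordinates(5045389, 5047740)
print(gene_length, protein_length)  # 2352 783
```

Both values agree with the Gene Length and Protein Length fields listed in the record.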


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGGA AAAATGTGAT CCGCACGCTG ATCGTGATCG CGGTCGTGTT GTTGCTGGGC 
TGGTCGTTCT TTTATTTCAG TGACGACACC CGTGGATTCA AGCCCGTCGA CACCTCGGTG
GCGATCTCGC AGATCAACAG CGACAACGTC AAGAGCGCCC AGATCGACGA TCGTGAACAG
CAGTTGCGGC TCGAACTGAA GAGCGGCAAC GGCGAGACCG AGGACAGCGA CAAGATCCTC
ACGAAGTATC CGACCGGGTA CGCGGTCACG CTGTTCGAAT CGCTGCAGGA CAAGAACGTC
AAGATCAACA CGGTCGTCAA CCAGGGCAGC GTGCTGGGGT CACTCCTCAT CTACATGCTC
CCGCTGCTGC TGTTGGTCGC GCTGTTCGTC TTCTTCTCCC GGATGCAGAC CGGCGGCCGG
ATGGGCTTCG GCTTCGGCAA GTCGCGGGCC AAGCAGCTCA CCAAGGACAT GCCCAAGACG
ACGTTCGCCG ACGTCGCCGG CGCGGATGAG GCGGTCGAAG AGCTCTACGA GATCAAGGAC
TTCCTGCAGA ACCCGTCGCG CTATCAGGCG CTGGGCGCCA AGATCCCCAG GGGCGTGCTG
CTCTACGGTC CGCCCGGCAC GGGCAAGACC CTGCTCGCCC GCGCCGTCGC AGGTGAAGCG
GGTGTCCCGT TCTTCACGAT CTCGGGTTCG GACTTCGTCG AGATGTTCGT CGGCGTCGGC
GCCTCGCGCG TGCGCGACCT GTTCGAACAG GCCAAGCAGA ACAGCCCGTC GATCATCTTC
GTCGACGAGA TCGACGCGGT CGGTCGCCAG CGCGGCGCCG GCCTCGGCGG CGGTCACGAC
GAACGCGAGC AGACGCTCAA CCAGCTGCTG GTGGAGATGG ACGGATTCGG CGACCGCCAG
GGCGTCATCC TCATCGCGGC CACCAACCGG CCCGACATCC TCGACCCGGC GCTGCTGCGC
CCGGGCCGCT TCGACCGGCA GATCCCGGTC ACCAGCCCCG ACCTCGCCGG CCGCCGCGCC
GTGCTCAAGG TTCACTCGCA GGGCAAGCCG ATGGCCGGTG ACGCCGACCT CGACGGGCTG
GCCAAGCGCA CCGTCGGCAT GTCCGGCGCC GACCTGGCCA ACGTCATCAA CGAGGCCGCG
CTGCTCACTG CCCGCGAGAA CGGCACCGTC ATCACGGGTC TCGCGCTCGA GGAGGCCGTC
GACCGGGTGG TCGGCGGACC CCGCCGCAAG AGCCGCATCA TCAGCGAGCA CGAGAAGAAG
ATCACCGCCT ACCACGAGGG TGGCCACACG CTGGCCGCCT GGGCGATGCC CGACATCGAG
CCCATCTACA AGGTGACGAT CCTGGCCCGC GGCCGCACCG GCGGCCACGC GGTCGCCGTC
CCCGAGGACG ACAAGGGTCT GATGACCCGC TCGGAGATGA TCTCCCGGCT GGTGTTCGCC
ATGGGTGGCC GCGCCGCCGA GGAACTCGTG TTCCGCGAGC CGACCACCGG CGCGGTGTCC
GACATCCAGC AGGCCACCAA GATCGCGCGC GCGATGGTCA CCGAGTACGG CATGAGCAGC
AAGCTCGGCG CGGTGCGGTA CGGCACCGAA CACGGCGACC CGTTCCTCGG CCGCACGATG
GGCACTTCGT CGGACTACAG CCACGAGGTC GCGCAGATCA TCGACGACGA GGTGCGCAAG
CTCATCGAGG CCGCCCACAC CGAGGCGTGG GAGATCCTCA CCGAGTACCG CGACATCCTC
GACACCCTCG CCGGTGAGCT CCTGGAGAAG GAGACCCTGC ACCGCGTCGA ACTCGAGGCG
ATCTTCGGCG ACGTCAAGAA GCGCCCGCGC CTGACGATGT TCGACGACTT CGGTGGCCGC
GTGCCGTCGG ACAAGCCGCC CATCAAGACT CCGGGCGAGC TGGCGATCGA GCGCGGTGAG
GAGTGGCCGC AGCCCAAGCC CGAACCCGCG TTCAAGGCGG CGATCGCGGC GGCCAGCAAG
GCCGCCGAGG AAGCCGCCGC CCGCAGCAAC GGCGCGAACG GATCCAGCGG TGCCAACGGC
TCCCCGAACG GGGCGCCCAA CGGTGCGACG CAGCCCGACT ACGGTGCGCC CGCAGGCTGG
CATGCGCCGG GCTGGCCGCC GCAGCAGCAG CCGTCGGGCC AGCAGGGCGG CTACTGGTAT
CCGCCGCCCT CCCCGAATCC GGGCTGGGGC GAACCGCCGC GGCAGCAGCA GCCCTACCCG
CCGTACCAGC AGCACTACCC GCAGCCCGGT CACGGGCCGC AGGGTGCCAA TCCGCCACGC
GACCCGCAGC AGGAGCGGGG CCGCGACGGC GACCGTCCCG GTGACCGTCC GAACCCGCCT
GCGCAGCACT GA
 
Protein sequence
MNRKNVIRTL IVIAVVLLLG WSFFYFSDDT RGFKPVDTSV AISQINSDNV KSAQIDDREQ 
QLRLELKSGN GETEDSDKIL TKYPTGYAVT LFESLQDKNV KINTVVNQGS VLGSLLIYML
PLLLLVALFV FFSRMQTGGR MGFGFGKSRA KQLTKDMPKT TFADVAGADE AVEELYEIKD
FLQNPSRYQA LGAKIPRGVL LYGPPGTGKT LLARAVAGEA GVPFFTISGS DFVEMFVGVG
ASRVRDLFEQ AKQNSPSIIF VDEIDAVGRQ RGAGLGGGHD EREQTLNQLL VEMDGFGDRQ
GVILIAATNR PDILDPALLR PGRFDRQIPV TSPDLAGRRA VLKVHSQGKP MAGDADLDGL
AKRTVGMSGA DLANVINEAA LLTARENGTV ITGLALEEAV DRVVGGPRRK SRIISEHEKK
ITAYHEGGHT LAAWAMPDIE PIYKVTILAR GRTGGHAVAV PEDDKGLMTR SEMISRLVFA
MGGRAAEELV FREPTTGAVS DIQQATKIAR AMVTEYGMSS KLGAVRYGTE HGDPFLGRTM
GTSSDYSHEV AQIIDDEVRK LIEAAHTEAW EILTEYRDIL DTLAGELLEK ETLHRVELEA
IFGDVKKRPR LTMFDDFGGR VPSDKPPIKT PGELAIERGE EWPQPKPEPA FKAAIAAASK
AAEEAAARSN GANGSSGANG SPNGAPNGAT QPDYGAPAGW HAPGWPPQQQ PSGQQGGYWY
PPPSPNPGWG EPPRQQQPYP PYQQHYPQPG HGPQGANPPR DPQQERGRDG DRPGDRPNPP
AQH
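
The protein sequence above can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns the same amino acids to the 64 codons as the standard code (the tables differ in which start codons are permitted). The names `CODON_TABLE`, `clean` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_block = "ATGAACCGGA AAAATGTGAT CCGCACGCTG ATCGTGATCG CGGTCGTGTT GTTGCTGGGC"
last_block = "GCGCAGCACT GA"

print(translate(first_block))  # MNRKNVIRTLIVIAVVLLLG
print(translate(last_block))   # AQH
```

The two outputs match the first twenty and the last three residues of the listed protein sequence. The last line can be translated on its own only because the 39 preceding lines hold 2340 bases, a multiple of three, so the reading frame is preserved.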