Gene Mmcs_3470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3470 
Symbol 
ID4112302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3689389 
End bp3692892 
Gene Length3504 bp 
Protein Length1167 aa 
Translation table11 
GC content71% 
IMG OID638032605 
Productamino acid adenylation 
Protein accessionYP_640633 
Protein GI108800436 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3433] Aryl carrier domain 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAGGCCG TTGTGACCAG TTCGCAGACT GTACGGGCCG AGGTTGCCGA ACTTCTGGGA 
ATCGAGGAGT CCGCACTCGA TCCGGATGCC GACCTGATCG CCTCGGGCCT GGACTCCATC
CGCATGATGT CGCTGTCGGG ACGCTGGCGG AAACAGGGCA TCGACGTGCG GTTCGCGGCG
ATGGCGGCGA ACCCCACCGT GGCCGCCTGG ACCCGGCTCG TCGGTGAACG CACCGCGGAA
AGCCCCGGTG CGGCAACGCA ATCGGGTGAC ACTGCGGCGA GCGCCGGAGA TCCCGACGCG
CCGTTCCCGC TGGCGCCGAT CCAGCACGCG CTGTGGGTGG GCCGCAACGA GCTCACCGAA
CTCGGCGGCG TGGCCGCCCA CCTCTACGTC GAATTCGACG GTGCGGGGGT GGATCCCGAG
CGCCTCCGTA CCGCGGCCGC CGCGCTCGCC GCGCGCCACC CCATGCTGCG CGTCGACATC
CTCGGCGACG GCATGCAGCG CATCAGCGAC CGCGATCTGC CCGTCAAGGT GACCGACCTT
CGACACCTCG ACGTCGCCGA CGCCGAACAA CAGCTCGAGG TCATCCGCCA CGCCAAATCA
CACCAACTGC TCGAGGGCGA GGTGCTGGAG CTGGCACTGA CCCTGTTGCC CGACGGGCGC
ACCCGTCTGC ACGTCGACCT CGACATGCAG GCCGCCGACG CCGTGAGCTA CCGCAATTTC
ATGGCCGACC TCGCCGCGCT CTACCGCGGC GCGCAGCTGC CCGAGTTGCA GTACACCTAT
CGCCAATACC GCAGCGCGTT CACCGCCACG CCCGCGCCGA CCGTCGACGA GGACCGCCGG
TGGTGGACCG AGCGCATCCC GGATCTCCCC GAACCGCCCG CGCTGCCGCT GGTTCCGCGT
GCCGAACAGC GTGACCCGCG CCGCGGCACC CGACGCTGGC ACTTCCTCGA CACCGACATC
CGTGACCGGC TCTTCGCCGC CGCCCGCGCG CGCGGCATCA CACCGGCGAT GGCGTTCGCC
GCGAGCTACG CCGGCACGCT CGCCCGGTGG TCGACCAGCC GCCACTTCCT GCTCAACCTG
CCGATGTTCG GCCGCGAGCC GTTCCACCCG GACGTCGACA AACTCGTCGG CGACTTCACC
TCGTCACTGA TGCTCGACGT CGACTTCACC GAGGCGCACA CCCCGGCGCA GCGGGCGCGG
GTGATGCAGG AGGCGCTGCA CACCTCCGCG GAACACGCGA CCTACTCGGG TCTGTCGGTG
CTGCGCGATC TGAGCCGCCA TCACGGTTCG CCCTCGCTGG CGCCGTTCGT GTTCACCAGC
GCGCTCGGCC TGGGCGATCT GTTCGCCGGT GACGTCACCG ACCAGTTCGG CACCCCGGTC
TGGCACATCT CCCAGGGCCC GCAGGTGCTG CTCGACGCGC AGGTGACGCC GTTCGACGGG
GGACTGCTGG TCAACTGGGA CGTCCGCGAG GACGCGTTCC GGCCCGGCGT CATCGACGCG
ATGTTCGCCT ACCAACTCGC CGAACTCGAA CGGCTCGCCG CCGACGACGC GGCCTGGGAC
GCCGCCGATC CGCCCGCGGT GCCACCCGCG CAACGCGCGG TCCGCGACGC GGTGAACGAC
ACCGGCGCCC GGCGCAGCGA CGACGCGCTG CACGACGGGT TCTTCCGCAC CGCCGCACAC
ACACCCGACG CCACCGCGGT GATCGGCTCG ACCGGCACCC TCACCTATGC CGAACTGCGC
GAACGGGTGC TGGCGGTCAC CGGTGCGCTT CAGGTGGCGG GCATCAAGCC GGGGGACACC
GTCGCGGTGA TGGGCCCCAA GTGCGCCGAT CAGGTCACCG CGCTGCTGGC CATCCACGCC
GCAGGCGCGG TGTACGTACC GATCGGCGCC GATCAACCCG CCGACCGTGC GGACAGCATC
CTGCAGACCG CAGGCGTGCG GATGGCGCTG GCGTGCGGGG ACGAACCCCC GACCTTCCTG
CCCGCACTGA CCATCGCCGA GGCGGTCCGG GTCGGATCGC GGGTGCACGG TGTCACCCCC
GCCACGGTCG AGCCCGACCG GGTCGCCTAC GTGCTGTTCA CGTCCGGCTC CACCGGCGCC
CCCAAGGGCG TCGAGGTCAC CCACGCCGCG GCGATGAACA CCCTCGAATT CATCAACGAC
CACTTCGGGA TCGGGCCTTC CGACCGGAGC CTCGCGTTGT CCACGCTCGA AGGTGATCTG
TCCGTACTCG ACGTCTTCGG GATGCTGCGC GCCGGCGGGT CACTGGTCGT GGTGGACGAA
GCGCAGCGCC GCGACCCCGA CAGCTGGGCA CGGTTGATCG CCGAGCACTC GGTGACGGTG
TTGCACTGGA TGCCCGGCTG GCTTGAGATG CTGCTCGAGG TGGGCGGTGC GCTGCCGTCG
GTGCGGGTGG TGCCCACCGG CGGCGACTGG GTGCGCACCG AGATGGTTCG CGAACTGCGC
AGGGCCGCAC CGGGTGTGCG GTTCGCCGGC CTCGGCGGCG CGACGGAGAC CGCGATCCAC
AACACCATCT GCGAACCCGG TGAGCTGCCG CGGGAGTGGT CCGCCGTCCC GTTCGGCCGT
CCGCTGCCCA ACAACGCCTG CCGTGTCGTC GCCGCCGACG GTGCCGACTG CCCGGACTGG
GTGCCCGGAG AACTGTGGGT CGGCGGACGC GGCATCGCCC GCGGGTACCG GGGTAGGCCC
GACCTGACCG CCGAACGGTT CGTCGTCCAC GACGGCCGGA CCTGGTACCG CACCGGCGAT
CTCGTCCGTT ACCTGCCCGA CGGTCAGATC GACTTCGTCG GTCGCGCCGA CCACCGCGTC
AAGATCAGCG GATACCGCAT CGAACTCGGC GAGGTGGAGG CTGCACTGCG CCGCATCGCC
GGCGTCGAGG CCGCCGTGGC CGCGGTGCTG ACGGCCCCCG GTGACGGCCG CGGCGAGCAG
CTGGCCGCCA TCGTGCGGGC ATCGTCGCCC GCGGTGACGG TCGACGAGCT GACCCGCCGT
ATGGCCGAAC TCGTTCCGCC ACACATGGTT CCGAGCCACA TCGCGCTGGT CGAGGCGGTC
CCGTTCACGG TCGGCGGCAA GATCGACCGC AGGGCGGTCA CCGCGGAGCT GACCCGCAGC
ATGGCCGAAC GAGCGAACGC TCAGGCGCCG ACGTACCGGG TGCCGTCGAC GGCGCTCGAG
CGGGCGCTGG CCGACATCGT GTCCACCGTG CTGGACCGCG ACAGCGTCGG CGCCGACGAC
GACTTCTTCG AACTCGGCGG CGATTCGGTG CTGGCCACCC AGGCGGTGGC GCGGATCCGC
GAGTGGCTCG ACTCCCCGGG GGTGATGGTC ACCGACATCT TCGCCGCACG CAGGGTCGGT
GCGCTGGCCC GGCGGCTGGT CGACCACGAG TCCGGCAGCG ACCGCCTCGA AGGCGTCGCC
GAGCTCTACC TCGAAGTCGC GGACATGAAC TCCGCCGATG TGGCGTCGGC CCTGCACTCG
ACGTCCGCGC AGGCGTCGCG ATGA
 
Protein sequence
MEAVVTSSQT VRAEVAELLG IEESALDPDA DLIASGLDSI RMMSLSGRWR KQGIDVRFAA 
MAANPTVAAW TRLVGERTAE SPGAATQSGD TAASAGDPDA PFPLAPIQHA LWVGRNELTE
LGGVAAHLYV EFDGAGVDPE RLRTAAAALA ARHPMLRVDI LGDGMQRISD RDLPVKVTDL
RHLDVADAEQ QLEVIRHAKS HQLLEGEVLE LALTLLPDGR TRLHVDLDMQ AADAVSYRNF
MADLAALYRG AQLPELQYTY RQYRSAFTAT PAPTVDEDRR WWTERIPDLP EPPALPLVPR
AEQRDPRRGT RRWHFLDTDI RDRLFAAARA RGITPAMAFA ASYAGTLARW STSRHFLLNL
PMFGREPFHP DVDKLVGDFT SSLMLDVDFT EAHTPAQRAR VMQEALHTSA EHATYSGLSV
LRDLSRHHGS PSLAPFVFTS ALGLGDLFAG DVTDQFGTPV WHISQGPQVL LDAQVTPFDG
GLLVNWDVRE DAFRPGVIDA MFAYQLAELE RLAADDAAWD AADPPAVPPA QRAVRDAVND
TGARRSDDAL HDGFFRTAAH TPDATAVIGS TGTLTYAELR ERVLAVTGAL QVAGIKPGDT
VAVMGPKCAD QVTALLAIHA AGAVYVPIGA DQPADRADSI LQTAGVRMAL ACGDEPPTFL
PALTIAEAVR VGSRVHGVTP ATVEPDRVAY VLFTSGSTGA PKGVEVTHAA AMNTLEFIND
HFGIGPSDRS LALSTLEGDL SVLDVFGMLR AGGSLVVVDE AQRRDPDSWA RLIAEHSVTV
LHWMPGWLEM LLEVGGALPS VRVVPTGGDW VRTEMVRELR RAAPGVRFAG LGGATETAIH
NTICEPGELP REWSAVPFGR PLPNNACRVV AADGADCPDW VPGELWVGGR GIARGYRGRP
DLTAERFVVH DGRTWYRTGD LVRYLPDGQI DFVGRADHRV KISGYRIELG EVEAALRRIA
GVEAAVAAVL TAPGDGRGEQ LAAIVRASSP AVTVDELTRR MAELVPPHMV PSHIALVEAV
PFTVGGKIDR RAVTAELTRS MAERANAQAP TYRVPSTALE RALADIVSTV LDRDSVGADD
DFFELGGDSV LATQAVARIR EWLDSPGVMV TDIFAARRVG ALARRLVDHE SGSDRLEGVA
ELYLEVADMN SADVASALHS TSAQASR