Gene Mmcs_4036 details


Gene Information

Locus tag: Mmcs_4036
Symbol:
ID: 4112866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 4303958
End bp: 4308013
Gene Length: 4056 bp
Protein Length: 1351 aa
Translation table: 11
GC content: 71%
IMG OID: 638033179
Product: hypothetical protein
Protein accession: YP_641197
Protein GI: 108801000
COG category:
COG ID:
TIGRFAM ID:
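
The coordinates and lengths in this record can be cross-checked with a short calculation. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4303958
end_bp = 4308013
gene_length_bp = 4056
protein_length_aa = 1351

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 4056 1352 1351
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.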


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGTTG CTGTTCGGTC ATACTTGACC GCCGGTGTGG CTGTTGTAGG CGCTACATCG 
ATCGCGTTGG CTCCGGTCGA AGTGCTGCCG CCGGACTTCC AGATTCGTGG CGACCGGATG
GTCGCGGTGC TCGAAGATGT CTCGCTGTCC TCACTGGCTG ATCTGATCGC CGCCGCGCAG
GCTGCGTGGG CGCCGGTCGG CGGTGCGGTG GACAGCGCCT CTCAGGCCGC CCAGGCCGCC
ATCGTCGGGC TGGGTGGGGC CCTCGAAGCG GCCCTCGTGA GCGCCGTCGA AGAAGGCAAC
AACGCCTTCC AGGCGGTCGT CGAAGGGCTG CTCGACGGCG GCAGTGTGCT CGCGACCGCA
ATCGACAACG CGATCGCCAA CTTCCCGTCG CCGCAGGCGT TCGTCGATGC CGTGGTGTCG
GCCTTCGCCT CCGTCAACCT CGATCTCGCG GGCGTCATCG AGGTCGCGCT CGAGGCCGCC
CTCGAACTGG GCATCGAGAT CGACGCCGGC GCCATCGGCG AGGCGTTCCT CGAGGCCGGG
GCCGCGCTGG AGGCCGCGTT CAACGCCGCC CTGGCCGCCT TCCCCACGCC CGAGGAGTTC
GTCGCCGCAT TCGAGGCCGC GCTCGGTGAT GTGATCCCGG GCTTCAACGC CGTCGCCACG
GCCGCGGTCG AGGCGTTCAC CGAACTCTCG GCCGAACTCG CCGCCGCGAT CGATGCGGGT
ATCGCCGCGG GCGTCACCGC ATTCGACGCG TTCGTCGCAG GCCTGCTCGA CGCCGGTGGT
GAACTGGCCG CGGCGTTCAA CGCCGCACTG GAAGCCTTCC CGACCCCCGA GGCGTTCATC
GACGCCCTGG TGTCGGCCTT CGGCGCCATC GACGTCGACC TCGGGGCTGC CCTCGACGTC
GCACTCGAGG CGCTCGTCAA CCTGGGCATC GAGATCGACG CCGGCGCCAT TGCCGACGCG
TTCATCGAGG CCGGCGCCGA TCTGGCGGCC GCCTTCGAGG GAGTCTTCGA GGGCTTCCCG
ACCCCGGCCG AGTTCTTCGC CGCCTTCAAC GCCGCGATCG CGGGCGTCCT GCCCGGGCTC
AACGCGTTCG CCGAAGCCGC CGCCGCATCG CTCACCGAGC TGTCGGGTGC GCTGGCCGCG
GCCATCGACG CGGGCGCGGC TGCCGGCATC GAGGCGTTCG ACGCGATCGT CACGGGCCTG
CTGGACGCGG GTAGCGATCT GTCCGCCGCG TTCGCCGCGG CGATCGAGGC CTTCCCGACA
CCCGAGGCAT TCCTCGACGC GCTGGTGAAC GCGTTCGCCG CCATCGACGT CGACCTGGGT
GCCGCGCTCG ACGTCGCACT CGAGGCGCTC GTCGACATCG GCCTCGCACT CGACACCACG
GCCATCGCCC AGGCTCTCAT CGACGCCGGC GCCGACCTGG CCGCCGTGTT CGAGGGTGCG
ATCGAGGGGT TCCCGACCCC GGCCGACTTC TTCGCCGCCT TCAACGCCGC CATCGCCGGA
CTCGTCCCGG GCTTCAACGC ATTCGCCGAA GCCGCCGCCG AGGCGTTCAC CGAACTGTCG
GGTGCGCTGT CGGGCGCCCT GGAGGCCGGT TTCGCCGCGG GTCTGGACGC CTTCGAATCC
GTCATCGCCG CGCTGCAGGA CGCCGGGGGT GAGCTCGCCG CCGCGTTCGC CGCAGCGATC
GAGGCCTTCC CGACGCCCGA GGCGTTCGTC GACGCCCTGG TGTCGGCCTT CGGCGCCATC
GACGTCGACC TCGGCGCCGC GCTCGACGTC GCGCTCGAGG CGCTCGTCGA GCTCGGCATC
GACCTGGACG CCGGTGCGAT CGCCGACGCG CTCCTCGAGG CCGGGGCCGA CTTCGCCGCA
CTGTTCAACG GTGTGCTGGA AGGCTTCCCG ACTCCGGAAG AGTTCATCGC CGCCTTCAAC
GCCGCTCTCG AGGGCGTCGT CCCGGGCTTC GATGCGCTCG CTGCGGCCAC CGCCGAGGTG
ATCGCCGATC TCTCGGGCGC CCTGACGACG GCGCTGGAGA CCGGTTTCGC CGCCGGAGTG
GAGGCGTTCG ACGCCATCGT CGCGGGTCTG CTCGACGCGG GCAGTGAGCT CGCGGCGGCG
TTCAACGCGG CCATCGAGGC CTTCCCGACG CCCGAGGCGT TCATCGACGC CCTCGTCACG
GCCTTCGGGG CCATCGACGT CGACCTGGCC GGCGCCATCG ACGCCGCGCT CGAGGTCTTC
GTGGAGCTGG GGCTCAACCT GGACGCCACC GCGCTCGGCG AGGCCCTCGC CGAAGCGGGT
GCGGAGCTGG CGGTGATCTT CGATGCCGCG CTCGAAGGCT TCCCGTCACC CGCGGAGTTC
GTCGCCGCCT TCGAGGCGGC CCTGGAAGGT GTGATCCCGG GGCTGACCGC GCTGGCCGAT
GCCACCGGCA CGGCGATCGC GAACTTCTCG AGCGCGCTGA CGGCCGCGAT CGAAGCCGGC
ATCGAGGCCG GCGAGACGGC CTTCGAGGCG GTCATTGCCG GACTGCAGGA CGCGGGCGGC
CAGCTGGCCG CGGCGTTCGA GGCCGCGCTG AGCGCGTTCC CGACGCCGGA GGCGTTCGTC
GAGGCCCTCG TCAGCGCACT GGGTGCGATC GACGTCGACC TCGGCGCCTC CCTCGAGGCG
GCGCTGGAGC TCATCGTCGA CCTCGGCATC GACCTGGACG CGGGGCTCAT CGCCGACGTG
CTCCTCGATG CCGGAGCAGA ACTGGCAGCT GCCTTCAACG CGGCGTTCGA GGGCTTCCCC
ACCCCGGCGG AGTTCGTCGC CGCGATCGAG GGCGCCATCG AGGCTTTCGC ACCCGGCTTC
AACGCGATCG CCGATGCGAC CGCCGAGGCC ATCGGCAACC TGTCGGCCGC GCTGACCGCC
GCCATCCAGT CGGGCATCGA CATCGGCGGC GCCGCGTTCA GTGGCTTCCT CGAGGCCCTG
GTCGAGGGCG GTGGCACCCT GGCCGCCGCG TTCGAGGCCG CCCTGGAGAA CCTGCCCAGC
CCGGAGGTCT TCATCAGCGC CCTGATCGAG GCGTCCGCCT CGCTCGGACC GGTGCTGGCC
GACCTCGGCG CCTCACTGCA GGCCGGCCTG GAAGCGGGCC TCGACCTGGG TGCCGCAGCG
CTGGAGGGCT TCGCCGAGAT CGGCGGAGAG ATCTCCGCCG CGATCGGCGC GAGCCTGGAA
GGCCTGCCGA CGCCCGCCGA GTTCGTCGGC TCCTTCGCCG AAGCCGGTGC GCAGTTCGGT
GCGGCGGCGG ATGCCATCAC CGAGATCGGT GTCGACGCGG TCAACAACTT CGCCGCGGCG
GCCGACGCCG TCGTCGAAGC GTTCCGCGCC ACGGCGGTGT CGGCCCTGGA GAACGGCAAC
GATGCGCTCC AGGCCATCGG CGCCGGTATC TACGCCGCGG TGAACGCGTT CGCCGAGGCC
GCGAGTGACG TCAACATCGG CGGTGCCATC AGCGCCGCGA TCAGTGGCGC CATCCGCGGG
GCGCTGGGTG GTTTCGGTGG CAGCATCGGC GGTGAGCTCG GCGGCGGGAT CAACACGCTG
AGCGCCGGTA CGGGTGAGGT CGCGTCGCCG GAGGTCACTT CGACCCGGAT CGCGCCGGAG
TCCCGGACGT TGTCGGTGGC CACGGCCCCC GTCGAAGAGA AGGCACCGGC CGCCGAGGCG
AAGCTGCCTG AGGTGAAGGT CAACCCGGTG TTGCCGGCTC CTCCCGCGCC GGTGGCCGAC
CTGCAGAAGC AGATCCAGAA GGGTCAGGTG CAGCTGCGCG AGGCCCTCGA CACCGCGGGC
AAGCAGGTCA ACGACGGTCT CAACCAGACC CGCAAGAACC TCGAGGGGGC CGCGGAGCAG
ACCCGCAAGA ACCTCGAGGG GGCCGCGAAC CAGACCCGCA AGAACCTCGA GGGAGCGGCG
AACCAGACCC GCAAGAACCT CGACGGGGTC CGGAAGAACA TCGAGAACGC TGTCGGCGGT
AGCAAGAAGC CGGCCGGCGA GTCGACGAAG AAGGAATCGG CTGACACCTC CTCGAAGAAG
AAGGAGTCCG CCAGCTCCAG CTCCGCCTCG GAGTAA
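
The gene sequence is printed in space-separated blocks of 10 bases, so whitespace has to be removed before any computation such as GC content. The following is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first line of the listing, not the full gene, so the result is not expected to equal the 71% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGGAAGTTG CTGTTCGGTC ATACTTGACC GCCGGTGTGG CTGTTGTAGG CGCTACATCG"
first_seq = clean_sequence(first_line)

print(len(first_seq), round(gc_fraction(first_seq), 2))  # 60 0.55
```

Applying the same two functions to the complete listing would give the GC fraction of the whole gene.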
 
Protein sequence
MEVAVRSYLT AGVAVVGATS IALAPVEVLP PDFQIRGDRM VAVLEDVSLS SLADLIAAAQ 
AAWAPVGGAV DSASQAAQAA IVGLGGALEA ALVSAVEEGN NAFQAVVEGL LDGGSVLATA
IDNAIANFPS PQAFVDAVVS AFASVNLDLA GVIEVALEAA LELGIEIDAG AIGEAFLEAG
AALEAAFNAA LAAFPTPEEF VAAFEAALGD VIPGFNAVAT AAVEAFTELS AELAAAIDAG
IAAGVTAFDA FVAGLLDAGG ELAAAFNAAL EAFPTPEAFI DALVSAFGAI DVDLGAALDV
ALEALVNLGI EIDAGAIADA FIEAGADLAA AFEGVFEGFP TPAEFFAAFN AAIAGVLPGL
NAFAEAAAAS LTELSGALAA AIDAGAAAGI EAFDAIVTGL LDAGSDLSAA FAAAIEAFPT
PEAFLDALVN AFAAIDVDLG AALDVALEAL VDIGLALDTT AIAQALIDAG ADLAAVFEGA
IEGFPTPADF FAAFNAAIAG LVPGFNAFAE AAAEAFTELS GALSGALEAG FAAGLDAFES
VIAALQDAGG ELAAAFAAAI EAFPTPEAFV DALVSAFGAI DVDLGAALDV ALEALVELGI
DLDAGAIADA LLEAGADFAA LFNGVLEGFP TPEEFIAAFN AALEGVVPGF DALAAATAEV
IADLSGALTT ALETGFAAGV EAFDAIVAGL LDAGSELAAA FNAAIEAFPT PEAFIDALVT
AFGAIDVDLA GAIDAALEVF VELGLNLDAT ALGEALAEAG AELAVIFDAA LEGFPSPAEF
VAAFEAALEG VIPGLTALAD ATGTAIANFS SALTAAIEAG IEAGETAFEA VIAGLQDAGG
QLAAAFEAAL SAFPTPEAFV EALVSALGAI DVDLGASLEA ALELIVDLGI DLDAGLIADV
LLDAGAELAA AFNAAFEGFP TPAEFVAAIE GAIEAFAPGF NAIADATAEA IGNLSAALTA
AIQSGIDIGG AAFSGFLEAL VEGGGTLAAA FEAALENLPS PEVFISALIE ASASLGPVLA
DLGASLQAGL EAGLDLGAAA LEGFAEIGGE ISAAIGASLE GLPTPAEFVG SFAEAGAQFG
AAADAITEIG VDAVNNFAAA ADAVVEAFRA TAVSALENGN DALQAIGAGI YAAVNAFAEA
ASDVNIGGAI SAAISGAIRG ALGGFGGSIG GELGGGINTL SAGTGEVASP EVTSTRIAPE
SRTLSVATAP VEEKAPAAEA KLPEVKVNPV LPAPPAPVAD LQKQIQKGQV QLREALDTAG
KQVNDGLNQT RKNLEGAAEQ TRKNLEGAAN QTRKNLEGAA NQTRKNLDGV RKNIENAVGG
SKKPAGESTK KESADTSSKK KESASSSSAS E
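
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds a codon table in the usual TCAG ordering. For internal codons, translation table 11 assigns the same amino acids as the standard code, so one table serves here. Only the first and last printed lines of the gene sequence are translated, both of which begin on a codon boundary. `translate` and the other names are hypothetical helpers, not part of any database tool.

```python
# Codon table in NCBI ordering (first, second, third base each in T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGGAAGTTG CTGTTCGGTC ATACTTGACC GCCGGTGTGG CTGTTGTAGG CGCTACATCG"
last_line = "AAGGAGTCCG CCAGCTCCAG CTCCGCCTCG GAGTAA"

n_term = translate(first_line)
c_term = translate(last_line)
print(n_term)  # MEVAVRSYLTAGVAVVGATS
print(c_term)  # KESASSSSASE*
```

Both results agree with the start and end of the protein sequence listed above, with the trailing `*` corresponding to the TAA stop codon.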