Gene Mmcs_5609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5609 
Symbol 
ID4114477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008147 
Strand
Start bp183697 
End bp186891 
Gene Length3195 bp 
Protein Length1064 aa 
Translation table11 
GC content68% 
IMG OID638034764 
Producthypothetical protein 
Protein accessionYP_642765 
Protein GI108802569 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0379241 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.525859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGGC CGGCGCCGAG CATTGATCGC CATGTCGTCA TCATCGCCGC GGAAGGTTCG 
TTCACTGCGG CGGGAGACCA GCTCACCGGC GCGGTCGATA CCGTCGACAA ACTCGAGAAG
CTCATCGAGT GGGCGCATCA GCGAGGGGGG CTCCAGCCCG TACCGGCTGG TGAGGAAGAG
CATGAGCCGG CCCGCGTATG GGTGGTCGGG GCCGCATGCG ACCTACTGGC GGGAACGCCC
CTCGCGCAGG CTGCCGACGA CAACGAAGCC GAGCGCATTG GGCAAGTCCT CGCCCCGCTG
GTCACACGCG GGTGGGAACT GCGCGGCAGA CCGACCTCAG CGCTCCTGCT CACCCGCGGC
CAGGGTGCGC AGCGGATCTA CGTTGAGATC CTCGCCGAGC GCCAACCCTG GCTCGCTGCC
GGGCATGCGG CGGTTGTCGG GAATCCGGCA GAACTCGGCC GCCGGTTACG CGCGTGGTAC
GCCGCAGTGG GAACGCTGCC GGCCCGCAGC GGCGCAGCCT CGGCAGCCGT CCTCCACGAC
CACATCATGC GCGCCCGGAC AGGCCGTCGC GGCGCTGTCG TGTCGACCCC TGGCGTCCTG
CCAGCCTGGG TGCAGCCCGA TATGCGGATC CAACCAGCCT GGTGTGTCTC CGTGGACGAG
GCGGAGCGTG AGTTCGAGCG CTCGGACGAA CTGGTGTGTT TGACACAGCT GTGCCCGCAA
CTGGCCTCCG CTGGAATGCT GACCCTCGGC TACGGCCAGC CACAGGTGCT CGACGGCCAG
GCCGCGGCCG CCGCCGCGGC CGAATCCAAG CGGCCATTCG GCCTGTGGCG CGCGACTCTG
CCCGCCGCCG AGAAGTCCAA CCTTCCCGCG ATGCTGCCGC TGCCGCATCC CCAGATGCGG
CCCGACCAGC CCACCCAGGC GTGGCTGACC ACCGAGGACC TCGACGGTCT GGCTAAAGAC
ACCCGTGACG GCGGCGCCGG CCTGAGCGCC GAGCAGCTCG CCATCGACGA GGCGATCGTC
TGGCCGCAGC AGGGCCGGGT CCTAGAAGTG TGGGCGACGC GGCTGAGGGA GGCCCGAGAA
ACGTTCCGTG ACGATTCGGT ACTGCAGTCG CTGGTCGACG TCGCCGCGGC CGACTACCTC
ACCGCGCTGG CCGCCCCGGA CACCTGGCGC GAGGACGCCT GGCGCCACCA CTTCCAGCCC
GCGTGGGCCG CGGCGATCGC CGCCCACATC CGGTTCCGCG GCCGGCGAGC AGCGATGCGG
CTGTCCCGTG AATACCGCAG CTGGCCGGTC TGGGCGCACG ACGCCGCCAT GATCTACACC
CCGGGCCGAG ACGACACCAC CGGCGAGCCA ATCGACCTGT CGGACACCCA CACTCGACTC
GGGCGGCTCG TGGTCTCGCA TCGCTGCGCA CTCACCGACC AAACCGTGCT CGCCGTCCTA
CTGGCCGAGT CCACCCTTGA GGTGGCCGAC GCATTCATCA CAGCGCTTGG CATCACCGCC
CACCAAGGCA GCGAAGCACG GCCGACTCGC CACAGCCTCG ACGTCGCGGA CGAAGGCGGC
GCCGCGGTCA CCGGCGAACC GACATCAATG CCGCAGCCCA CACCAACCGG CGGCGACGAC
GAGCAGCCAG CGCAGCCGAC CGGCGGCGAC AAACCCACCG GCCCGGCATC GCAACCACGG
GTCTCGGCAA CCCGCACACA CGCCGCCAGC GGCGGCGCAC CGGCCGCTGT CCTCCATACC
GACGGTCTCT GGCTTCCCGA CGGCACGCAC ATCGAACTCG ACGAGCCGAT CCTGCACGTC
GGACAAGTCG CCGAACTGGC CTACATCCAC CGCATCGGAT ACCAGCTCAC CCCGAAATAC
ACAGAAGCCG CCCAAATTTG GGTCACCGCC GACGCCTGCG CAGCCTTCGG CATCGACGTA
GAAGCCATCA GCCGGCGCGA CCGGGCCAAG TCGCTGCGCC AGCTCACCGA GGGCATCGAC
TTCGTGGTCC TAGCGGTCAA CGACGGCTGG AGCTTGGGCG GGGCAGCCGA AGATCCGACC
ACCCAACGCC TCGGCACATG GACACGGGTG TACCGCGACG ACAAACGCGG CGTCATGGTC
GCCCTGATCC CCGGAATGGG CGCCGGACAC GAAGAGATGC CCATCTTGGC CGACGACCCC
ACCCCAGCGC AGATCGCGCG GCGCCTGCAG CTCCTCGCAG ACGCACTGCA CTTCCCCTGG
AAAATCAACG CCGGCGTGAC CGCCGTGGAC TTGATGCTGC AGACCCGCAC CAAAAAGTGG
TCGCCCCAGG AGTGGAAAGA AGTCGTGTTC GCGCCCTCGA CGACCAGCCC ACCATTTGGC
ATCGGCGACG TCGAATCCGA CTTCGACTGG TCGCGACCGC CGACTGCCGA AGAAAGCCAG
CGTCGCTATC TGCACGCCTA CGACCGCGGC GGGTCCTATG TCGCAGGCAT CGCCGGTCTC
GAACTGCCCA TCGGCGATCC AGTCCACTAT CCCGAAGGCA CGCAGTTCGA CGCCAAGACA
CCCGGCTACT GGCTAGCTGA AATCCCTGAG GCCTCCGACT GGCGCATGCC ATATGTGCTC
AATCCCAGAG GAATTCAATT CACCGAACCC AAATGGGTCA CGACACCGAC CCTGGAACGT
GCCTTCGCGC TCGGTTACAA CCCAGCGATC CTCGAAGCGT GGACCTGGCC GCAACACGGC
CGCGTTCTGC TCGGATGGTA CGAGCGATTC CGCGACGCAA GTGGTGCCCT CGATACCGAC
GATCTCGACG CTCAGGCGGC ACGCAACCAG GCCAAGATCA TTCGCACCCA CGGCATCGGC
ATCATCGGCT CCGACGAACA CCTCAAGGGC AAGACCGGGT ACAGCCCCGA GCGGCGACTG
CATGTGCTGG CCAAAGCCAA GGCCAACATC GTCTACCGGC TACAGCAGAT CGGCGAGCGC
ACCGATCAAT GGCCGGTGGC CGTGGCCACC GACACCGTGC TGTACGCCTC TGACGACCCA
GACCCCGTGA CGGCATGGCC CGGCGGACCT GACTCATTCG GCCGCGGCTT CGGCCAGTAC
AAGCCCGAAG GATCGGCACT ACTCGCCGAC CATCTCGACT TCCTCAACGG ACGTGACTAT
CGCGGCAAGC GAGAGCTGAC GCCGGTTGGG CAGTGGCGAC GCCAGGTGCT CGACAAAGAC
GACAGGAGCC ACTGA
 
Protein sequence
MSGPAPSIDR HVVIIAAEGS FTAAGDQLTG AVDTVDKLEK LIEWAHQRGG LQPVPAGEEE 
HEPARVWVVG AACDLLAGTP LAQAADDNEA ERIGQVLAPL VTRGWELRGR PTSALLLTRG
QGAQRIYVEI LAERQPWLAA GHAAVVGNPA ELGRRLRAWY AAVGTLPARS GAASAAVLHD
HIMRARTGRR GAVVSTPGVL PAWVQPDMRI QPAWCVSVDE AEREFERSDE LVCLTQLCPQ
LASAGMLTLG YGQPQVLDGQ AAAAAAAESK RPFGLWRATL PAAEKSNLPA MLPLPHPQMR
PDQPTQAWLT TEDLDGLAKD TRDGGAGLSA EQLAIDEAIV WPQQGRVLEV WATRLREARE
TFRDDSVLQS LVDVAAADYL TALAAPDTWR EDAWRHHFQP AWAAAIAAHI RFRGRRAAMR
LSREYRSWPV WAHDAAMIYT PGRDDTTGEP IDLSDTHTRL GRLVVSHRCA LTDQTVLAVL
LAESTLEVAD AFITALGITA HQGSEARPTR HSLDVADEGG AAVTGEPTSM PQPTPTGGDD
EQPAQPTGGD KPTGPASQPR VSATRTHAAS GGAPAAVLHT DGLWLPDGTH IELDEPILHV
GQVAELAYIH RIGYQLTPKY TEAAQIWVTA DACAAFGIDV EAISRRDRAK SLRQLTEGID
FVVLAVNDGW SLGGAAEDPT TQRLGTWTRV YRDDKRGVMV ALIPGMGAGH EEMPILADDP
TPAQIARRLQ LLADALHFPW KINAGVTAVD LMLQTRTKKW SPQEWKEVVF APSTTSPPFG
IGDVESDFDW SRPPTAEESQ RRYLHAYDRG GSYVAGIAGL ELPIGDPVHY PEGTQFDAKT
PGYWLAEIPE ASDWRMPYVL NPRGIQFTEP KWVTTPTLER AFALGYNPAI LEAWTWPQHG
RVLLGWYERF RDASGALDTD DLDAQAARNQ AKIIRTHGIG IIGSDEHLKG KTGYSPERRL
HVLAKAKANI VYRLQQIGER TDQWPVAVAT DTVLYASDDP DPVTAWPGGP DSFGRGFGQY
KPEGSALLAD HLDFLNGRDY RGKRELTPVG QWRRQVLDKD DRSH