Gene Mmcs_1415 details


Gene Information

Locus tag: Mmcs_1415
Symbol:
ID: 4110252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1532772
End bp: 1535783
Gene Length: 3012 bp
Protein Length: 1003 aa
Translation table: 11
GC content: 67%
IMG OID: 638030536
Product: hypothetical protein
Protein accession: YP_638583
Protein GI: 108798386
COG category: [S] Function unknown
COG ID: [COG1615] Uncharacterized conserved protein
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGTATGC GGCCCCCCGC GAGGATGCCG AAGCTGACGC GACGAAGCCG GGTGCTGATC 
GGTGTCGCCC TCGCCGCGGT GGTGCTGCTG TTGATCGGGC CACGGTTCAT CGACACCTAT
GTGAACTGGC TGTGGTTCGG TGAACTCGGC TACCGGTCGG TGTTCACCAC GGTGCTGTTG
ACCCGCGTCG TCGTGTTCCT GGTCGTCTCG CTGCTGATCG GCGCGATCGT CTTCGCCGGC
CTTGCGTTGG CCTACCGGAC GCGGCCGGTG TTCGTCCCGA CCGCCGGCCC CAACGATCCG
ATCGCCCGCT ACCGCACGAC GGTGATGGCG CGGCTGCGCC TGTTCGGCTT CGGCGTCCCG
GCGTTCATCG GCATCCTGTC CGGCATCGTC GCGCAGAGCT ACTGGGTGCG CATCCAGCTG
TTCCTGCACG GTGGTGAATT CGGGGTCACC GACCCGCAAT TCGGCCTCGA CCTCGGGTTC
TACGCCTTCG ACCTGCCGTT CTACCGGCTG GTGCTGAGCT ATCTGTTCGT CGCGACGTTC
CTCGCCTTCA TCGCGAACCT GTTGGGCCAC TACCTGTTCG GCGGGATCCG GCTGACCGGG
CGCAACGGGG CGCTGACCCG CTCGGCGCGG ATCCAGCTCG TCACCCTGGT CGGCATCCTG
ATCCTGCTCA AGGCGTTCGC CTACTGGCTC GACCGCTACG AGCTGCTCAG CCACACCCGC
GGCGGCAAGC CGTTCACCGG CGCCGGCTAC ACCGACATCA ACGCCGTGCT GCCCGCCAAG
CTGATCCTGC TGGCGATCGC GGTGATCTGC GCGGTCGCGG TGTTCTCCGC GATCGTGCTG
CGCGACCTGC GCATCCCCGC CATCGGTGTG GTGCTGCTGC TGCTCTCGTC GCTGGTCGTC
GGCGCCGGCT GGCCGCTGGT GGTCGAGCAG TTCAGCGTCA AGCCCAACGC CGCGCAGAAG
GAAAGCGAAT ACATTTCGCG CAGCATCGCC GCGACCAGGC AGGCCTACGG TCTGACCGAC
GAGGTGGTCA CCTACCGCGA CTACCCGGGC AACGCGCCGG CCACCGCCCA GCAGGTGGCC
GCCGACCGGT CCACCACGTC GAACATCCGC GTGCTCGACC CGAACATCGT CAGCCCGGCG
TTCACCCAGT TCCAGCAGGG CAAGAACTTC TACTTCTTCC CCGAACAGCT GGCGATGGAC
CGCTACCGCG ACGCCGACGG CAACCTGCGC GACTACGTGG TCGCCGCCCG CGAGCTCAAC
CCGGACCGGT TGATCGACAA CCAGCGCGAC TGGATCAACC GACACACCGT CTACACCCAC
GGCAACGGGT TCATCGCCTC ACCGGCGAAC ACGGTCCGGG GGATCGCCAA CGACCCCAAC
CAGAACGGCG GCTACCCGGA GTTCCTCGCG AGCGTGGTGG GCGCCAACGG TGCCGTCGTC
TCGCCCGGGC CGGCGCCGCT GGACCAGCCG CGCATCTACT TCGGCCCGGT CATCGCCAAC
ACCGCGTCGG ATTACGCGAT CGTCGGGGAG AACGGCACCC CGCGCGAATA CGACTACGAG
AACAACGTCG AGACCCGCAA CTACACCTAC ACCGGCTCCG GCGGTGTGCC GATCGGCAAC
TGGCTGACGC GAAGCCTGTT CGCGGCGAAG TTCGCCGAGC GCAACTTCCT GTTCTCCAAC
GTCATCGGTG AGAACAGCAA GATCCTGTTC AACCGTGACC CCGCCGACCG GGTCGAGGCC
GTCGCGCCGT GGCTGACCAC CGACACCACG GTGTACCCGG CGATCGTCAA CAAGAAGATC
GTGTGGATCG TCGACGGCTA CACGACGCTC GACAACTACC CCTATTCGGA GTTGACGTCG
CTGTCGTCGG CGACCGCGGA CTCCAACGAG GTGGCCGTCA ACCGGTTGGC GCTCAACAAG
CAGGTGTCCT ACATCCGCAA CTCGGTGAAG GCCACCGTCG ACGCCTACGA CGGCACGGTG
ACGCTGTACG CCCAGGACGA GACCGACCCG GTGCTGCAGG CGTGGATGAA GGTGTTCCCC
GACACCATCA AGCCCAAGAG CGAGATCAGC CCCGAGTTGC AGCAGCACCT GCGCTATCCC
GAGGACCTGT TCAAGGTGCA GCGCGCGCTG CTGGCCAAGT ATCACGTCGA CGACCCGGTG
ACGTTCTTCT CGACGTCGGA CTTCTGGGAC GTCCCGCTGG ACCCGAACCC CACGGCCAGC
AGCTTCCAGC CGCCGTACTA CATCGTGGCC AAGGACCTCG CGGAGAACAA CAACTCGGCG
GCGTTCCAGC TGACCAGTGC GATGAACCGG TTCCGTCGTG ACTTCCTCGC CGCGTACATG
AGCGCGAGTT CGGATCCGGA GACCTACGGA AAGATCACGG TGCTGACCAT CCCGGGTCAG
GTCAACGGTC CCAAGCTGGC GTTCAACGCG ATCAGCACCG ACACCGCGGT CAGCCAGGAT
CTCGGCGTCA TCGGCCGCGA CAATCAGAAC CGGATCCGGT GGGGCAACCT GTTGACCCTG
CCGGTGGGCC CGGGCGGTCT GCTGTACGTG GCGCCGGTGT ACGCCTCACC GGGTACCAGC
GATGCGGCGT CGACCTACCC GCGCCTGATC CGTGTGGCGA TGTTCTACAA CGATCAGGTC
GGATACGGCC CGACGGTGCG CGACGCGCTG ACCGACCTGT TCGGTGCGGG TGCCGACGCC
ACCGCCACCG GACCCGCGCC GGCCAACCTG CCCGACGGTC AGCCGGCGGC CCAGCCGCCG
AACGGTCAGC AGCCCGCGGC ACAGACCCCC GGCAACCAGG CGGGTCGGGC GCCGACGCCC
CCTCCGGCCG CCATCCCGTC GGGGCCGTCG GGTCCCCAGC AGTTATCCGA GGCGAAAGCC
GCGGCGCTGC AGGAGGTCCA GGAGGCGATG AGCGGTCTGC AGGACGCGCA GCGCAGCGGC
AACTTCGCCG AATACGGTGA GGCGCTGCAG CGTCTGGACG ACGCGATGAA CAGGTACTCC
GAGGCGCGCT GA
 
Protein sequence
MGMRPPARMP KLTRRSRVLI GVALAAVVLL LIGPRFIDTY VNWLWFGELG YRSVFTTVLL 
TRVVVFLVVS LLIGAIVFAG LALAYRTRPV FVPTAGPNDP IARYRTTVMA RLRLFGFGVP
AFIGILSGIV AQSYWVRIQL FLHGGEFGVT DPQFGLDLGF YAFDLPFYRL VLSYLFVATF
LAFIANLLGH YLFGGIRLTG RNGALTRSAR IQLVTLVGIL ILLKAFAYWL DRYELLSHTR
GGKPFTGAGY TDINAVLPAK LILLAIAVIC AVAVFSAIVL RDLRIPAIGV VLLLLSSLVV
GAGWPLVVEQ FSVKPNAAQK ESEYISRSIA ATRQAYGLTD EVVTYRDYPG NAPATAQQVA
ADRSTTSNIR VLDPNIVSPA FTQFQQGKNF YFFPEQLAMD RYRDADGNLR DYVVAARELN
PDRLIDNQRD WINRHTVYTH GNGFIASPAN TVRGIANDPN QNGGYPEFLA SVVGANGAVV
SPGPAPLDQP RIYFGPVIAN TASDYAIVGE NGTPREYDYE NNVETRNYTY TGSGGVPIGN
WLTRSLFAAK FAERNFLFSN VIGENSKILF NRDPADRVEA VAPWLTTDTT VYPAIVNKKI
VWIVDGYTTL DNYPYSELTS LSSATADSNE VAVNRLALNK QVSYIRNSVK ATVDAYDGTV
TLYAQDETDP VLQAWMKVFP DTIKPKSEIS PELQQHLRYP EDLFKVQRAL LAKYHVDDPV
TFFSTSDFWD VPLDPNPTAS SFQPPYYIVA KDLAENNNSA AFQLTSAMNR FRRDFLAAYM
SASSDPETYG KITVLTIPGQ VNGPKLAFNA ISTDTAVSQD LGVIGRDNQN RIRWGNLLTL
PVGPGGLLYV APVYASPGTS DAASTYPRLI RVAMFYNDQV GYGPTVRDAL TDLFGAGADA
TATGPAPANL PDGQPAAQPP NGQQPAAQTP GNQAGRAPTP PPAAIPSGPS GPQQLSEAKA
AALQEVQEAM SGLQDAQRSG NFAEYGEALQ RLDDAMNRYS EAR