Gene Mmcs_5100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5100 
Symbol 
ID4113929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp5394691 
End bp5395761 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content70% 
IMG OID638034258 
Producthypothetical protein 
Protein accessionYP_642260 
Protein GI108802063 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGCCG CTGACCTGCC CGGGATACCC GGCGTCTTCA CGCCGGACCA GTGCCGGCAG 
ACGGCTGAAT CGATTGCCGC CGAACAGGAG TCCTCGGGCG CGATTCCGTG GTTCACCGGT
GGGCACACCG ACCCGTGGGA CCACGTCGAG TCGGCGATGG CGCTGACCGT CGCCGGTCTG
CTCGAACCGG CGCGGGCGGC CTACGAGTGG TGCCGGATCA CCCAGCGTCC CGACGGGTCC
TGGCCGATCC AGTTCCGCAA CGGCGTGATC GAGGACGCCA ACAGCGACAG CAACTTCTGC
GCCTACATCG CCGCCGGTGT CTGGCACCAC CTCCTGATCA CCCGGGACCG CTCCTTCGCG
GAGACGATGT GGCCGGTGGT CGCCAAGGCG ATCGACTTCG TGCTGGGCCT GCAACGGCCC
AACGGCGAGA TCTCCTGGGC GCAAAGCGAA GACGGTCCGA TTCCGGAGGC GCTGCTGACC
GGGTGCGCCA GCATCCACCA CAGCATCCGG TGCGCGCTCG CGCTCGCGGA CTACATCGGT
GAGCCGCAGC CCGAGTGGGA GGTGGCCGTC GGGCGGCTGG GGCACGCGAT CATCGCGCAT
CCCGGATCGT TCGTGCCGAA GGACCGGCAC GCGATGGAGT GGTACTACCC GGTGCTGTGC
GGTGCGCTGC GGGGTGCGGC CGCCCACGAC CGCATCCACG AACGCTGGGA CGATTTCGTA
GTCCCGGGTC TGGGCATCCG TTGCGTGGAC GACCGCCCCT GGGTCACCGG CGCGGAGACC
TGCGAGTTGG TGATGGCGCT CGACGCGATC GGCGATTCGG CGCGTGCGCA CCAGCAGTTC
GCCGCCATGC ACCATCTGCG TGAGGGTGAC GGATCCTATT GGACCGGACT GGTTTTCGCC
GATGGGAAGC GTTGGCCCGA GGAGCGCACC ACGTGGACCG GTGCCGCGGT CATCCTGGCC
GCGGACGCCC TGACGGGCAC CACGCCGGGC AGCGGGATCT TCCGCGGCGC GGATCTGCCG
CGCGGCCTCG AGGACGAATT CGACTGCGCC TGCGCAGTCA GCGACCGCTA G
 
Protein sequence
MPAADLPGIP GVFTPDQCRQ TAESIAAEQE SSGAIPWFTG GHTDPWDHVE SAMALTVAGL 
LEPARAAYEW CRITQRPDGS WPIQFRNGVI EDANSDSNFC AYIAAGVWHH LLITRDRSFA
ETMWPVVAKA IDFVLGLQRP NGEISWAQSE DGPIPEALLT GCASIHHSIR CALALADYIG
EPQPEWEVAV GRLGHAIIAH PGSFVPKDRH AMEWYYPVLC GALRGAAAHD RIHERWDDFV
VPGLGIRCVD DRPWVTGAET CELVMALDAI GDSARAHQQF AAMHHLREGD GSYWTGLVFA
DGKRWPEERT TWTGAAVILA ADALTGTTPG SGIFRGADLP RGLEDEFDCA CAVSDR