Gene Mmcs_2420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_2420 
Symbol 
ID4111253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp2570262 
End bp2571593 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content69% 
IMG OID638031545 
Productextracellular solute-binding protein 
Protein accessionYP_639584 
Protein GI108799387 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.783182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATCC CGAAGCTCGG CCGGCGCCCA GCGCGGCGGA AAGCCACGCG CTGGATGCCG 
GCACTGGCCA TGACCGCAGG ACTCGCTTTG GCCGGCTGCG CCGGATCCGG TGGCTCCGGC
GACGAGCAGA GCTCGTCGGG GTTGGGGGAC ATCCCGACCG ACACCAACGC AACCGTGCGG
GTGCTCATGG AGAACGTGCC CGACACCGAC ATCGTGCAGG GCATGGTCGG ACAGTTCAAC
GAGAAGTACC CCGACATCAA GGTCCAGATC GAGACCATGA CGTTCGACCA GATGCGTGAC
CGCCTGGTGT CGTCGTTCCA GTCGGCCGAA CCCGCCTACG ACCTGATCGT CGTCGACAAC
CCGTGGATGG ACGATTTCGC CGCCGCGGGC TTCCTCGAGC CGCTGAACGA CCGGATCTCC
TCGACCCCCG ACTACCAGCC GGACGACTTC TTCCCGTCGC TGACCGACAT CACCGACGTC
GACGGCACGA CCTACGGGGT GCCGTTCTAC AACTACGCAC TGGGCTACAT CTACAACAAG
CCGGATCTGC AGGCCGCGAA CCTGCAGGTG CCGACCGACC TCGACGCGCT GGTGTCGACG
TCGCAGCGGC TCAAGGCGGG CGACCGCGCC GGTATCGCCA TGCAGCCGCA GCGCGGGTAC
AAGATCTTCG AGGAATGGGC GAACTGGCTG TTCGCCGCGG GCGGGTCGAT CTACGACGCG
GACGGTAAGC CGACGCTGAA CACCGAGCAG GCCGCCCGCG CCCTGGACGC CTACATCGAG
ACCTACCGCA CCGCGGCCCC GGCCAACAGC CTGAACTGGG GCTTCGACGA GGCCTTCCGC
TCGGTCTCCG GCGGCAACGC CGCCTCGATG ATCGGCTACA ACTGGAACCT GCCCGCGCTC
AACGACCCCG CCGGGGCGTC GGGTGCGCGC GCCGGACAGT TCGCGTTGGC GCCGATTCCC
GGCGGCAAGT CCGCGCTGGG CCTGTGGAGC TGGGCGATCC CGGCGAACTC GGCGGCTCCG
GACGCGGCCT GGGCGTTCAC GTCCTGGATC ACCTCACCCG CCGTCGACGC CCAGCGCGTC
GCCGAGGGCG GTGCGGTGAC CCGCAAGGGT TCGCTGACCG ATCCAAAGGT GCTGGCCGAC
GGGTACGGCG AGGAGTACTA CCGCGTCGTC GGTGAGATCC TGGCCGACGC GGCCCCGCTC
TCCCAGGGCC GCGGTGGTGA GGAGATGATC CAGGCCGTGG GAACCGAGCT CAACGACGCG
GCGGCGGGCA ACAAGAGCGT GGCCGACGCA CTGCGCGACG CCCAGGCGGC CGCAGAGCGA
ATCCAGCAGT GA
 
Protein sequence
MKIPKLGRRP ARRKATRWMP ALAMTAGLAL AGCAGSGGSG DEQSSSGLGD IPTDTNATVR 
VLMENVPDTD IVQGMVGQFN EKYPDIKVQI ETMTFDQMRD RLVSSFQSAE PAYDLIVVDN
PWMDDFAAAG FLEPLNDRIS STPDYQPDDF FPSLTDITDV DGTTYGVPFY NYALGYIYNK
PDLQAANLQV PTDLDALVST SQRLKAGDRA GIAMQPQRGY KIFEEWANWL FAAGGSIYDA
DGKPTLNTEQ AARALDAYIE TYRTAAPANS LNWGFDEAFR SVSGGNAASM IGYNWNLPAL
NDPAGASGAR AGQFALAPIP GGKSALGLWS WAIPANSAAP DAAWAFTSWI TSPAVDAQRV
AEGGAVTRKG SLTDPKVLAD GYGEEYYRVV GEILADAAPL SQGRGGEEMI QAVGTELNDA
AAGNKSVADA LRDAQAAAER IQQ