Gene Mmcs_4357 details


Gene Information

Locus tag: Mmcs_4357
Symbol:
ID: 4113187
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 4636873
End bp: 4638240
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 63%
IMG OID: 638033503
Product: extracellular solute-binding protein
Protein accession: YP_641518
Protein GI: 108801321
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
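
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4636873
end_bp = 4638240

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(f"Gene length: {gene_length_bp} bp")       # 1368 bp
print(f"Protein length: {protein_length_aa} aa")  # 455 aa
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields in the record.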


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.200959
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAGTTC AGACAATAGC GCGCCGGCTG GCGGGAGGTC TGGCCGCTGC CGGGCTGGTG 
TTCACGTCCG GCTGTTCCGG GGCCGGCAGC CTCGGATCGT CCGACAACGA GGTGACGATC
GCCCTGGTGT CCAACTCGCA GATGACCGAC GCCCAGCAGC TGTCCTCGGA GTTCGAGAAA
GAGAACCCGG GCACGAAGCT CAAGTTCATC ACGCTGTCCG AGAACCAGGC ACGCGCCAAG
ATCACCATGT CGGCCGCAAT GGGCGGCAGT GAGTTCGACG TCGTGATGAT CAGCAACTTC
GAGACCCCGC AGTGGGCCAA AGACGGCTGG CTGGTGAATC TCTCGGAGTA CGCGAAGAAC
ACCCCCGGCT ATGACGAGGA CGACTTCATC TCGTCGCTGC GGGAATCGCT GTCGTACGAG
GGAAACATGT ACGCGGTTCC CTTCTACGGC GAATCGTCGT TCCTGATGTA CCGGAAGGAC
CTGTTCGAGC AGGCCGGCAT CACGGTCGAC CAGAACCCCG ACTATCAGCC CACCTGGCCT
GAGGTCGCCC AGTGGGCCGA GACGCTCAAG ACCGACGACC GCGCCGGCAT CTGCCTGCGG
GGAAAGCCCG GCTGGGGTGA GGTACTCGCA CCGTTGGACA CCGTCATCAA CACCTTCGGT
GGACGCTGGT TCGACGAGCA GTGGAACGCC CAACTCGACA GCCCCGAGGT GAAGAAGGCC
GTCAACTTCT ACGTCGACAC GGTCAAGAAC TTCGGTGAAC TGGGTGCGGC GTCAACAGGA
TTCCAGGAGT GCGCGAACCT GTTCGGCCAG GGGCAGACCG CGATGTGGTA CGACGCGACG
TCGGCGGTCT CGGTGCTCGA GGACCCCAAG GAGTATCCCG ACCTGGTCGG CAAGATCGGA
TACCTGCCCG CTCCGATCGT CGAGAAGCCG AACTCGGGCT GGCTCTACAC CTGGGCGCTG
GGCATCCCCA AGGGTGCCAA GAATCCTGAC GGCGCATGGG AGTTCATCTC GTGGATGACC
AGTAAGGACT ACATGAAACT GGTCGGGGAG AAGCTCGGCT GGGCGCGTGT CCCACCGGGC
AGCCGGACGT CGACCTACAC CGACCTGCCC GAGTACGAGG CCATCTCGAA GTCCTACGGG
CCGCTGACGC TGAAGTCGAT CGAGAGCGCG ACCCCGAATC AGCCAACGGT GCAACCGGTT
CCGTACACCG GCATCCAGTT CGTCGGCATC CCGGAGTTCC AGGATCTCGG GACCCGGGTG
AGCCAGCAGA TCAGCGCGGC GATCGCCGGA CAGAAGTCGG TGGACGACGC GCTCGCCCAG
TCACAGGAAT ACGCCGAGGT CGTCGGCCGC ACGTATCAGG AGAAGTGA
 
Protein sequence
MKVQTIARRL AGGLAAAGLV FTSGCSGAGS LGSSDNEVTI ALVSNSQMTD AQQLSSEFEK 
ENPGTKLKFI TLSENQARAK ITMSAAMGGS EFDVVMISNF ETPQWAKDGW LVNLSEYAKN
TPGYDEDDFI SSLRESLSYE GNMYAVPFYG ESSFLMYRKD LFEQAGITVD QNPDYQPTWP
EVAQWAETLK TDDRAGICLR GKPGWGEVLA PLDTVINTFG GRWFDEQWNA QLDSPEVKKA
VNFYVDTVKN FGELGAASTG FQECANLFGQ GQTAMWYDAT SAVSVLEDPK EYPDLVGKIG
YLPAPIVEKP NSGWLYTWAL GIPKGAKNPD GAWEFISWMT SKDYMKLVGE KLGWARVPPG
SRTSTYTDLP EYEAISKSYG PLTLKSIESA TPNQPTVQPV PYTGIQFVGI PEFQDLGTRV
SQQISAAIAG QKSVDDALAQ SQEYAEVVGR TYQEK
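
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first and last stretches of the gene sequence. The function `translate_cds` and the `START_CODONS` set are hypothetical helpers written for this example, and the set lists only a subset of the initiation codons that table 11 permits.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the first codon as M if it is a start codon."""
    dna = dna.replace(" ", "").replace("\n", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is translated as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as listed in the record.
print(translate_cds("GTGAAAGTTC AGACAATAGC GCGCCGGCTG"))  # MKVQTIARRL

# Last 18 bases of the gene sequence, ending in the TGA stop codon.
print(translate_cds("ACGTATCAGG AGAAGTGA"))  # TYQEK
```

Both fragments reproduce the corresponding ends of the listed protein sequence, including the GTG to M substitution at the first position.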