Gene Mmcs_3983 details

Gene Information

Locus tag: Mmcs_3983
Symbol:
ID: 4112813
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 4249104
End bp: 4250501
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 68%
IMG OID: 638033126
Product: extracellular solute-binding protein
Protein accession: YP_641144
Protein GI: 108800947
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
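
The coordinates and lengths in the table above are mutually consistent, and the arithmetic can be sketched in a few lines of Python. This is a minimal illustration, not part of the record: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon.

```python
# Values copied from the Gene Information table.
START_BP = 4249104
END_BP = 4250501


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1398 465
```

Both results match the Gene Length (1398 bp) and Protein Length (465 aa) fields.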


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGCGCTC GGCGGCTCTG TGCTGCAGCA GTCACCGCGC TGACGACCGC GTCGATGGTT 
TCGGCGTGTG GATCCGGCGG CGGTGGGCTG GTCATCAACT ACTACACCCC GGCGAACGAG
GCGGCGACGT TCACCGCCGT CGCCAACCGC TGTAACGAAG AACTGGGTGG CCGGTTCCGG
ATCGAACAGC GCAACCTGCC GAAGGGCGCC GACGACCAGC GCCTACAGCT GGCGCGCCGG
CTCACCGGCA ACGACACCTC ACTGGACGTG ATGGCGCTCG ACGTGGTGTG GACCGCCGAA
TTCGCCGAAG CAGGCTGGGC GCTGCCGCTC TCGGACGATC CGGCCGGACT TGCCGAGGCC
GATGCCACCG CCAACACGCT GCCCGGACCC CTTGAGACCG CGAAATGGCA GGACAAGCTC
TACGCGGCGC CGATCACCAC CAACACCCAG TTGTTGTGGT ACCGGGCCGA TCTCATCCCG
CAACCGCCTA CGACGTGGGA CGGGATGGTC GCTGAGGCCA CCCGGTTGCA TGCCGCGGGT
GGTCCGAGTT GGATCGCGGT GCAGGGCAAG CAGTACGAGG GCCTGATGGT CTGGTTCAAC
ACGCTACTGG AGAGCGCCGG CGGGCAGGTG CTCTCCGACG ACGGTGAGCG TGTCACCTTG
ACCGACACCC CGGAGCACCG GGCCGCCACG GTCAAGGCGC TGCAGATCAT CAAGTCTGTG
GCGACCGCCC CGGGGGCGGA TCCGTCGATC ACCCAGACCG ACGAGACGAC CGCACGCCTC
GCGCTCGAGC AGGGCAAGGC CGCGCTCGAG GTCAACTGGC CCTACGTGCT GCCGTCGCTG
CTGGAGAACG CCGTCAAGGG TGGCGTGCCG TTCCTGCCGC TGAATCAGGA CCCGTCGCTG
ACCGGGGCGA TCAACGACGT GGGCTCTTTC TCACCGAGCG ACGAGCAGTT CCGGACCGCC
TTCGACGCGA GCCGGCCCGT GTTCGGCTTC GCGCCGTATC CCGGGGTGCA ACCCGGCGAC
CCGGCGCGGG TCACGCTGGG CGGTCTGAAC CTGGCCGTGG CCAGGACGAC CCAGCACCGC
GCCGAGGCCT TCGAGGCGAT CCGGTGCCTG CGCAACGTCG AGAACCAGCG GTACACCTCG
GTCGAGGGTG GACTGCCCGC GGTGCGGGCG TCGCTGTACG ACGATCCGGC GTTCCAGGCG
AAATACCCGC AGTACGAGAT CATCCGCGAT CAGTTGACGA ACGCCGCGGT CCGGCCGGCC
TCACCGGTGT ACCAGGCGAT GTCGACGCGG ATCACGGTGA CGCTGGCACC GATATCGGAC
ATCGATCCGG AACGGACCGC CGATGAACTC ACCGAGCAGG TCCAGAAGGC CATCGACGGC
AAGGGGTTGA TCCCGTGA
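
The record reports a GC content of 68%. As a rough sketch of how such a figure could be recomputed from the sequence block above, the following Python strips the spaces and line breaks used for display and counts G and C bases. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the database may round or compute the value differently.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to display the sequence in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small illustrative input, not the full gene sequence.
print(gc_percent("GTGC GCAT\n"))  # prints: 62.5
```

Passing the full gene sequence, pasted as a single string, should give a value to compare against the GC content field.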
 
Protein sequence
MRARRLCAAA VTALTTASMV SACGSGGGGL VINYYTPANE AATFTAVANR CNEELGGRFR 
IEQRNLPKGA DDQRLQLARR LTGNDTSLDV MALDVVWTAE FAEAGWALPL SDDPAGLAEA
DATANTLPGP LETAKWQDKL YAAPITTNTQ LLWYRADLIP QPPTTWDGMV AEATRLHAAG
GPSWIAVQGK QYEGLMVWFN TLLESAGGQV LSDDGERVTL TDTPEHRAAT VKALQIIKSV
ATAPGADPSI TQTDETTARL ALEQGKAALE VNWPYVLPSL LENAVKGGVP FLPLNQDPSL
TGAINDVGSF SPSDEQFRTA FDASRPVFGF APYPGVQPGD PARVTLGGLN LAVARTTQHR
AEAFEAIRCL RNVENQRYTS VEGGLPAVRA SLYDDPAFQA KYPQYEIIRD QLTNAAVRPA
SPVYQAMSTR ITVTLAPISD IDPERTADEL TEQVQKAIDG KGLIP
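
The protein sequence can be related back to the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is an illustrative Python implementation under stated assumptions: the codon table is written out by hand in the conventional TCAG ordering, the set of alternative start codons reflects table 11 as commonly documented, an initiator codon at the first position is read as methionine, and the name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed initiator codons for table 11; each is read as M at position one.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(raw.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon encodes methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, which begins with the GTG start codon.
print(translate_cds("GTGCGCGCTC GGCGGCTCTG TGCTGCAGCA"))  # prints: MRARRLCAAA
```

The output matches the first ten residues of the protein sequence, which shows why the protein starts with M even though the gene starts with GTG.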