Gene Mmcs_0541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0541 
Symbol 
ID4109387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp600484 
End bp601479 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content68% 
IMG OID638029667 
Productextracellular solute-binding protein 
Protein accessionYP_637718 
Protein GI108797521 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.169713 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCTC GCGCGAAGAA CAGACAGCGG GTGATGAAGA GGCTGCTCGC GTTGCTCAGT 
GCCACGGTGC TGGTGCTGGC GGGTTGTGGC CAGGCGGCGT CGGTGGTGCC GACACCCGGG
GTGACGCTCG CGCCGCCCAC CCCGGCGGGG ATGGAGGAAC TTCCACCCGA ACAGGCGCGG
CTGCCGATAC CCGAGACCGA CGACTGCAAC CGCCGGGCGA GCCTGCGGCC CTTCCCCACC
CGCGCCGAGG CCGACGCCGC GGTGGCGTAC ATCCGCGAAC GCGGCAGGCT CATCGTCGGA
CTGGACATCG GCAGCAATCT GTTCTCCTTC CGCGATCCGA TCACCGGCGA CATCACCGGG
TTCGACGTCG ACATCGCCGG TGAGATCGCC CGGGACATCT TCGGCAGCCC CGCCCAGGTG
GACTACCGGA TCCTGTCGTC GGCCGACCGG ATCGTCGCGC TGCAGAACAA TCAGGTGGAC
GTCGTCGTGA AGTCCATGAC GATCACGTGT GAGCGCAAGA AGAAGGTCGG CTTCTCCACG
GTGTACCTCA ACGCCGACCA GCGGATCCTG GCGCCACGGG ATTCGGCGAT CACGCGGGCC
GCGGACCTGT CGGGCCGTCG GGTGTGTGTG GTGAAGGGGA CGACGTCGCT GCGGCGGGTC
CAGCAGATCA GCCCCCCGCC GATCATCGTG TCCACCGTGA CGTGGGCGGA CTGCCTGGTG
GCGTTGCAGC AGCGGCAGGT CGACGCGGTC AGCACCGACG ACGCGATCCT GGCGGGGCTC
GTGGCGCAGG ACCCCTATCT GCACATCGTC GGGCCGAGTA TGAACCAGGA GCCCTACGGG
ATCGGGGTGA ACCTGGAGAA CACCGGGTTG GTGCGGTTCG TCAACGGGAC GCTGCAGCGC
ATCCGCAACG ACGGCACCTG GTATGCGCTG TACCGCAAGT GGTTGACGGT GCTGGGTCCC
GCACCCGCAC CGCCGGTGGC GAGGTACGTG GACTGA
 
Protein sequence
MSARAKNRQR VMKRLLALLS ATVLVLAGCG QAASVVPTPG VTLAPPTPAG MEELPPEQAR 
LPIPETDDCN RRASLRPFPT RAEADAAVAY IRERGRLIVG LDIGSNLFSF RDPITGDITG
FDVDIAGEIA RDIFGSPAQV DYRILSSADR IVALQNNQVD VVVKSMTITC ERKKKVGFST
VYLNADQRIL APRDSAITRA ADLSGRRVCV VKGTTSLRRV QQISPPPIIV STVTWADCLV
ALQQRQVDAV STDDAILAGL VAQDPYLHIV GPSMNQEPYG IGVNLENTGL VRFVNGTLQR
IRNDGTWYAL YRKWLTVLGP APAPPVARYV D