Gene Mmcs_1847 details


Gene Information

Locus tag: Mmcs_1847
Symbol:
ID: 4110681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1990277
End bp: 1991857
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 67%
IMG OID: 638030967
Product: extracellular solute-binding protein
Protein accession: YP_639012
Protein GI: 108798815
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
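
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1990277
end_bp = 1991857

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be the stop codon, which does not add an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1581
print(protein_length)  # 526
```

Both results agree with the Gene Length (1581 bp) and Protein Length (526 aa) fields.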


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.068039
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCCGCC GCCTGAAGAC CGTCGCCGCA CTCGCGGCGG CCGCCGCACT GACCCTGAGC 
GCCTGTGGCG GTTCGGATTC CGGTGGCGGC GCCCCCAGTG CCGCACCCAC CGACAAGGTG
CTGCACCTGT CGTTCCTGCA GGACCCGGGG CAGCCACCGG ACCCGGACGT CTTCTACGCC
GGACAGGGTC TACTGCTCAC CACCAACGTG TACGAAGGCC TGATGCAGTA CAAGGGTGGC
ACCGAGAAGG CGGAGATCGA ACCGCTGCTG GCCACGGAGT GGACCGAATC ACCGGACCAC
CGCGTGTTCA CCTTCAAGCT GCGGGAGGGG GTGACGTTCC ACGACGGGAC ACCGTTCACC
GCCGAGGCGG TCAAGGCGTC CTTCGACCGG AGGCTGGCCG TCGACCAGGG CCCCGCGTAC
ATGGTCGCCG ACGTGGAATC GATCACCACC CAGGGCGACC ATGCCGTCAC GATCACCCTC
AAGGCACCGA ACGCGGCGTT CCTGGACTAC CTCGCCTGCC CGTACGGTCC GCGCATGCTC
AGCCCGAAGG GGTTGGCCGA CAACGCCGGT GACGACCACG CCCAGAACTA CCTGACCACC
CACGATCTCG GCACCGGACC GTACACGCTG ACCGCGGCCG AAGTGGGATC GCGCTACGCA
CTGGCCGCCT ACCCCGGATA CTGGGGCGAG AAGCCGTATT TCGAGCAGGT GGAGATCCCG
GTCATCACCG ACGTGTCCGC CCAGCAGCTT CAGTTCAACA ACGGTCAGAT CGCCGCGATC
CTGCACGATC TGCCGTCGTC GGCGGTCGAG TCGTATCTCA ACAACGACAA GTACGCCCAC
TTCTCGCTGC CGACGATGAT GTCGAACTAC CTCTACCTCA ACCCGCGCCG CGGCATGCTC
ACCGACCCGA AGAACCGCGC CGCCGTGCTC GCCGCCATCG ACGTCGACGC GCTGGTCAAA
CAGACCTACT TCGGACGCGG CAAGAAGGCA GAACAGCTCT ACCCGCCGAA CATGATCGCC
CCGGAGTTGG CCAAGCAGAA CGTCACCCAC GACCCCTCGC TGCTCACCGA GATCGCGGCC
GGACTGCCCG CCGACCAGAA GGCCGTCACC ATCGGATACG ACTCCTCCAA CCCCGACAAC
CAGCTGATCA ACAACCTGAT CCAGACTCAG CTGGCCGCAG CCGGGCTCAA CGCCAAGGTG
CAGAGCTACC CGACCTCGGA GATCTACGGC TGGATCGGCA ACGACGCCCC CAACGCGCCG
GACATCCTGA CCGGTACGGC GTGGCCGGAT GCGCCGTCGC CCTACACCTG GGGTCATATC
TCCTGGGACG CCGACGGCGG GTTGAACTAC CTGGGCTGCT CGGCGCCCCC GGTGACCAGC
GCACTGGCTC GTGGTCTGGA AACCGGTGAC CCGCAGGTGT TCTCGGAGGC CGCCAAGGCC
GCCGCCGACA CCGGCTGCTG GCTCAACATC GCCGACGTCG ACGACTTCGT AGTCGCCCAG
CCGTGGCTCG CAGGGGTCGA GGAGGCGCAC GTGGTGACCA ACCCGAACTC GCTTCGGCTC
TTCGAACTCT CGGTCGCCTG A
 
Protein sequence
MIRRLKTVAA LAAAAALTLS ACGGSDSGGG APSAAPTDKV LHLSFLQDPG QPPDPDVFYA 
GQGLLLTTNV YEGLMQYKGG TEKAEIEPLL ATEWTESPDH RVFTFKLREG VTFHDGTPFT
AEAVKASFDR RLAVDQGPAY MVADVESITT QGDHAVTITL KAPNAAFLDY LACPYGPRML
SPKGLADNAG DDHAQNYLTT HDLGTGPYTL TAAEVGSRYA LAAYPGYWGE KPYFEQVEIP
VITDVSAQQL QFNNGQIAAI LHDLPSSAVE SYLNNDKYAH FSLPTMMSNY LYLNPRRGML
TDPKNRAAVL AAIDVDALVK QTYFGRGKKA EQLYPPNMIA PELAKQNVTH DPSLLTEIAA
GLPADQKAVT IGYDSSNPDN QLINNLIQTQ LAAAGLNAKV QSYPTSEIYG WIGNDAPNAP
DILTGTAWPD APSPYTWGHI SWDADGGLNY LGCSAPPVTS ALARGLETGD PQVFSEAAKA
AADTGCWLNI ADVDDFVVAQ PWLAGVEEAH VVTNPNSLRL FELSVA
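
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a minimal sketch using only the Python standard library, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which this sketch does not handle; the gene here starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, as displayed above.
first_blocks = "ATGATCCGCC GCCTGAAGAC CGTCGCCGCA"
print(translate(first_blocks))   # MIRRLKTVAA
print(gc_content("ATGATCCGCC"))  # 0.6
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content field and the complete protein sequence.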