Gene Mmcs_4055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4055 
Symbol 
ID4112885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4325259 
End bp4326884 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content67% 
IMG OID638033198 
ProductSSS family solute/sodium (Na+) symporter 
Protein accessionYP_641216 
Protein GI108801019 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.647451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGTGC TCGCCGCCGA GACCATCGGC AACCCCGTCG CCAACATGTC GATCTTCGCC 
CTGTTCGTCC TGGTGACGCT CTTCATCGTC ATCAAGGCGA GTAAGAAGAA CGCCACCGCC
ACCGAGTTCT TCACCGCGGG CCGCGCCTTC ACCGGTCCGC AGAACGGCAT CGCGATCAGC
GGTGACTACC TGTCGGCCGC GAGCTTCCTC GGCATCGCCG GCGCCATCGC CGTCTACGGC
TACGACGGGT TCCTGTACTC GATCGGATTC CTGGTCGCCT GGCTGGTGGC GCTGCTGCTG
GTCGCCGAAC TGCTGCGCAA CACGGGAAAA TTCACCATGG CCGACGTGCT GAGCTTCCGG
CTCAAACAAC GTCCGGTGCG GTTGGCCGCG GCCACCAACA CCCTGGCGGT GTCGTTGTTC
TACCTGCTCG CCCAGATGGC CGGCGCCGGC GTGCTGGTCG CACTGCTGCT CAACATCGAA
AGCGACCTCG GACAGTCGAT CGTGATCGCC GTCGTGGGCG TGCTGATGAT CGTCTACGTC
CTGGTCGGCG GGATGAAGGG CACCACCTGG GTGCAGATCA TCAAGGCGGT CCTGCTGATC
GGCGGCGCGG GGATCATGAC GATCATGGTG CTGGCGAAGT TCAACTTCAA CTTCTCCGAG
ATCCTCGGCA GCGCACAGGC GATGGTGAGC AGCAGCGAGG ACGCCAAGGT CGCCTCGCGC
GACGTGCTTG CCCCCGGCGC GCAGTACGGC GCGTCGCTGA CCACGCAGAT CAACTTCATC
TCGCTGGCGC TGGCCCTGGT GCTCGGCACC GCCGGCCTGC CGCACGTGCT GATGCGCTTC
TACACGGTGC CCACCGCCAA GGAGGCCCGC CGGTCGGTGG TCTGGGCGAT CGCGCTCATC
GGCGCGTTCT ACCTGTTCAC CCTGGCCCTG GGTTACGGCG CCGCGGCCCT GGTCGGACCC
GACCGCATCC TGGCCGCCCC CGGTGGCGTG AATTCCGCTG CGCCGCAACT GGCGTTCGAA
CTCGGCGGCG TAGTGCTGCT GGGCGTCATC TCCGCGGTGG CGTTCGCGAC GATCCTCGCG
GTCGTCGCCG GTCTGACCAT CACCGCGTCG GCGTCCTTCG CGCACGACAT CTACGCCAGC
GTGATGAAGA GCCATCAGGT CACCGAGAGC GAGCAGGTCA AGATCTCGCG GATCACCGCG
GTGGTGCTGG GCACGCTGGC GATCGGGTTG GGCATCCTGG CCCGCGAGCA GAACGTCGCG
TTCCTGGTGG CGCTCGCGTT CGCGGTGGCC GCCGCGGCGA ATCTGCCGAC CATCCTCTAC
TCGCTGTACT GGCGGCGTTT CAACACCCGC GGTGCGCTGT GGAGCATGTA CGGCGGGTTG
ATCTCGACGA TCGTGCTGAT CGTATTCTCG CCCGCGGTCT CGGGCACGGC GACCTCGATG
ATCAAGGGCG CCGACTTCGC CTGGTTCCCG CTGGCCAACC CGGGCATCGT GTCGATCCCG
CTGGCGTTCA TCCTCGGCAT CGTCGGCACC CTGACCTCAC CAGACGACGA GGATCCGACG
ATCGCCGCCG AGATGGAGGT GCGCTCGCTG ACCGGGGTGG GTGCGGAAAA GGCCGTCTCG
CACTGA
 
Protein sequence
MTVLAAETIG NPVANMSIFA LFVLVTLFIV IKASKKNATA TEFFTAGRAF TGPQNGIAIS 
GDYLSAASFL GIAGAIAVYG YDGFLYSIGF LVAWLVALLL VAELLRNTGK FTMADVLSFR
LKQRPVRLAA ATNTLAVSLF YLLAQMAGAG VLVALLLNIE SDLGQSIVIA VVGVLMIVYV
LVGGMKGTTW VQIIKAVLLI GGAGIMTIMV LAKFNFNFSE ILGSAQAMVS SSEDAKVASR
DVLAPGAQYG ASLTTQINFI SLALALVLGT AGLPHVLMRF YTVPTAKEAR RSVVWAIALI
GAFYLFTLAL GYGAAALVGP DRILAAPGGV NSAAPQLAFE LGGVVLLGVI SAVAFATILA
VVAGLTITAS ASFAHDIYAS VMKSHQVTES EQVKISRITA VVLGTLAIGL GILAREQNVA
FLVALAFAVA AAANLPTILY SLYWRRFNTR GALWSMYGGL ISTIVLIVFS PAVSGTATSM
IKGADFAWFP LANPGIVSIP LAFILGIVGT LTSPDDEDPT IAAEMEVRSL TGVGAEKAVS
H