Gene Mmcs_2868 details


Gene Information

Locus tag: Mmcs_2868
Symbol:
ID: 4111700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 3034570
End bp: 3035769
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 72%
IMG OID: 638031992
Product: major facilitator transporter
Protein accession: YP_640031
Protein GI: 108799834
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
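
The start, end, gene length, and protein length fields above are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the last codon is an untranslated stop codon (the gene sequence in this record ends in TGA). The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3034570
end_bp = 3035769

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the final stop codon is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # compare with "Gene Length"
print(protein_length, "aa")  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the reported 1200 bp and 399 aa.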


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTCCG GAGTGCTCGA CGTGACCGCG GTGCGCGCCC GCCGTGCCCG GGTCGCGGTG 
GCCGCGCAGT TCCTCACCAA CGGGGCGTTG TTCGCGAACC TGCTGCCGCG CTTTCCGGAG
ATCAAGACCG ATCTGGCGCT GTCGAATGCC GTGTACGGGC TCACGATTGC CGCGTTCTCC
GCGGGGGCGT TCGTCGCCGG CCTCACCGCA GCCGCCCTGA TCCGCCGCTT CAGTTCGGCG
CGCGTCGCGG TCGGGGGAAC CATCGCCATC GCGGTCTTCG TCTTCGCGGC CGGGCTCGCG
CCGTCGGCGG TGCTGGTGGC CTGTGCGCTG TTCCTGGCGG GCGCGTCCGA CGCGGTCACC
GATGTGGCGC AGAACGCGCA TGCCCTTCGG GTCCAGCGCA TCTACGGCCG ATCCATCATC
AACTCGGTGC ACGCCGTGTG GGCCGGCGGC GCGGTCCTCG GCGGCCTCAT GGGCGCGGCG
GCGATCGCCC TGCACATCCC GCGGCCGGTG CACCTCGGGG TGGCCGCCGT CGTGTTCACC
GGTGTGGTGC TGGTCGCCTA CCGCTTCATG CTGCCGGGCG CCGACCAGGA CGACCATCCG
GCGTCCGGGT ATGCCGAGGG CGAGCGCGCC GGCCGGAGGG TGTACCTCGT GCTGGTCGCG
CTGGCCGTCA TCGCCATCGC AGGGGCGATG GTCGAGGACG CCGGAAGTTC GTGGGCCACC
TTGTATCTGC GCGACAGTGT CGGCGCACCG GGGGCGATCG CGGCGTTCGG GTACATCGCC
CTCGCGGCGT TCATGTTCGT CGGCCGGCTC ATCGGCGACC GGCTGGTCGA CCGGTTCGGC
GAAACCGCGG TGGCCCGGGC CGGGGGAGCG CTGGCCGCGG CCGGGATGGG TGTGGCGCTG
GCCTTCCCGA GCGTTCCCGC GACGATCGCC GGCTTCGCCG CCGCCGGACT CGGCGTGGCC
ACCGCGATCC CGGCGGCCAT GCACGGCGCC GATCAGCTTC CGGGACTGCG ACCGGGGACC
GGTCTGACCA TCGTCACGTG GCTGCTGCGG ATCGGCTTCC TGGCCTCACC GCCACTCGTC
GGCCTGATCG CCGACTGGAC CAGTCTGCGG ATCGGACTGC TGACCGTGCC GGTTGCCGGG
CTGGTGATCA TGGTGCTCGC GGGTGCGCTC AATGTGAGAG GCCGGCCAGT TCGATCGTGA
 
Protein sequence
MASGVLDVTA VRARRARVAV AAQFLTNGAL FANLLPRFPE IKTDLALSNA VYGLTIAAFS 
AGAFVAGLTA AALIRRFSSA RVAVGGTIAI AVFVFAAGLA PSAVLVACAL FLAGASDAVT
DVAQNAHALR VQRIYGRSII NSVHAVWAGG AVLGGLMGAA AIALHIPRPV HLGVAAVVFT
GVVLVAYRFM LPGADQDDHP ASGYAEGERA GRRVYLVLVA LAVIAIAGAM VEDAGSSWAT
LYLRDSVGAP GAIAAFGYIA LAAFMFVGRL IGDRLVDRFG ETAVARAGGA LAAAGMGVAL
AFPSVPATIA GFAAAGLGVA TAIPAAMHGA DQLPGLRPGT GLTIVTWLLR IGFLASPPLV
GLIADWTSLR IGLLTVPVAG LVIMVLAGAL NVRGRPVRS
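
The sequences above are printed in blocks of 10 residues, so the whitespace has to be stripped before any computation. The following is a minimal sketch of that cleanup and of a GC-percentage calculation of the kind behind the "GC content" field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database itself rounds its reported 72% is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the first two blocks of the gene sequence only;
# paste the full blocked sequence to evaluate the whole gene.
fragment = clean_sequence("ATGGCTTCCG GAGTGCTCGA")
print(len(fragment), gc_percent(fragment))
```

The same `clean_sequence` helper works for the protein sequence, whose cleaned length can be compared with the "Protein Length" field.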