Gene Mmcs_2387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_2387 
Symbol 
ID4111220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp2534620 
End bp2536287 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content69% 
IMG OID638031512 
Productmajor facilitator transporter 
Protein accessionYP_639551 
Protein GI108799354 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0964642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCCCA CCGAGGTCGG GCCGGCCATC GCACCGGCGA ACAGCCGAAG CCGTCGGATC 
GCGATCAGCG CGGGCAGTCT CGCGGTCCTG CTCGGCGCGC TCGACACCTA TGTCGTCATC
GCGATCATCC GCGACATCAT GTTCGACATC GGGATCGCGA TAAACCAGAT CCAACGCGTC
ACGCCGATCA TCACCGGGTA CCTGCTCGGC TACATCGCCG CGATGCCGCT GCTGGGCCGG
GCGTCCGACC GGTTCGGACG CAAGCTGGTG CTGCAGCTCA GCCTGGCCGG CTTCGCGATC
GGGTCGGTGG TCACGGCGTT GTCGAATGAC CTGACACCCA TGGTGATCGG CCGCTTCATC
CAGGGTGTGG CCAGCGGCGC ACTGCTGCCG GTCACCCTGG CACTGGCCGC CGACCTGTGG
GCGACCCGCA ACCGCGCCTC GGTGCTCGGC GGGATCGGCG CGGCGCAGGA ACTGGGCAGC
GTGCTGGGCC CGCTCTACGG AATCGCCGTG GTGGCCGCGC TGAGCACCTG GCGTGACGTG
TTCTGGATCA ACGTGCCGCT CGCGGCGATC GCGATGGTGC TGATCCACTT CAGCCTGCCT
GCCCGTCTCA AACCGGAGCG ACCGGAGAAG GTCGACGTGG TCGGCGGCGT CCTGCTGGCG
ATTGCGCTCG GCCTGTCGGT GATCGGCCTG TACAACCCGG CCCCCGACGG CAAACAGATC
CTGCCCGACT ACGGCCCGCC GACGCTGATC GGCGCCGTCG CCGCGTTCGT GGCCTTCTTC
GTGTGGGAGC GATTCGCCCG CACCCGTCTG ATCGAACCGG CCGGTGTGCA CTTCCGGCCG
TTCCTGGCGG CGTTGGGCGC GTCGCTCTGC GCGGGTGCGG CGCTGATGGT CACGCTGGTC
AACGTCGAAC TCTTCGGCCA GGGCGTGCTC GGCAAGGATC AGACGGAGGC CGTCCTGCTC
CTGCTTCGGT TCCTGATCGC CCTGCCGATC GGAGCGCTGC TCGGTGGCTG GCTGGCCAGC
CGCATCGGTG ACCGGCTGGT GGCGTTCGCC GGCCTGATGA TCGCCGCGGG CGGATACCTG
TTGATCTCGA AGTGGCCGGT CGACCTGCTC TCGGCCCGTC ACGATCTCGG GTTCGTCACC
CTGCCGGTCC TCGACACCGA CCTGGTGATC GCCGGGATCG GCCTCGGCCT GGTGATCGGT
CCGCTGACGT CGGCGTCGCT GCGGGTGGTG CCCGCCGCAC AGCACGGCAT CGCGTCGGCC
GCGGTGGTGG TGTCCCGGAT GATCGGCATG CTGATCGGGC TGGCGGCGCT GTCCGCGTGG
GGGCTGTACC GGCTCAACCA GCACCTGCAG ACGCTGCCGT TCCCGCCCGG GGCCGACACG
CTGGCCGAGC GGTTGGCCGC CGAGGCGGAC CGCTACCGCG CGGCGTACGT GCTGCAGTAC
GGCGACATCT TCATCGTCAC CACGATCATC TGTGTGGTCG GCGCACTGCT CGGGCTGCTG
ATCAGCGGCA GGAACGAGCA TGCGGACGAG TCCCCGGTGC CCGTGGGTGC CGATCACGGG
GACGATGCGC CCACCCAGTT CATCAACGTG GCAGGCGCCT CGGGAGACGC CGATCAGACC
ACGCGCCTAC CGCGGCAGAC ACCGGGCAGG CACCGCGACG AAGCCTGA
 
Protein sequence
MQPTEVGPAI APANSRSRRI AISAGSLAVL LGALDTYVVI AIIRDIMFDI GIAINQIQRV 
TPIITGYLLG YIAAMPLLGR ASDRFGRKLV LQLSLAGFAI GSVVTALSND LTPMVIGRFI
QGVASGALLP VTLALAADLW ATRNRASVLG GIGAAQELGS VLGPLYGIAV VAALSTWRDV
FWINVPLAAI AMVLIHFSLP ARLKPERPEK VDVVGGVLLA IALGLSVIGL YNPAPDGKQI
LPDYGPPTLI GAVAAFVAFF VWERFARTRL IEPAGVHFRP FLAALGASLC AGAALMVTLV
NVELFGQGVL GKDQTEAVLL LLRFLIALPI GALLGGWLAS RIGDRLVAFA GLMIAAGGYL
LISKWPVDLL SARHDLGFVT LPVLDTDLVI AGIGLGLVIG PLTSASLRVV PAAQHGIASA
AVVVSRMIGM LIGLAALSAW GLYRLNQHLQ TLPFPPGADT LAERLAAEAD RYRAAYVLQY
GDIFIVTTII CVVGALLGLL ISGRNEHADE SPVPVGADHG DDAPTQFINV AGASGDADQT
TRLPRQTPGR HRDEA