Gene Mmcs_3147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3147 
Symbol 
ID4111979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3336122 
End bp3337600 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content67% 
IMG OID638032277 
Productsugar transporter 
Protein accessionYP_640310 
Protein GI108800113 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.060115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGGTC ACGGAGGGGC ACCGGTAGGG GGCGACCTGC CCATCGGAGA CGACGAATTC 
ACGTCCGGCA AGACCGCCAT TCGGATCGCC TCGGTCGCCG CGCTCGGCGG CCTGCTGTTC
GGCTACGACA GCGCGGTCAT CAACGGCGCC GTCGACTCCA TCCAGGAGGA CTTCGGAATC
GGTAACGCCT CGCTCGGCTT CGCCGTCGCG TCGGCGCTGC TGGGCGCCGC GGCCGGTGCG
ATGACCGCGG GCCGCATCGC CGACAAGATC GGCCGCATCG CGGTGATGAA GATCGCCGCG
GTCCTGTTCC TCATCAGCGC CTTCGGCACC GGTTTCGCCC ACGAGGTGTG GGCCGTCGTG
CTGTTCCGCA TCGTCGGCGG CATCGGCGTC GGCGTGGCAT CGGTGATCGC CCCCGCCTAC
ATCGCCGAGA CCTCACCACC GAGCATCCGT GGCCGCCTCG GCTCCCTGCA GCAACTGGCG
ATCGTGTCCG GCATCTTCCT GTCCTTCGTC GTCAACTGGC TCTTGCAGTG GGCGGCCGAC
GGACCCAACA AGCCGCTGTG GTTCGGCGTC GACGCCTGGC GCTGGATGTT CCTCGCGATG
GCGGTGCCCG CCGTCGTGTA CGGCGCACTG GCCTTCACGA TCCCGGAGTC ACCGCGGTAT
CTCGTTGCCA GCCACAAGAT CCCGGAGGCC CGCCGGGTGC TGAGCACGCT GCTCGGCAAG
AAGAATCTCG AGATCACCAT CACCCGCATC CGGGAGACGC TGGAGCGCGA GGACAAACCG
TCGTGGCGTG ATCTGCGCAA GCCCACCGGC GGCCTCTTCG GCATCGTCTG GGTGGGTCTC
GGACTGTCGA TCTTCCAGCA GTTCGTCGGT ATCAACGTGA TCTTCTACTA CTCCAACGTG
CTGTGGCAGG CGGTCGGCTT CAGCGCCGAC GAATCGGCGG TCTACACCGT CATCACCTCG
GTGATCAACG TGCTCACCAC GCTCATCGCG ATCGCGCTGA TCGACAAGAT CGGCCGCAAA
CCGCTGCTGC TCATCGGCTC GTCGGGTATG GCGGTCACGC TGATCACCAT GGCGGTGATC
TTCGCCAACG CGACGCTCGA CGCCGACGGC AACCCCAGCC TGCCCGGCGC GTCCGGGGTG
ATCGCGCTGA TCGCGGCGAA CCTGTTCGTC GTCGCGTTCG GCATGTCGTG GGGTCCGGTC
GTGTGGGTGC TGCTCGGCGA GATGTTCCCC AACCGCATCC GGGCCGCCGC GCTGGGCCTG
GCAGCCGCCG GTCAGTGGGC GGCCAACTGG CTGATCACCG TCACCTTCCC GGGGCTGCGC
GAGCACCTCG GTCTCGCCTA TGGCTTCTAC GGGATGTGCG CGATCCTGTC CGGCCTGTTC
GTGTGGCGCT GGGTGCGGGA GACCAAGGGG GTGTCCCTGG AGGACATGCA CGGCGAGATC
CTCAAGCACG ACCTGCCCAC GGGCGCCAAG GAGGGCTGA
 
Protein sequence
MAGHGGAPVG GDLPIGDDEF TSGKTAIRIA SVAALGGLLF GYDSAVINGA VDSIQEDFGI 
GNASLGFAVA SALLGAAAGA MTAGRIADKI GRIAVMKIAA VLFLISAFGT GFAHEVWAVV
LFRIVGGIGV GVASVIAPAY IAETSPPSIR GRLGSLQQLA IVSGIFLSFV VNWLLQWAAD
GPNKPLWFGV DAWRWMFLAM AVPAVVYGAL AFTIPESPRY LVASHKIPEA RRVLSTLLGK
KNLEITITRI RETLEREDKP SWRDLRKPTG GLFGIVWVGL GLSIFQQFVG INVIFYYSNV
LWQAVGFSAD ESAVYTVITS VINVLTTLIA IALIDKIGRK PLLLIGSSGM AVTLITMAVI
FANATLDADG NPSLPGASGV IALIAANLFV VAFGMSWGPV VWVLLGEMFP NRIRAAALGL
AAAGQWAANW LITVTFPGLR EHLGLAYGFY GMCAILSGLF VWRWVRETKG VSLEDMHGEI
LKHDLPTGAK EG