Gene Mmcs_0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0903 
Symbol 
ID4109744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1002708 
End bp1003949 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content72% 
IMG OID638030025 
Productmajor facilitator transporter 
Protein accessionYP_638075 
Protein GI108797878 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCGC AGACGACGGC GGCGCCGCAG GCGCGCGGTG CCCTCGGGCT GATGTTCGAC 
CCGGTCTTCG GGGCGTTGTT CTGGGGCAAG ATGTTCTCGG TCGTTGCCGT GTGGACGCAC
GGCATCATCG CGGCGATCGT CATGTACGAG GCGACGGGTT CGGCGCTCAT GGTCGGCCTG
GTCGGCGTCG TGCAGTTCGG ACCGCAGTTG ATCCTCAGCC CGGTCAGCGG CAAGTGGGCC
GACACCGGCA ACCCGGCCCG GCAGATCCTG CTCGGCCGGG TGCTGTGCAT GGTGGGTTCC
GGGTCGATCG CGGTGTGGTT GGCGATCACC GAGGCGCAGG CCGCGTTGGC GGTGCTGCTC
GGCACGCTGC TGGTCGGGGT CGGGTTCGTG GTGGGCGGCC CGGCGATGCA GTCGATCGTG
CCGAACCTCA TACGCACCGG CGAGCTGTCG ACGGCCATGG CGCTCAACAG CATTCCGATG
ACGGTCGGCC GGATGATCGG CCCGGTCATC GGCGCCTACC TGGCCGCACA CCTCGGCTAC
GCCGAAGGCT TCGCGGCCAG CGCGGGCCTG CACCTGATCT TCGCGATCTT CCTGCTGGTG
GTCCGCTTCC CCGCTCCCCC GGTGCGGCGC GAAGGGGCGG ACTACCGCGT GCGCGCGGCG
CTGAAGTACG TGTGGCGCGA CAAGCCGTTG TTCCTGGCCC TGCTGGCCGT CACGACGGTC
GGGTTCGCCG CGGACTCGTC GATCACGCTG ACGCCGTCGA TGGCCGACGC GCTGGGCGGG
GACACCCGAC TCGTCGGTGC GCTGTCGGCG GTGTTCGGCG TCGGCGCGGC GCTCGGCATG
GCGGTGCTGG CGCTGTTGCG CGGACGGATC GCGGCCGGCT GGGTGTCGTC GGTCGGGTTG
TGGCTGTTGT GCGCCGGATG CGCTGTCCTG GCGTTCGGGA CCGTGACGCC GGTGGCGGTG
GCCGGGTTCT GGCTCGCCGG TCTCGGCTTC GGCTGGGCGA TGACGGGCCT GAGCACGGTG
GTGCAGGAGC GGGCGCCCGA GGAGCTGCGG GGCCGGATCA TGGCGCTGTG GCTGGTCGGG
TTCCTGGGCT CGCGACCCAT CGCGGCGGCC GTACTCGGCG GCGCGGCCGA CGCGGTGAAC
GTGTTCGTGG CGTTCGGCAT CGCGGCGGCG TCGGTGGTGG GCGTCGCGGT GATGTGCCGG
CCGTCGACGC TGATCGGCGG CCTGCCCGCT TCGCGAGACT GA
 
Protein sequence
MTAQTTAAPQ ARGALGLMFD PVFGALFWGK MFSVVAVWTH GIIAAIVMYE ATGSALMVGL 
VGVVQFGPQL ILSPVSGKWA DTGNPARQIL LGRVLCMVGS GSIAVWLAIT EAQAALAVLL
GTLLVGVGFV VGGPAMQSIV PNLIRTGELS TAMALNSIPM TVGRMIGPVI GAYLAAHLGY
AEGFAASAGL HLIFAIFLLV VRFPAPPVRR EGADYRVRAA LKYVWRDKPL FLALLAVTTV
GFAADSSITL TPSMADALGG DTRLVGALSA VFGVGAALGM AVLALLRGRI AAGWVSSVGL
WLLCAGCAVL AFGTVTPVAV AGFWLAGLGF GWAMTGLSTV VQERAPEELR GRIMALWLVG
FLGSRPIAAA VLGGAADAVN VFVAFGIAAA SVVGVAVMCR PSTLIGGLPA SRD