Gene Mkms_2447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_2447 
Symbol 
ID4613270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp2567125 
End bp2568318 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content71% 
IMG OID639792116 
Productmajor facilitator superfamily transporter 
Protein accessionYP_938435 
Protein GI119868483 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.118578 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.46629 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACGGT CGTTTTCGTT TGTCCGCGGG CGTGACCCCC TGGTCATCGC CTTCGGCACC 
AGCTTCATCG CCGCCATGTA CGGGCTCGTG CGGTTGGCCT ACGGGTTGTT CCTTCCCGAC
ATCCAGGCCG ACCTGCACCT CGGTGCGGCC GGCGCCGGAT ACATCTTCTC GGCGTCGTCT
CTGCTGTATT GCGCGGCCGC GGCCACCGGC TTCGTCCTCG GTCATCGGAT GCCGCGGGCG
TTGGTCGTGG TGGCCGCCGT GACAGCCTGC GGCGGCGTGT GGGGGATGGC GGCGGCGCCC
GGCGTCGCCG TCTTCAGCGT CTTCGCCGTC CTCAGCTCCG CCGGTGCCGG CCTGGCTTCA
CCGGCGCTGG TGAGCATCGT GGCCCGCAAC GTCGACCCCC GCCGCGTCGA CAGCGCGCAG
TCGATGGTGA ACGCCGGCAC GGGTCCCGGT CTGGTCGCAG CCGGGGTGCT GGCCCTGGTG
CTCCTTCCTC AGTGGCGTGT CACGTGGGTG CTGATCGGAG TGCTCACCGC GGTATGCGCG
CTCGCCGTGC TGACCGTCGA CCGACGCCGC ACCGGGCCGG CGCAGGGGCA GCACCTCAGA
CCCTCGAAGG CCTGGGTCGC CCGCCACACC GGCGCGATCG CCGCTGCGAT CCTCATGGGC
GCGGGGTCCA GTGCGGTGTG GACCTACGGC CGCAGCCTGC TCGTCGAGAC CGGCGCCAGT
TCGCAACGCG GCACGATCAC CGCGTGGATC GTGCTCGGCC TGGGCTCTGC CGCACTGCTG
CTGACCGCCA AACCACTGGC CCGACTCACG CCGGTGGCCG CCTGGATCCT GACGTGTTCG
GTGATGACCG GCGCCATCGC CCTGCTCGCG ATCGTCCCGC AGGTCACGGT GGCGGCACTG
GTGGGCTGCT TCGCCTTCGG CTGGGGATTC ACCGCGGCCA CCTCGTCGCT CATCCTGTGG
ACCACCGCCA TCGACCCGGC GCATGCGGCG GCGGGCACCG CCATGTTGTT CATCGCGCTG
ATGTTCGGTC AGGCGGTCGG CGCCACCGCG CTGGGCGTGA TCATCAGCGG CGCGCACTTC
GCCCCGGCGT TCGGCGTCGC CGCGGCGCTG TCGGCGGCGT CGATCGTCCT CGCGGCATAC
CAGCGGCGCC GCCGGGGCAT ACCGAAATCT GAAACACCCG TAGCGGTTTC CTGA
 
Protein sequence
MKRSFSFVRG RDPLVIAFGT SFIAAMYGLV RLAYGLFLPD IQADLHLGAA GAGYIFSASS 
LLYCAAAATG FVLGHRMPRA LVVVAAVTAC GGVWGMAAAP GVAVFSVFAV LSSAGAGLAS
PALVSIVARN VDPRRVDSAQ SMVNAGTGPG LVAAGVLALV LLPQWRVTWV LIGVLTAVCA
LAVLTVDRRR TGPAQGQHLR PSKAWVARHT GAIAAAILMG AGSSAVWTYG RSLLVETGAS
SQRGTITAWI VLGLGSAALL LTAKPLARLT PVAAWILTCS VMTGAIALLA IVPQVTVAAL
VGCFAFGWGF TAATSSLILW TTAIDPAHAA AGTAMLFIAL MFGQAVGATA LGVIISGAHF
APAFGVAAAL SAASIVLAAY QRRRRGIPKS ETPVAVS