Gene Mkms_4427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4427 
Symbol 
ID4612370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4657314 
End bp4658489 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content72% 
IMG OID639794113 
Productmajor facilitator superfamily transporter 
Protein accessionYP_940408 
Protein GI119870456 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0420753 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.320223 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGGG TGCCCACTCG GCAGCCGTGG CTCACCCGGA ATGTCCGACT GTTGTCGGCG 
GTGTCGTTCC TGCAGGACGC CGCCAGCGAG TTGCTCTACC CGCTGCTGCC GATCTACCTG
ACCTCGGTAC TGGGTGCGCC GGCGGCGGTG GTCGGGGCGG TCGAGGGTGC CGCCGAGGGG
GCGGCCTCGC TGACCAAGCT GGCCTCTGGA CCGCTGGCGG ACCGGTTCGC CAGACGGCCG
CTGATCGCCA CCGGCTACGG GATGGCGGCG CTGGGCAAGG TGATCGTGGC CGCCGCCGGG
ACCTGGCCTG GCGTGCTGGC CGGACGGGTC ACCGACCGGC TCGGCAAGGG TCTGCGCGGC
GCCCCGCGCG ATGCGCTGCT GGTGGACGGT ATCGACCCCG ACGCACGAGG CCGGGTCTTC
GGATTCCACC GTGCCATGGA CACTTTCGGC GCGGTGGTCG GGCCGCTGCT CGGGCTGGCC
GGCTACGAAC TGCTCGACCA TCAGATCCCA CCGCTGCTGT GGGTGGCGGT GATTCCCGCG
GTGCTGTCCG TGGCGCTGGT GTTCTGCGTG CGCGAACGGA CCAGGGCGGT GCCGGCGGTG
GCGCGGCCGC CGCTGTTGTC GCGGGTCCGG GACCTCCCTC GGCGCTACTG GCGGGTGACC
GCGCTGCTGG TCGGCTTCGG CGTGGTGAAC TTCCCCGACG CGCTTCTCCT GTTGCGGCTC
AATGAGATCG GTTTCTCCGT CGTCGAGGTG ATCCTGGCCT ACGTCGGCTA CAACGTCGTG
TACGCGGTCG CGAGTTACCC GGCCGGCGCG TTGGCCGACC GCGTCGGCAC ACCGGCCGTG
TTCGCCGTCG GGTTGGGGTT CTTCGCCGTC GGTTACACCG GCCTGGGGCT GACCACCGAC
ACGCTCACCG CATGGATGCT GATCGGGGTC TACGGGTTGT TCACCGCGTG CACCGACGGG
GTCGGCAAGG CGTGGATCTC ATCGCTGGTC GGCGCGGACG TGCAGGCCAG CGCCCAGGGC
GTGTTCCAGG GCGCAAGTGG GTTCGCGATC CTGGCGGCCG GGCTGTGGGC GGGTCTGCTG
TGGGGCGCCG ACGGGACGGT GCCGCTGCTG ATCTCCGGCC TGGCGGGCGG GGTGTTCGCC
GTGATCGTCG CGGTGTTCGC CGTGCGGCAC CGCTGA
 
Protein sequence
MTRVPTRQPW LTRNVRLLSA VSFLQDAASE LLYPLLPIYL TSVLGAPAAV VGAVEGAAEG 
AASLTKLASG PLADRFARRP LIATGYGMAA LGKVIVAAAG TWPGVLAGRV TDRLGKGLRG
APRDALLVDG IDPDARGRVF GFHRAMDTFG AVVGPLLGLA GYELLDHQIP PLLWVAVIPA
VLSVALVFCV RERTRAVPAV ARPPLLSRVR DLPRRYWRVT ALLVGFGVVN FPDALLLLRL
NEIGFSVVEV ILAYVGYNVV YAVASYPAGA LADRVGTPAV FAVGLGFFAV GYTGLGLTTD
TLTAWMLIGV YGLFTACTDG VGKAWISSLV GADVQASAQG VFQGASGFAI LAAGLWAGLL
WGADGTVPLL ISGLAGGVFA VIVAVFAVRH R