Gene Mkms_3905 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3905 
Symbol 
ID4611840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4111829 
End bp4113139 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content67% 
IMG OID639793584 
Productgeneral substrate transporter 
Protein accessionYP_939887 
Protein GI119869935 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0290855 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCA TCGACGGACA CCGCCCGCGG GAGGCAGAAC CGCTCGCGGT GCGGCGCGCG 
GTGCGGGGCG CCGCGATCGG CAACACCGTC GAGTGGTTCG ACTTCGCGAT CTACGGCTTC
CTCGCCACCT ACATCGCGGA GAAGTTCTTC CCCACCGCCG ATGACACCGC CGCACTGCTC
AACACCTTCG CGATCTTCGC CGCCGCCTTC TTCATGCGGC CGCTCGGCGG GTTCTTCTTC
GGCCCCCTCG GGGACCGGAT CGGGCGTCAG CGTGTGCTGG CCCTGGTCAT CCTGCTGATG
TCGGCGTCGA CGTTCGCGAT CGGCCTGCTG CCGAGCTACG ACGCCATCGG TGTCGCCGCC
CCGCTGCTGC TGCTCCTGCT GCGGTGTCTG CAGGGATTCT CCGCCGGCGG CGAATACGGT
AGCGGCGCCT GCTATCTCGC CGAATTCGCA CCGGACCGCC ACCGCGGGTT CGTCGTGAGC
TTCCTGGTGT GGTCGGTCGT GGTCGGATTC CTGCTCGGGT CGTTGACGGT CACCGCGCTC
GAGGCGGTGC TCGACGAATC GGCGATGGAC TCCTACGGGT GGCGCATCCC GTTCCTCATC
GCCGGCGTGC TGGGCGGCGT CGGACTCTAC ATCCGGTTGC GGCTGGGTGA CACACCGGAT
TTCGAGGAGC TCAAGGAATC CGGCGAGGTG TCCACCTCAC CGCTCAAGGA TGCGGTGAGA
ACCTCGTGGA AACCGATCCT GCAGATCGCC GGGCTCGTCA TCATCCACAA CGTCGGCTTC
TACACGGTGT TCACCTACCT GCCGAGCTAC ATGACCGAGA CGCTCGACTT CACCAAAACC
GATGCGTTCC TCTCGATCAC CGTGGCCAGT GTGGTCGGAT TGGTGCTCAT CCTCCCGCTC
GGCGCACTGT CGGACCGCCT CGGCCGCAAA CCGTTGCTGA TCGCGGGTGC GGTGGGCTTC
GCCGTCTTCG CCTATCCGCT GTTCGCACTC TTCAACACCG GATCGCTGCT CGGCGCGATC
GCCGGACACG CCGGTCTGTC GGCACTCGAG GCGCTGTTCG TCGCGGGCTC ACTGGCCGCG
GGCGCGGAGT TGTTCGCCAC CCGGGTGCGC TCCAGCGGCT ACTCGATCGG CTACAACACG
TCGGTGGCGC TGTTCGGCGG TACCGCACCG TATGTCGCGA CCTGGCTCAC CGACCGCACC
GGAAACGACC TCGCACCGGC CTACTATCTG ATCGTCGCCG CCGTCATCTC GTTGGCCACC
ATCCTGACCA TGCGCGAAAC CGCCACCCGC CCACTACGGA TGCGGGTCTG A
 
Protein sequence
MATIDGHRPR EAEPLAVRRA VRGAAIGNTV EWFDFAIYGF LATYIAEKFF PTADDTAALL 
NTFAIFAAAF FMRPLGGFFF GPLGDRIGRQ RVLALVILLM SASTFAIGLL PSYDAIGVAA
PLLLLLLRCL QGFSAGGEYG SGACYLAEFA PDRHRGFVVS FLVWSVVVGF LLGSLTVTAL
EAVLDESAMD SYGWRIPFLI AGVLGGVGLY IRLRLGDTPD FEELKESGEV STSPLKDAVR
TSWKPILQIA GLVIIHNVGF YTVFTYLPSY MTETLDFTKT DAFLSITVAS VVGLVLILPL
GALSDRLGRK PLLIAGAVGF AVFAYPLFAL FNTGSLLGAI AGHAGLSALE ALFVAGSLAA
GAELFATRVR SSGYSIGYNT SVALFGGTAP YVATWLTDRT GNDLAPAYYL IVAAVISLAT
ILTMRETATR PLRMRV