Gene Mkms_2600 details


Gene Information

Locus tag: Mkms_2600
Symbol:
ID: 4615796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 2734124
End bp: 2735551
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 65%
IMG OID: 639792268
Product: extracellular solute-binding protein
Protein accession: YP_938587
Protein GI: 119868635
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
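
The length fields in this record can be cross-checked against the coordinates. The following is a minimal Python sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2734124
end_bp = 2735551

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1428, matches "Gene Length"
print(protein_length_aa)  # 475, matches "Protein Length"
```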


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCGAG ATCGGTTCGC GCAGCAACGT CAGCTGTCGC GCCGGAACAT GTTGGCCGCC 
ATGGGAATCG CCGGAGCGGC GGCCGCGAGC CTGCCGGTGC TCTCGGCCTG CGGCGTCGGC
GGCAAGACCA GCGCCCCGAA CGGCGCCTCG GAGGTGAGCG GCGGATTCGA CTGGCGCAAG
GCGGCCGGGT CGACGATCAA CATCCTGCAG ACCCCGCACC CGTACCAGCA GAGCTACCAG
CCGCTGCTCA AGGAGTTCAC CGAGCTCACC GGGATCAACG TCAACGTCGA TCTCGTGCCG
GAGGCGGACT ACTTCACCAA GCTCAACACC GAACTGGCGG GCGGCACCGG CAAGCACGAT
GCGTTCATGC TGGGTGCCTA CTTCATCTGG CAGTACGGTC CGCCCGGTTG GATCGAGGAT
CTCAACCCGT GGCTGCAGAA CGCCTCGGCG ACCAACGCCG AGTACGACTT CGAGGACATC
TTCGAGGGTC TGCGCACCTC CACGCGGTGG GACTTCACAT TGGGCAACCC ATTGGGCACC
GGCGGTCAGT GGGCGATCCC GTGGGGGTTC GAGAACAACG TCGTCGCCTA CAACAAGGCC
TATTTCGACC GGCGGGGCAT CAGGAAACTG CCCGACAACT TCGACGATTT CATCCAGCTG
GCCGTGGACC TGACCGACCG CTCGGAGAAC CGGTACGGCA TCGCCACCCG CGGATCGAAG
TCGTGGGCCA CGATCCACCC GGGCTTCATG ACGCAGTACG TCCGCGAAGG CGCCGTCGAC
TACACGTTCG ACGGCCGCGA TCTGGTCGCC GAGATGGACA GCGACAAGGC CGTCGACTTC
ACCGAGAAGT GGATCCGGAT GCAGCACGAG GCCGGCCCCA CCTCGTGGAC CACCTACGAC
TACCCGAACG CCACCGGTGA TCTCGGTGAC GGCAAGGCGA TGATGGTCTA CGACGCCGAC
AGTGCGACGT ATCCGAAGAA CAAGCCAGGC GCGAGCGCGC AGGCGGGGAA CCTCGGCTGG
TATCCGGGTC CGGCCGGCCC CGACGGCAAC TACAAGACCA ACCTGTGGAC CTGGACATGG
GCGATGAACG CCAACTCCCG CAACAAACTG CCGGCCTGGC TGTTCATCCA ATGGGCCACC
GGCAAGGAGT CGATGAACAA AGCCGTCGAG GGCGGCATCT ACGCAGATCC GGTGCGGCAG
TCGGTGTTCG ACACGACGTT CAAGCGGATC GCCGCCGATC AGCACGGCTA CCTCGAGACC
TTCGAGACGG TGATCCCCAC CTCCAAGATC CAGTTCACCC CGCAGAAGAA GTTCTTCGAC
ACCACCAAGG ACTGGGCCGT TGCGCTGCAG GACATCTACG GCGGGGACGA CGCCGCGTCC
CGGCTGCGCA GCCTGGCCAA GACCAACACC TCCAAGGTCA ACCTCTAG
 
Protein sequence
MSRDRFAQQR QLSRRNMLAA MGIAGAAAAS LPVLSACGVG GKTSAPNGAS EVSGGFDWRK 
AAGSTINILQ TPHPYQQSYQ PLLKEFTELT GINVNVDLVP EADYFTKLNT ELAGGTGKHD
AFMLGAYFIW QYGPPGWIED LNPWLQNASA TNAEYDFEDI FEGLRTSTRW DFTLGNPLGT
GGQWAIPWGF ENNVVAYNKA YFDRRGIRKL PDNFDDFIQL AVDLTDRSEN RYGIATRGSK
SWATIHPGFM TQYVREGAVD YTFDGRDLVA EMDSDKAVDF TEKWIRMQHE AGPTSWTTYD
YPNATGDLGD GKAMMVYDAD SATYPKNKPG ASAQAGNLGW YPGPAGPDGN YKTNLWTWTW
AMNANSRNKL PAWLFIQWAT GKESMNKAVE GGIYADPVRQ SVFDTTFKRI AADQHGYLET
FETVIPTSKI QFTPQKKFFD TTKDWAVALQ DIYGGDDAAS RLRSLAKTNT SKVNL
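
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch, which is not part of the original record. It builds a codon table from the standard amino-acid assignments, which translation table 11 shares for all sense and stop codons (table 11 differs in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the sketch assumes the sequence is in frame from its first base and contains only A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 18 bases of the gene sequence in this record.
first_bases = "ATGAGTCGAG ATCGGTTCGC GCAGCAACGT"
last_bases = "TCCAAGGTCA ACCTCTAG"

print(translate(first_bases))  # MSRDRFAQQR, the start of the protein sequence
print(translate(last_bases))   # SKVNL, the end of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the protein sequence and the GC content listed in the record.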