Gene Mjls_2594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_2594 
Symbol 
ID4878310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp2720208 
End bp2721635 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content65% 
IMG OID640139891 
Productextracellular solute-binding protein 
Protein accessionYP_001070867 
Protein GI126435176 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0396421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.707884 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGAG ATCGGTTCGC GCAGCAACGT CAGCTGTCGC GCCGGAACAT GTTGGCCGCC 
ATGGGAATCG CCGGAGCGGC GGCCGCGAGC CTGCCGGTGC TCTCGGCCTG CGGCGTCGGC
GGCAAGACCA GCGCCCCGAA CGGCGCCTCG GAGGTGAGCG GCGGATTCGA CTGGCGTAAG
GCTGCTGGGT CGACGATCAA CATCCTGCAG ACCCCGCACC CGTATCAGCA GAGCTACCAG
CCGCTGCTCA AGGAGTTCAC CGAGCTCACC GGGATCAACG TCAACGTCGA TCTCGTGCCG
GAGGCGGACT ACTTCACCAA GCTCAACACC GAACTGGCGG GCGGCACCGG CAAGCACGAT
GCGTTCATGC TGGGTGCCTA CTTCATCTGG CAGTACGGTC CGCCCGGTTG GATCGAGGAT
CTCAACCCGT GGCTGCAGAA CGCCTCGGCG ACCAACGCCG AGTACGACTT CGAGGACATC
TTCGAGGGTC TGCGCACCTC CACGCGGTGG GACTTCACGT TGGGCAACCC ATTGGGCACC
GGCGGTCAGT GGGCGATCCC GTGGGGGTTC GAGAACAATG TCGTCGCCTA CAACAAGGCC
TATTTCGACC GGCGGGGCAT CAGGAAACTG CCCGACAACT TCGACGATTT CATCCAGCTG
GCCGTGGACC TGACCGACCG CTCGGAGAAC CGGTACGGCA TCGCCACCCG CGGATCGAAG
TCGTGGGCCA CGATCCACCC GGGCTTCATG ACGCAGTACG TCCGCGAAGG CGCCGTCGAC
TACACGTTCG ACGGCCGCGA TCTGGTCGCC GAGATGGACA GCGACAAGGC CGTCGACTTC
ACCGAGAAGT GGATCCGGAT GCAGCACGAG GCGGGCCCCA CCTCGTGGAC CACCTACGAC
TACCCGAACG CCACCGGTGA TCTCGGTGAC GGCAAGGCGA TGATGGTCTA CGACGCCGAC
AGCGCGACGT ATCCGAAGAA CAAGCCCGGC GCGAGCGCAC AGGCGGGGAA CCTCGGCTGG
TATCCGGGTC CGGCCGGCCC CGACGGCAAC TACAAGACCA ACCTGTGGAC CTGGACGTGG
GCGATGAACG CCAACTCCCG CAACAAACTG CCGGCCTGGC TGTTCATCCA GTGGGCCACC
GGCAAGGAGT CGATGAACAA AGCCGTCGAG GGCGGCATCT ACGCAGATCC GGTGCGGCAG
TCGGTGTTCG ACACGACGTT CAAGCGGATC GCCGCCGATC AGTACGGCTA CCTCGAGACC
TTCGAGACGG TGATCCCCAC CTCCAAGATC CAGTTCACCC CGCAGAAGAA GTTCTTCGAC
ACCACCAAGG ACTGGGCCGT TGCGCTGCAG GACATCTACG GCGGGGACGA CGCCGCGTCC
CGGCTGCGCA GCCTGGCCAA GACCAACACC TCCAAGGTCA ACCTCTAG
 
Protein sequence
MSRDRFAQQR QLSRRNMLAA MGIAGAAAAS LPVLSACGVG GKTSAPNGAS EVSGGFDWRK 
AAGSTINILQ TPHPYQQSYQ PLLKEFTELT GINVNVDLVP EADYFTKLNT ELAGGTGKHD
AFMLGAYFIW QYGPPGWIED LNPWLQNASA TNAEYDFEDI FEGLRTSTRW DFTLGNPLGT
GGQWAIPWGF ENNVVAYNKA YFDRRGIRKL PDNFDDFIQL AVDLTDRSEN RYGIATRGSK
SWATIHPGFM TQYVREGAVD YTFDGRDLVA EMDSDKAVDF TEKWIRMQHE AGPTSWTTYD
YPNATGDLGD GKAMMVYDAD SATYPKNKPG ASAQAGNLGW YPGPAGPDGN YKTNLWTWTW
AMNANSRNKL PAWLFIQWAT GKESMNKAVE GGIYADPVRQ SVFDTTFKRI AADQYGYLET
FETVIPTSKI QFTPQKKFFD TTKDWAVALQ DIYGGDDAAS RLRSLAKTNT SKVNL