Gene Information |
Locus tag | Mjls_3401 |
Symbol | |
ID | 4879113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. JLS |
Kingdom | Bacteria |
Replicon accession | NC_009077 |
Strand | + |
Start bp | 3565935 |
End bp | 3567473 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640140704 |
Product | extracellular solute-binding protein |
Protein accession | YP_001071670 |
Protein GI | 126435979 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
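
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines. This is a minimal illustration, not part of the record: it assumes that Start bp and End bp are 1-based and inclusive (which matches the listed Gene Length) and that the unspliced CDS ends in a single stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields above.
length = gene_length_bp(3565935, 3567473)
print(length)                     # 1539
print(protein_length_aa(length))  # 512
```

Both results agree with the Gene Length (1539 bp) and Protein Length (512 aa) rows.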
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.475057 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCGTG ACGCGGTGCA GGCTGCCGGC CTGGCCCTGA TCGTGGCGCT GCTGCTGGCG GTCACGGGCT GTTCCACCGG AGAACGCGTA GACCTCGGTG ACGCCACCTC GGGCAATCTC GTAGCCGCGA TCGCGGGCGA ACCCGACCAA CTCGACCCGC ACAAGACCAG CGCGTACTTC TCGTTCGAGG TGCTCGAGAA CGTGTTCGAC ACCCTGGTCG AACCCGACGC CGCCCTCGAG ATGCGCCCGG CGCTCGCCGA GAGCTGGGAG GTCAGCCCCG ACCAGCGGGT GTGGACGTTC CACCTGCGGC GCGGCGTGAC CTTCCACGAC GGCTCGCCGT TCACCGCCGA CGACGTCGTC TACTCCTACC GCCGCATCAT CGACGAGGAA CTCACCAACG TCGACAAGTT CAGCGCCGTC ACCGACGTCA CCGCCGTCGA CCCCGCCACC GTGCGCATCA CCGTCGACAA ACCCACCCCG AACCTGCTGA CCAACCTCGG CGGGTTCAAG GGGATGGCGA TCGTGTCGCG GGCCAACGTC GAGAGCGGAC GCATCGCCAC CCATCCCGTC GGGACCGGAC CGTTCTCGTT CGCCGGCGCC ACCAGCGGTG ACTCGATCAC GTTGCGCGCC AACCCCTCGT TCTGGGGTGG ACCGCCCCGC ATCTCGGGGG TCACGTTCCG GTTCATCTCC GAACCGTCGA CGGCGTTGTC GGCCCTGCAG GCAGGCGAGA TCGACTGGAC CGACTCGGTT CCCCCGCAAC GGGTTTCGCA GCTGCGTGAC GACGAGTCCC TGCGCCTCGC CGTCACACCG AGCAATGACT ACTGGTATCT GGCGCTCAAC GAGGCGCGTG CCCCGTGGAA CGACGTGCGG GTGCGCCAGG CGATCGCCTA CGGCATCGAC CGCGAGGCGA TCGTCGCCGC CACGAGCTAC GGCACCGCGG CCGAGAATCA GCTCGCGATC CCCGAGGGCA ACCCCTGGTA CACCCCGTAC GACCGGTACT CCGCCGATCT CGAGAAGGCC AGGAGCCTGC TGGCCGAGGC GAACGCCGAA CCGGACCGGC TCGACATGCT GGTCACCAGC GAGTATCCCG AGACGGTCAC CGCCGCGCAG ATCATCGCCG ACAACCTCGC CCCGCTGGGG ATCACGGTCG ACATCCGCAC CGTGGACTTC GCCACCTGGC TCGACGAACA GAACAACGGC AACTTCGACA TGCTGATGAT GGGTTGGCTC GGCAACATCG ACCCCGACGA CTTCTACTAC GCCCAGCACC ACACGAACGG CACCAGCAAC GCCCAGAAGT TCTCCGACCC GGAGGTGGAC CGCCTGCTCG ACGCCGGCCG CGTGGAGACC GACCGCGACG CGCGCCACGA CGTCTACGCC AAGGCCGCCA CCCGCATCGC CGACGAGGTC AGCTACATCT ACCTCTACAA CCCGTCGGTC ATCCAGGCCT GGACCCCGGC CCTGTCGGGA TACGAGGCAC GCCGCGACGG TGCGGTCCGC TTCCGCGACG CCGTCCTCGG TGAGGACGAA AGCTCATGA
|
Protein sequence | MRRDAVQAAG LALIVALLLA VTGCSTGERV DLGDATSGNL VAAIAGEPDQ LDPHKTSAYF SFEVLENVFD TLVEPDAALE MRPALAESWE VSPDQRVWTF HLRRGVTFHD GSPFTADDVV YSYRRIIDEE LTNVDKFSAV TDVTAVDPAT VRITVDKPTP NLLTNLGGFK GMAIVSRANV ESGRIATHPV GTGPFSFAGA TSGDSITLRA NPSFWGGPPR ISGVTFRFIS EPSTALSALQ AGEIDWTDSV PPQRVSQLRD DESLRLAVTP SNDYWYLALN EARAPWNDVR VRQAIAYGID REAIVAATSY GTAAENQLAI PEGNPWYTPY DRYSADLEKA RSLLAEANAE PDRLDMLVTS EYPETVTAAQ IIADNLAPLG ITVDIRTVDF ATWLDEQNNG NFDMLMMGWL GNIDPDDFYY AQHHTNGTSN AQKFSDPEVD RLLDAGRVET DRDARHDVYA KAATRIADEV SYIYLYNPSV IQAWTPALSG YEARRDGAVR FRDAVLGEDE SS
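
The relationship between the two sequence rows above can be checked with a short script. The following is a sketch under stated assumptions, not the tool used to produce this record: it builds the codon assignments of NCBI translation table 11 (identical to the standard code for amino-acid assignments; the alternative start codons of table 11 are not handled, which does not matter here because the gene begins with ATG), ignores the spaces that separate the 10-base blocks, and stops at the first stop codon. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise the case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCGGCGTG ACGCGGTGCA GGCTGCCGGC"))  # MRRDAVQAAG
# Final nine bases of the gene sequence above (two codons plus the TGA stop).
print(translate("AGCTCATGA"))  # SS
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence row, and `gc_fraction` on the same input can be compared against the GC content field.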