Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_3336 |
Symbol | |
ID | 8327526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 3913209 |
End bp | 3914822 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644943847 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003101087 |
Protein GI | 256377427 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGTGG ACCGCAGGGC TTTTTTAGGT CTTCTCGCCG CAGCGGGCGG AACGGCGCTG GTCGGGTGCG AGGCGCCTGC CCCGCGCGGC GGTTCCGCGC TCGGCGGCGA CGCGCTGGCG GCGCTGCTGC CCGCGCACCG GCCGGTGGAG TTCGCCGAGC CCGACCTGCC CGGCGTCAAC GGCTCGGTGC CCGGCTACCT GACCTACCCG GCCAACCCGG TGCGCGGCGT GCGGGGGCCG GTGCTCAGCG GCGAGGTCAC CGCGATGACC CCGTCGTTCT GGCCGCCGGC GCCGGGGCCG GGGCGGAACT CCTACTACGA CGCGGTCAAC GAGCGCCTCG GCGGCGCGGT GCGGTTCGAG ACCGTGTCCG GGGCCGACTA CCAGGCCAAG CTCTCCGCCC TGATGGCGGC GCGGCAGGTG CCGGAGCTGA CCGTGGTGCC CACCTTCACC ATGCCGCCCC GGTTCAGCGA GGGCGTGGGC GAGGTGTTCC GCGACCTGAC CGACTTCCTG TCCGGCGAGC GCGTCGCCGA CTACCCGATG CTGGCCAACA TCCCCACCGA CTCGTGGCAC GCGTGCGTGC ACAACGGGCG CCTGCACGGC GTTCCCTACC CCGGCCAGCT GTTCCCCGAG GTGCTGTTCT ACCGGGACGA CGTGTTCGAG CAGCTCGGGG TGGAGCCGCC GCGCAGCGCC GAGGAGTTCG CGGCGATGGC CAAGCGGCTC AACGACCCCG CGAACGACCG GTGGGCGCTC GGCGACGTGT TCCGCTCGCT CGTCCGCGCC TTCGGCGGCC GGGGCGACTG GGTGCGCGAC GACTCGGGCA AGCTGCTGAA CCAGCTCGAA ACCCCCTGGT ACGCCGAGGC GGTGCGGTTC ACCCGGTCCC TGTACGACGC CGGGTGCGTG CACCCGGACA TCGTGGCGGG CAACTGGAAC CGCGGCAACG AGCTGTTCGC GGCCAAGCGG ATGATCGTCA ACCAGGGCGG GATGGGCGCG TGGGCCGAGC AGGTCGCGCA GCAGCGGCCC GCCGACCCCG GTTTCCGGAT GACCGCGCTG CCGCTGTTCG CGCACGACGG CGGCGAGCCC GCCTACCCGG TGGCCGCGCC GACGGTGATG GTCGCGTTCG TCCGCAAGGA CGTGACCGAC GAGCGGGTGC GGGAACTGCT GCGGCTGTGC GACTTCGCCG CCGCGCCGAT CGGCACCGAG GAGCACCGGC TGCTGCGGTA CGGGGTGGAG GGCGTGCACA GCGAGCGCGA CGCGCGGGGC AACCCGGCGC TGACCCCGCT GGGGCAGAAG GAGATCACGC TGACCTACGG GTTCGCGGCG GGCCCGCCGG AGGCGATCAC CACCACCGAC CACCCGGACC TCGTGCGGGC CCAGCACGCC TGGTACGCGC GGGAGTGGGG ACACCAGACC AAGCCGCTGG CGTTCGGGCT GCGCCTGGAG GAGCCGCCGG AGTTCGCGAC CCTGGCGAAG GAGTTCGCGG ACCGGACCAC GGACGTCCTG CGCGGGCGGG CCGAGCTGTC CGAGGTGGAC GGACTGGGCG AGCGGTGGCG CAAGGCGGGC GGGGATCGGC TGCGGGAGTT CTACGACAAG GCGCTGCGGG ACGCCGGGCG CTGA
|
Protein sequence | MPVDRRAFLG LLAAAGGTAL VGCEAPAPRG GSALGGDALA ALLPAHRPVE FAEPDLPGVN GSVPGYLTYP ANPVRGVRGP VLSGEVTAMT PSFWPPAPGP GRNSYYDAVN ERLGGAVRFE TVSGADYQAK LSALMAARQV PELTVVPTFT MPPRFSEGVG EVFRDLTDFL SGERVADYPM LANIPTDSWH ACVHNGRLHG VPYPGQLFPE VLFYRDDVFE QLGVEPPRSA EEFAAMAKRL NDPANDRWAL GDVFRSLVRA FGGRGDWVRD DSGKLLNQLE TPWYAEAVRF TRSLYDAGCV HPDIVAGNWN RGNELFAAKR MIVNQGGMGA WAEQVAQQRP ADPGFRMTAL PLFAHDGGEP AYPVAAPTVM VAFVRKDVTD ERVRELLRLC DFAAAPIGTE EHRLLRYGVE GVHSERDARG NPALTPLGQK EITLTYGFAA GPPEAITTTD HPDLVRAQHA WYAREWGHQT KPLAFGLRLE EPPEFATLAK EFADRTTDVL RGRAELSEVD GLGERWRKAG GDRLREFYDK ALRDAGR
|
| |