Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_3472 |
Symbol | |
ID | 8327662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 4036682 |
End bp | 4037923 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644943972 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003101212 |
Protein GI | 256377552 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCACGA CACCGGCCGC CCTGGCGGTC CTGATCGCCC TGCTACCCGC GTTGCCCGCC TGCTCCGCCG CCCCGGCCGA CGGCGGCGAG GTCCTCACCG TCGTCGGCTG GAAGGGCGGC GGCGGCGAGG AGGCCCGATT ACCCGAGCTG AACGCGGCCT TCGAGCGGGC CCACCCCGGC GTGCGGGTCG AGCACCACTA CGTCGGGCCC GGCAACTACG AGACGTACAA CAACCCCAAG CTCGCGTCCG GCACCGCCGC CGACGTGATC ATGGTCGACA AGGCCAAGAC GCGCACCTGG ACCGACCAGG GCTACCTCGC CGACCTGTCC GACCAGCCCT GGGTGCCCGC GGTCCACCCG GAGCTGGCGC CGTTCACCAC GGTCGACGGG CGCACCCGCC AGTTCACCCA GGAGAACATC GGCATCGGCC TGTACGCGAA CCTCGACCTG CTGGCCTCGG CCGGGATCAC CGAGGTCCCC CGCGACTGGC CCACCCTGCT CGCCACGCTC GACACCCTGC GCGAGAAGGG CAAACCCGGC CTGCTGGTGC CCAACAAGGG CGGTTGGGGC GGGGTGCAGC TCGCGCTCGC GCTGGCCGTG AACCGGCTCG CCCCGACCTG GAGCGCCGAC TACAACACCG GCCGCGCCCG CTTCGGCCCG GACTGGGCGC CGGTCGTGGA CGAGCTCAAG CGCGCGGTCG CCTCCGGCGC GGTCGACGGC AGGCTCATGC TCGGCCTGGA CGCCTGGTCG GACGCGCTCA CCGAGTTCAA GGCGGGCCGC TGGGCGTTCC TCATCCAGGG CGCGTGGAAG CTCGCCGACT TCCGCGCCGA CCTCGGCTTC CGCTTCTCCC TGAGCCCCAT CCCCGCGGGC CCGGCCGGGA GCGAGCCGGT GGCAGTGACG TTCGTGGGCA CCGGCTGGGC CGTCAACGCG GACGCCCAGC GCCCCGACCT GGCCCGCGAG TACGTGAAGT TCATGGCCGA ACCCGGCAAC GCCCGCCTGT ACTGCGAGGC CGAGGGCGCG TTCTCCACCC TCGTCGGCGG GACCACGACC CTGCCCGCCG AGGCGTCCGC GCTGGTGGCG GCCTTCGACG CGGGCCGCTG GGCCGGATCG CCCGCGCAGG GCCTCGACTT CCCCGGAGCC GAGGAGGTCA TGGGCACCGC CCTGCAGGAG GTGTTCCTGG ACCCGACCAC CCCCACCGGG GACGTGCTCG CCGCCCTCGA CCGGATGCCC GCGCGGGGGT GA
|
Protein sequence | MRTTPAALAV LIALLPALPA CSAAPADGGE VLTVVGWKGG GGEEARLPEL NAAFERAHPG VRVEHHYVGP GNYETYNNPK LASGTAADVI MVDKAKTRTW TDQGYLADLS DQPWVPAVHP ELAPFTTVDG RTRQFTQENI GIGLYANLDL LASAGITEVP RDWPTLLATL DTLREKGKPG LLVPNKGGWG GVQLALALAV NRLAPTWSAD YNTGRARFGP DWAPVVDELK RAVASGAVDG RLMLGLDAWS DALTEFKAGR WAFLIQGAWK LADFRADLGF RFSLSPIPAG PAGSEPVAVT FVGTGWAVNA DAQRPDLARE YVKFMAEPGN ARLYCEAEGA FSTLVGGTTT LPAEASALVA AFDAGRWAGS PAQGLDFPGA EEVMGTALQE VFLDPTTPTG DVLAALDRMP ARG
|
| |