Gene Information |
Locus tag | Amir_3610 |
Symbol | |
ID | 8327800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 4223112 |
End bp | 4225286 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644944105 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003101345 |
Protein GI | 256377685 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
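The coordinate and length fields above are tied together by simple arithmetic, and the minimal sketch below checks that they agree. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with one stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record for Amir_3610.
gene_length = span_length(4223112, 4225286)
protein_length = expected_protein_length(gene_length)

print(gene_length)     # 2175
print(protein_length)  # 724
```

Both results match the Gene Length and Protein Length fields listed in the record.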
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATC TCGAGCTGCC CGACCGGCCC AGGGGCCGCA GACCCCGGCG GCGGGTGCTG CTCATCGCCC TGTTCGCCGT GGTGACCGCC GTGGTCGCCT CGGTGTCCGC CTGGCTGGCC GCGGACGTGG GCGAGCGGTG GCGGGCGGGC GACGTCCTCC ACCTGTGCGC GACCGGGGTC ATGCTGGTGC TCGCCGGGGT GCTGATCACC CTGCTCCCCG GGGCGGTCAA GGAGTTCTGG CGCTGGTCCG GGGTGTTCGC CGCCGTCGCC GAGCGGGTGA TCAGGGCGCT GCGCCCGCTC AAGGCCGTCG GCGTGGTCGT CACGGTGCTC GTGCTGGGCG TCGGCGGCTC CCTGCTGGCC CCGGCTCCCG ACACGGACGC GACCGCGCTG GAGGAGGGCG AGCTGTGGCT CATGGCGGGC GAGGACCTGA GCCCGAGCGA CCCGCGCGCC GTGCTGCTGG AGCAGTGGAA CCAGGCGCAC CCCGAGACCC CGGCGAGGCT GGTGGACGCC GCCGGGGCCA CGGACGAGCA GCGGGAGCGG ATGCTCGACG ACGCCCGCCC CGACGGGCCG CACCGCGCGG ACGTGTACCT GCTGGACGTG GTGTGGCTGC CGGAGTTCGC GGGCGAGGGG CACCTGCGCG AGCTGGACGG GTCCACGATC GGCGGCGGCA CGGCCGACTT CCTGCCCGTG GTGCTCGACA CCTGCCAGGA CGGCGGGAAG CTGTGGGCGC TGCCGCTCAA CACCGACGTC CGGCTGCTCT ACCACCGCAC CGACGTCCCC GGCCTGGTCG TGCCGACCAG CTGGGACGGC CACTCCGGCT CCGCCGCGAC CAGGGCGCAC GGCGGGCTGG AGGCGGCCGA CGCGCCGCAG CTGGCCGAGG AGATGCTCAC CGTGGAGGCG CTGGAGGCGA TCTGGGCGGC GGGCGGCAAC GTGGTCACCC GTGACGGGGA GGTCACCCTG ACCCCGGACG GGAGCAGGGT GCAGTTCACC GAGGACGACT ACGAGGGGAT GCGCAAGCTC AACGCGATGG CCAGGACCGA GGGCGTGGTG CCGCGGGGCG TGCGCCCCGA GGAGACCGGG GCCGAGGACG CGCTCACCCT GTTCGCGGAC GGGCGGACCG CGTTCCTGCG GAACTGGGCC GTCACCCACC GGCAGCTCAC GGGCAGGCAG GACAGGCCGA GCACGACGCA CGTCGGGTTC GACAGCGCCG TGCAGCCCGC GCCGAGCGTG CTGGGCGGGC AGGACCTGGC GATCTCCCGG CACACCGCGA AGCCGAAGGC GGCGAAGGCG CTGCTGGAGT TCCTGACCAG CACGCAGAGC CAGCAGATCC TGGCCGAGGT CGGCGGGTTC GCGCCGAGCC GGACGTCGGT GTACGAGCTG TCGGACAGCA GGCACCTGGC GAACGTGCGC ACCGCGCTCA AGGACGCCAG GCCCCGGCCG AGGACCGAGC GGTACACCGA GTTCAGCAAG CTCTTCCGGG AGGCGGTCGA GAAGGTGGTG GCGCCGGAGG ACTCGATCCC GGACGAGACC GCGCGGAAGC TGGCGGACGT GCTGCGCGGG CGATCCGGCG CGGGTGATCA GACGCGGGTG GGGCGGGGGT TTCGCGGCGG GCGGGAACGG TGTTTCGAAG GGGTTCGAAA GTCCGCTCCG ACGGGCGGTG GGCGCGCCAG GCTGAGGGCC GACGCACCGA CCGAGACCGT TGGAGTGATG ATGCGAAGGA CCTTGACCGC GTTGTTCGTC GCCGTCGTGG CGCTGCTGTC GTCGCTGGTG TCCGCGAGCG CGGAACCCGG CTCGGTGGAC 
AGATCCGCCA CGCGGGTGGA GGCGCCGGTG ACCGTGGCCG CACCAGTTCA GCAGGACGTG TCGGCGCAGG CGCTGTCGTG CACGGCGGGC GACCTGTGCG CGTGGAACGG CGTCGGGGGC GGGAGCCGTT GCAGCTGGAC CAACAGGGAC AACGACTGGT GGTACGCCCC GACCACCTGC TCGTGGTCGT CGGGCAGCGC GGTGTGGTCC GTCTACAACA ACGGGCGCAA CACCGCCTAC GACAGGGTGT GCCTCTACCC CGAGGCGAAC TACGGGGGCA GCACCGCGTA CTACGTCCTG CGGGGGCAGC AGGCCGAGGG GTGGCCTGGC GTGATCATCC GCTCCCACAG GTGGGTCAAC GGGTCCTGCT GGTGA
|
Protein sequence | MADLELPDRP RGRRPRRRVL LIALFAVVTA VVASVSAWLA ADVGERWRAG DVLHLCATGV MLVLAGVLIT LLPGAVKEFW RWSGVFAAVA ERVIRALRPL KAVGVVVTVL VLGVGGSLLA PAPDTDATAL EEGELWLMAG EDLSPSDPRA VLLEQWNQAH PETPARLVDA AGATDEQRER MLDDARPDGP HRADVYLLDV VWLPEFAGEG HLRELDGSTI GGGTADFLPV VLDTCQDGGK LWALPLNTDV RLLYHRTDVP GLVVPTSWDG HSGSAATRAH GGLEAADAPQ LAEEMLTVEA LEAIWAAGGN VVTRDGEVTL TPDGSRVQFT EDDYEGMRKL NAMARTEGVV PRGVRPEETG AEDALTLFAD GRTAFLRNWA VTHRQLTGRQ DRPSTTHVGF DSAVQPAPSV LGGQDLAISR HTAKPKAAKA LLEFLTSTQS QQILAEVGGF APSRTSVYEL SDSRHLANVR TALKDARPRP RTERYTEFSK LFREAVEKVV APEDSIPDET ARKLADVLRG RSGAGDQTRV GRGFRGGRER CFEGVRKSAP TGGGRARLRA DAPTETVGVM MRRTLTALFV AVVALLSSLV SASAEPGSVD RSATRVEAPV TVAAPVQQDV SAQALSCTAG DLCAWNGVGG GSRCSWTNRD NDWWYAPTTC SWSSGSAVWS VYNNGRNTAY DRVCLYPEAN YGGSTAYYVL RGQQAEGWPG VIIRSHRWVN GSCW
|
| |
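The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The sketch below strips the whitespace, computes GC content, and translates the DNA. It hard-codes the standard codon assignments, which translation table 11 shares for all 64 codons. Table 11 differs in which codons may start translation, and this sketch does not handle that case; here the gene starts with ATG, so it makes no difference. The helper names are illustrative only.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (same as NCBI table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon (stop not included)."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGGCTGATC TCGAGCTGCC CGACCGGCCC"))  # MADLELPDRP
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field is one way to check that the two fields agree. `gc_percent` on the full gene can be compared with the GC content field in the same way.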