Gene Information |
Locus tag | Amir_2404 |
Symbol | |
ID | 8326593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 2663250 |
End bp | 2664524 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644942949 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003100190 |
Protein GI | 256376530 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
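The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions fit the numbers in this record but are not stated in it. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2663250
end_bp = 2664524

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # Gene Length field
print(protein_length, "aa")   # Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields.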
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000132449 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTTAA CCCGAGCACT GGCCGCCGCA GCGGCGCTCT CCCTGCTCGC GTCCTGCGGA GGCCCGGCTG CCCCGCAGGA GCCCACCGGC CCGATCACCC TCACCTGGTG GCACAACGGC ACCTCCGACC CGATCAAGAC GGTCTGGCAG GACGTCGTCA CCGACTACCA GGGCAAGAAC CCGGACATCA CGATCGAGGC CCAGCCCATC CAGAACGAGG GCTTCTCCAC CAAGATCCCC CTGGCGCTCC AGTCGCCCAC CCCGCCGGAC GTCTACCAGC AGTGGGGCGG CGGCGACCTG GCCTCGCAGG TCACCTCCGG CAAGCTCGCC GACATCACCG ACGCGAGCAA GCCGTGGATC TCCACCATCG GCGACTTCGC CAAGGGCTGG CAGGTCGACG GCCGCCAGCT CGGCGTCCCG TTCGCCCAGC ACGTCGTGGG CTTCTGGTAC CGCAAGGACC TGTTCTCCCA GGCCGGGATC AGCGCCCCGC CCACCACGAT GGCCGAGCTG AACGCCGCCG TCGCCAAGCT CAAGTCCGCC GGGCTCGCCC CGATCGCCGT CGGCGGCAAG GACCGCTGGC CGGACGCCTT CTACTGGAAC TACTTCGCCG TCCGCGAGTG CTCGCAGCAG ACCATCGAGT CCTCGGTGAA GAACCTCAAG CTCGACGACC CGTGCTGGGT CAAGGCGGGC CAGGACCTGG TGGACTTCCT GCGGACCGAG CCGTTCCAGG AGGGCTTCAA CGGCACCCCC GCCCAGCAGG GCGCGGGCAG CTCGGCGGGC CTGGTGGCCA ACGGCAAGGC CGCCATGGAG CTGCAGGGCG ACTGGAACCC CGGCACCATG TCCTCCCTCA CCGAGGACAA GGACCTGGAC AGCAAGGTCG GCTGGTTCCC GTTCCCGACC GTCCCCGGTG GCCAGGGCGA CCCGGCGGCC GTGCTCGGCG GCGGTGACGG CTTCTCCTGC ACCACCCGCG CGGCAGCCGC CTGCGCGAAG TTCCTGGAGT ACCTGATCGG CCCGGAGATC CAGAACAAGC TCGCCGCCGC GGGCACCGGC CTGCCGGTCA ACGAGGCCGC CGTCACCGCG CTCAAGACCG AGAACCTCAA GACCGTCGCC GAGCACGGCC GCAATGCGCC GTACGTGCAG ATGTACTTCG ACCGGGCCTT CCCGACCGAC GTGGGCGCCG CGCTGAACGA GGCCGTCGCG AACCTGTTCG CGGGCCAGGG CTCGCCTCAG GGCATCGTCG ACGCGGTCAA CGAGGCGGCG GACGGGGCGA AGTGA
|
Protein sequence | MRLTRALAAA AALSLLASCG GPAAPQEPTG PITLTWWHNG TSDPIKTVWQ DVVTDYQGKN PDITIEAQPI QNEGFSTKIP LALQSPTPPD VYQQWGGGDL ASQVTSGKLA DITDASKPWI STIGDFAKGW QVDGRQLGVP FAQHVVGFWY RKDLFSQAGI SAPPTTMAEL NAAVAKLKSA GLAPIAVGGK DRWPDAFYWN YFAVRECSQQ TIESSVKNLK LDDPCWVKAG QDLVDFLRTE PFQEGFNGTP AQQGAGSSAG LVANGKAAME LQGDWNPGTM SSLTEDKDLD SKVGWFPFPT VPGGQGDPAA VLGGGDGFSC TTRAAAACAK FLEYLIGPEI QNKLAAAGTG LPVNEAAVTA LKTENLKTVA EHGRNAPYVQ MYFDRAFPTD VGAALNEAVA NLFAGQGSPQ GIVDAVNEAA DGAK
|
| |
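As a rough illustration of how the gene and protein sequences relate, the following sketch translates the first and last few codons of the gene sequence and computes a GC fraction. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). Only short excerpts of the sequence are used here, so the GC fraction shown is for the excerpt, not the whole gene. The function names are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, bases ordered T, C, A, G.
# Translation table 11 uses these same assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' is emitted for stop codons."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# Excerpts copied from the Gene sequence field (first 30 and last 15 bases).
head = "ATGCGCTTAA CCCGAGCACT GGCCGCCGCA"
tail = "GACGGGGCGAAGTGA"

print(translate(head))  # start of the protein
print(translate(tail))  # end of the protein, followed by the stop codon
print(round(gc_fraction(head), 3))
```

Running the same functions on the full Gene sequence field should reproduce the Protein sequence field followed by a trailing `*`, and a GC fraction close to the reported 71%.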