Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_2049 |
Symbol | |
ID | 8326238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 2273320 |
End bp | 2274330 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644942600 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_003099841 |
Protein GI | 256376181 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCA GGAGCAGGAG GGGCCTGGTC GCGATCAGCG CGGCGCTGTG CCTCGCGGTG TCCGGCTGCG GGCTGGAGAG CTCGTTCGCG CTGCCGTTCG AGGTGGCGCC CGGCTCGATC AGGCCGGAGC CCGCGCTGGA GGGCGTGAAG GTCTCGGTGG GGTCCAAGGA CTTCACCGAG AACATCGTGC TCGGCTACAT CACCGAGGTC GCGCTGGCGG CGGCGGGCGC CGAGGTCAAC GACATGACCA ACATCCAGGG CTCGAACAGC TCGCGGCAGG CGCTGCTGAC CGGTGACGCC GACCTGTCCT GGGACTACAG CGGGACGGGC TGGATCAGCT ACCTGGGCAA CACCGAGCCG ATTCAGGGCG AGCGCGAGCA GTACGAGGCG GTGCGCGAGG CCGACCTGGA GCGCAACGGC CTGGTGTGGC TGGACTACAC GAAGGTCAAC AACACCTACG CCTTTGCCGT CACGCGGGAG TTCGCCGAGG CCAACAACCT CAGCACCACG AGCCAGATGG CCGAGCTGGT GAAGAACGAC CCCGGCAAGG GCGTGTTCTG CCTGGAGACC GAGTTCATCA GCCGCAACGA CGGGTTCCCC GGCGTGGCGC AGACCTACGG CTTCGACGCG GGCGCGGCGC AGGTGAAGAC CTTCGGCAGC GGCACGATCT ACACCGCGAC CTCCGACGGG CTGTGCAACT TCGGCGAGGT GTTCACCACC GACGGCCGCA TCCTGGCGCT GGACCTGGTG GTGCTGGAGG ACGACAAGAA GTTCTTCCCC CAGTACAACG CCTCCCTGGT GCTGCGCGAG GAGTTCCACG AGCAGCACCC GGAGATCGAG CGGATCATGA CGCCGGTGTT CGAGAAGCTG GACAACGACA CGATCATCCG GCTGAACGCC GAGGTGGACG TGGACGGCCG CGACCCGGCG GCGGTGGCGC GCGACTGGAT GGTGGGCGAG GGGTTCGTGT CGATCCCCGA CGACACCATG GCGGCGGGCG CGCGGCGCTA G
|
Protein sequence | MSGRSRRGLV AISAALCLAV SGCGLESSFA LPFEVAPGSI RPEPALEGVK VSVGSKDFTE NIVLGYITEV ALAAAGAEVN DMTNIQGSNS SRQALLTGDA DLSWDYSGTG WISYLGNTEP IQGEREQYEA VREADLERNG LVWLDYTKVN NTYAFAVTRE FAEANNLSTT SQMAELVKND PGKGVFCLET EFISRNDGFP GVAQTYGFDA GAAQVKTFGS GTIYTATSDG LCNFGEVFTT DGRILALDLV VLEDDKKFFP QYNASLVLRE EFHEQHPEIE RIMTPVFEKL DNDTIIRLNA EVDVDGRDPA AVARDWMVGE GFVSIPDDTM AAGARR
|
| |