Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_2048 |
Symbol | |
ID | 8326237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 2271015 |
End bp | 2273054 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644942599 |
Product | cellulose-binding family II |
Protein accession | YP_003099840 |
Protein GI | 256376180 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCTC CCCGCACCGC TTTGTTGGCA CTCGCCCTGA TCGCCGCACC GCTCGCCACC CCCGCGGCGG CGTCCCCGAG CGCCGATCCC CCGACGTCCC AGGCCGTCGA CACCCGCGTC TCGGTGAACA CCCGCGCGGG CCTGGAAACC GCCCCCGAGC TGGGAATGGG CGTCAACCAC GCCATCTGGG ACTCCCAGCT CGGCACCCCG CAGGTCGCCG ACCTGCTCAA GGACGCAGGC GTCCGCGCCA TGCGCTACCC CGGCGGCTCG TACTCGGACA TCTACCACTG GCGCGACCAC ACCGCCCCCG GCGGCTACGT GGCCCCGAAC ACCGACTTCG ACACGTTCAT GGCAGGCGTG CGGCGGGCGG GCGGTCAGCC GATCGTCACC GCCAACTACG GCACCGGGAC GCCGCAGGAG GCCGCCGACT GGGTGCGCAG CGCCAACGTC GACAAGGGCT ACGGCGTCAA GTACTGGGAG ATCGGCAACG AGCTCTACGG CAACGGCCAC TACGGCACCG CGTGGGAGGA GGACCACCAC GAGGACAAGA GCCCGGCCGG GTACGCGGGC CTGGTGCGCG ACTACTCGCG GGCGATGAAG GCCGTCGACC CGAGCATCAA GATCGGTGCG GTGCTGACCA CGCCCGGCGA GTGGCCGGAC GCCATCACCG CGGCCGGTGA CGCGGGCTCG TGGAACCAGG TCGTGCTGTC CACGGCGGGC GCGGACATCG ACTTCGTGGT CCTGCACTGG TACCCGCGCG GCACGAACGC CCCGGACCTG CTGAGCAGCG CGGACACCAT CCCCGAGATG ATGGCCATGG CCAAGGAGCA GATCCGCAGG CACACCGGGC GGGACCTCGG CATCGCGTTC ACCGAGCTGA ACACGAACTT CGCCCGCAAC ACCCAGCCGA CCGCGCTGTT CGCCGCCGAC GCCTACGCGT CGCTGCTGGA GCGCGGCGCG TTCACCGTCG ACTGGTGGAA CGTGCACAAC GGCCTGGGCA AGGTCACGAC GGTCGCCGGT CAGACCGACT ACGACGACTT CGGCCTGCTC TCCAGCGCCA ACTGCACCGA GGACGGCTCG GTGTGCCAGC CCGCGCTGAA CACGCCGTTC GCGCCGTACC ACGCGCTCGC GCTGACCTCC CGGTTCGCGC GCCCCGGCGA CCAGTTCGTG GCCGCCTCCA GCACCGACCC GCTGGTGCGC GCGCACGCCG CCCGACGTCC GGACGGCGGG CTGTCGGTGC TGCTGGTGAA CCGGGACCCG GACGCGGCGA GGTCGGTCGC GCTGGACTAC GCCGGGTTCA CCCCGAAGTC CTCGGCGGTG GTGCGGACCT ACGGCAACGG CGACACCGCG ATCACCACGT CCACCGGGAC GTCCGGCGCG GTCACGCTCG CCCCGTACTC GATCACCGCG CTGGAGCTGG CGCCCACGTC CGCGACCACC GGCCCGGCGA CCCCCGCCCG GCCGACCGCG TCCTCGGTCA CCGACCGCGG CGCCGTGATC ACCCTGCCCA CTGCCTCTCC CGGTCTGAAG TACGAGGTGC ACCGCCAGGT CGGCACGCGC ACCGACCAGT GGGGCGAGAC CACCGGCGGC AGCTTCACCG CCACGAACCT GTCCCCCAGC ACCGAGTACA CCGTCAACGT CATCGCGCGC GACAACGCGG GCCGGGTGTC GTGGGCTTCG CCGCCGCTGG TGTTCCGCAC CACCGCGCCC GCCTCGTCCA CCTGCGCGGT GCGGCTGGCC AACACCAACG ACTGGGGCAA CGGCTGGGTC GGCGACGTGC AGGTCACCAA CACCGGGACC ACCCCGGTGG ACGGCTGGGT GCTGGCGTTC GACTGGCCGA GCACCTGGCA GAAGGTCGAC AGCGGCTGGA GCGGCACGTG GGCGCAGTCC GGCCGGTCGG TGACGGTGAC CGCCGACGCG GGCAACCGGC TGCTGGCCCC CGGCGCGACG GTCTCGACCG GTTTCGTGGG CTCCTACCAG GGCCCGAACC GGCTGCCGAC CGCGTTCTCG CTGAACGGCG TGCCGTGCTC GCTGCGGTGA
|
Protein sequence | MRAPRTALLA LALIAAPLAT PAAASPSADP PTSQAVDTRV SVNTRAGLET APELGMGVNH AIWDSQLGTP QVADLLKDAG VRAMRYPGGS YSDIYHWRDH TAPGGYVAPN TDFDTFMAGV RRAGGQPIVT ANYGTGTPQE AADWVRSANV DKGYGVKYWE IGNELYGNGH YGTAWEEDHH EDKSPAGYAG LVRDYSRAMK AVDPSIKIGA VLTTPGEWPD AITAAGDAGS WNQVVLSTAG ADIDFVVLHW YPRGTNAPDL LSSADTIPEM MAMAKEQIRR HTGRDLGIAF TELNTNFARN TQPTALFAAD AYASLLERGA FTVDWWNVHN GLGKVTTVAG QTDYDDFGLL SSANCTEDGS VCQPALNTPF APYHALALTS RFARPGDQFV AASSTDPLVR AHAARRPDGG LSVLLVNRDP DAARSVALDY AGFTPKSSAV VRTYGNGDTA ITTSTGTSGA VTLAPYSITA LELAPTSATT GPATPARPTA SSVTDRGAVI TLPTASPGLK YEVHRQVGTR TDQWGETTGG SFTATNLSPS TEYTVNVIAR DNAGRVSWAS PPLVFRTTAP ASSTCAVRLA NTNDWGNGWV GDVQVTNTGT TPVDGWVLAF DWPSTWQKVD SGWSGTWAQS GRSVTVTADA GNRLLAPGAT VSTGFVGSYQ GPNRLPTAFS LNGVPCSLR
|
| |