Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_3335 |
Symbol | |
ID | 8327525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 3911164 |
End bp | 3912900 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644943846 |
Product | Cellulase |
Protein accession | YP_003101086 |
Protein GI | 256377426 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.59527 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACTCGC ACCCGCGCGT AACCCGGTGG TCCACCCGCC TGCTCGCCCC AGCCCTCGCG CTGCTGATGG CGGCGGGCTC GCTCACCGCC ACCGCCACCG CCTCCCCGGC GCCGGCGCCC GCGCCCGCGT CCCGCGACGG CGTCGCGCCC GCCGACGCGC GGGCGACGGT CGCGGCCATG CAGCCGGGGT GGAACCTCGG CAACACCCTG GACGCCATCC CCGACGAGAC CTCGTGGGGC AACCCCCGCA CCACCAGGGC GCTGCTGCGC CACGTGCGCG AGCAGGGCTA CCGCAGCGTC CGCCTGCCGA TCACGTGGAG CAACCACCAC GGTCCCGCGC CGGACTTCAC GATCGACCCG GCGTGGCTGG CCCGCGTCCG CGAGGTCGTG GACTGGTCGC TGGCCGAGGG CCTGCACGTG ATGGTGAACC TGCACCACGA CTCGTGGCAG TGGCTCAACG CCTACCCGAC CGACCGCGCG AACGTGCTGG CCCGCTACAC CGCGCTGTGG AAGCAGATCG CCACCGCCTT CCGCGAGCAC CCCTCGAAGC TGGTGTTCGA GAACATCAAC GAACCCCAGT TCGCGGGCAC CTCCGGCGAC GAGGAGGACT ACCGGGCGCT GCACGAGCTG AACGCCGAGT TCCACCGCGT CGTGCGCGGC TCCGGCGGCG GCAACACCAC CCGCCTGCTG GTGCTGCCCA CCCTGCACAC CAGCGCCGAC CAGGGCCGCC TCGACGCGTT CCTGGCGTCC TACGACCTGC TGCGGGACAC CAACATCGCG GCGACCACGC ACTTCTACGG GTTCTGGCCG TTCAGCGTGA ACATCGCAGG CCACACCCGG TTCGACGCGG AGGTGGAGCG GGACCTGGTC GACACGTTCG ACCGGGTGCG GCGGAGCCTG GTGGACCGGG GCATCCCGGT GATCGTCGGG GAGTGGGCGC TGCTCAACCA CGACCACACC CGGCCGGGGG TGATCGAGCG GGGCGAGTTC CTGAAGTTCC TCGAAGCCGT GGGCCACCAC GCCCGCACCC GCGGCCTGAC CACGATGCTG TGGGACGCCG GGCAGTTCCT CGACCGCACC GCGCTGCGCT GGCGCGACCA GGGCCTGCAC GACCTGGTCG AGTCGAGCTG GACCACCCGC TCCGGAACGG CCGCGAGCGA CCGGATCCAC CTGGCGCGCA GCCAGCCGAT CACCGCGCGG TCGCTCGCGC TCAACCTCAA CGGCACGGCG CTGCGCGAGG TGCGCTCGGG CGGGCGGACC CTGGCCAGGG GCTCGGACTA CACCGTCTCC GGCAGCACCC TGACCTTCAG CGCCGCCGCC CTCACCAGGT TGGCGGGCAA CCGGGCCCAC GGCGTCAACG CGACCGTCGA GCTCCGGTTC TCTCAGGGCG TCCCGTGGCC GGTCGACGTG ATCAGCAGCG ACCGGCCGGT CCAGTCCGCG GCCGAGGGCA CGACCGCCGC GTTCGCGATC CCCACCCGGT TCCTGGGCGA CCAGCTCGCC ACCATGGAGG CCGCGTACGC CGACGGCAGC CCGGCCGGGC CGGCGAACTG GACGACGTAC AAGGAGTTCT GGCAGCACTT CCAGCCCGAC CCGGCGGGGA ACCGGATCAT CCTCAAGCCG GAGTTCTTCG CCGAGGTCGC GGACGGGGTG GTCGCCCTGA CGTTCCACTT CTGGAGCGGC GAGCGCGTCG CGTACCGCGT CACCAAGGCC GGGGACCGGG TCACCGGGAC CCCGTGA
|
Protein sequence | MNSHPRVTRW STRLLAPALA LLMAAGSLTA TATASPAPAP APASRDGVAP ADARATVAAM QPGWNLGNTL DAIPDETSWG NPRTTRALLR HVREQGYRSV RLPITWSNHH GPAPDFTIDP AWLARVREVV DWSLAEGLHV MVNLHHDSWQ WLNAYPTDRA NVLARYTALW KQIATAFREH PSKLVFENIN EPQFAGTSGD EEDYRALHEL NAEFHRVVRG SGGGNTTRLL VLPTLHTSAD QGRLDAFLAS YDLLRDTNIA ATTHFYGFWP FSVNIAGHTR FDAEVERDLV DTFDRVRRSL VDRGIPVIVG EWALLNHDHT RPGVIERGEF LKFLEAVGHH ARTRGLTTML WDAGQFLDRT ALRWRDQGLH DLVESSWTTR SGTAASDRIH LARSQPITAR SLALNLNGTA LREVRSGGRT LARGSDYTVS GSTLTFSAAA LTRLAGNRAH GVNATVELRF SQGVPWPVDV ISSDRPVQSA AEGTTAAFAI PTRFLGDQLA TMEAAYADGS PAGPANWTTY KEFWQHFQPD PAGNRIILKP EFFAEVADGV VALTFHFWSG ERVAYRVTKA GDRVTGTP
|
| |