Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_4872 |
Symbol | |
ID | 8329070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 5810924 |
End bp | 5813152 |
Gene Length | 2229 bp |
Protein Length | 742 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 644945312 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003102544 |
Protein GI | 256378884 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.703586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG ACCAGCCCAC CGTGCCCGCG TCAGTGGCCG ACCAGGCCGC GCTCGGCAGC GGCGCGGACA TGTGGACCAC GAAGGCGGTC GGCGACGTGC CGTCCCTCTT CGTCACCGAC GGCCCGCACG GCCTGCGCAA GCAGACCGGC GACACCGACA ACCTCGGCAT CGGCGGCAGC GTCCCCGCGA CCTGCTTCCC GCCCGCCGTC GGCCTCGCGC AGAGCTGGGA CGCCGACCTC GTCGAGCGGG TCGGCCGGGC GCTGGGGGAG GAGTGCCAGG CCGAGGGCGT GTCCGTCCTG CTGGGACCGG GCGTGAACAT CAAGCGCGAC CCGCGCTGCG GCCGCAACTT CGAGTACTAC TCCGAGGACC CGCTGCTGTC CGGCGCGCTC GGCGCGGCCT GGGTGCGCGG CGTGCAGTCG CAGGGCGTGG GCGCCTCGCT CAAGCACTAC GCGGCCAACA ACACCGAGAC CGACCGGATG CGCTCCAGCT CCAACGTCGA CCCGCGCACC CTGCGCGAGG TGTACCTGCG GCCGTTCCAG CGGGTCGTCG AGGACGCCCA GCCGTGGACG GTCATGTGCG CCTACAACCG GATCAACGGC GTGTACGCGT CCGAGGACCG CTGGCTGCTC ACCGACGTGC TGCGCGGCGA GTGGGGCTTC GAGGGCGCCG TGGTCAGCGA CTGGGGCGCG GTGCGCGACC GGGTGGCCGC CGTGTCGGCC GGGCTCGACC TGGAGATGCC CGGCGGCGGC GACACCGACG CGGACGTCGT CGCGGCCGTC GAGGCGGGCG GGCTGGACCC GGCCGTGGTG GAGCTGGCCG CGACGCGGGT CGCCGCGCTG GCCGCCAAGG GCGTCGCAGG CCGCCGCTCC GACGTCGTGC TGGACGTCGA CGCGCACCAC GCGCTGGCCC GCGAGGTCGC CGCGCGCTGC GTCGTGCTGC TCAAGAACGA CGGTGCCGTG CTGCCCCTCG CCCCCGGCTC GTCCGTCGCC GTCATCGGCG CGTTCGCGCA GGCCCCGCGC TTCCAGGGCG GCGGCAGCTC CCACGTGAAC CCCACCAGGG TGGACGTGCC GCTGGACGAG ATCCGCCGCC ACGCCCCCTC GGCCACCTTC GCCGCGGGCT TCACCACCGA CGGCTCCGGC GACGCGGCGG CCCTGCGCGC CGAGGCCGTC GCCGCCGCGG GCGCCGCCGA GTCCGCCGTG CTGTTCCTGG GTTTGGCCGC CGACCAGGAG TCCGAGGGCT TCGACCGCGA GCACATCGAG ATCCCCGCCG AGCAGGTCGA ACTGCTGGCC GCCGTGCTCC AGGCCCAGCC CCGCACCGCC GTCGTGCTCT CCCACGGCGG CGCGCTGCGC CTGGCCCCCC TCGCGGGCGC GCCCGCCCTG CTGGACGGCG CGCTGCTCGG CCAGGCGGGC GGCGCGGCCG TCGCGGACGT GCTGTTCGGC GCGGTCAACC CGTCCGGCAG GCTCGCCGAG ACCGCGCCTA CCCGGCTGGA GGACACCCCG GCGTTCCTGA ACTTCCCCGG CGAGCGCTCG CAGGTCTACT ACGGCGAGGG CCTGCACGTC GGCTACCGCT GGTACGACGC CCGCGACGCG GCAGTCGGCT TCCCGTTCGG GCACGGCCTG TCGTACACCA GCTTCGAGCA CCGGGACCTG ACCGTCTCCG CCTCCGAGGC CGGGATCACG GCGTCGGTCG CGGTGGTCAA CACCGGCGAG CGGGCCGGGC GCGAGGTCGT GCAGTTCTAC GTGTCGGTCC CCGGCTCGTC GGTGACGCGC CCCGTGCGCG AGCTGAAGGG CTTCGCCTCG CTGGACCTGG AGCCCGGCGC CGAGGGCCGC GTCGAGGTCC TGCTGCGCGC CGCCGACCTG TCCTACTGGG ACGTCGCGGC GGACCGGTGG GTGCTGGAGA GCGGCGAGTA CGAGGTGACC GTCGGCGCGT CCAGCCGCGA CCTGCGCGCG TCCGCCACCG TCGCGGTGAC CGGCGACGAG ACCCCGGTCC CGTTCACCCC GGAGTCGACC CTGGGCGAGG TCCTGGCCGA CGCCGCGGGC GCCGAGGCGT TCGGCGGCCT GCTGACGGGC GTGTTCGGCT CGTCCGAGTC GTCCGAGGAC GCGCTCGGCA TGGACATGGC GAAGATGATG GCGTCGATCC CGCTGGAGCG CTTCGTGAGC CTGTCCGGCG GCAAGCTGAC CCGAGCCGGG CTGGCGGACA AGCTCGCCGA GGTGAACGCG GCCAGGTGA
|
Protein sequence | MSTDQPTVPA SVADQAALGS GADMWTTKAV GDVPSLFVTD GPHGLRKQTG DTDNLGIGGS VPATCFPPAV GLAQSWDADL VERVGRALGE ECQAEGVSVL LGPGVNIKRD PRCGRNFEYY SEDPLLSGAL GAAWVRGVQS QGVGASLKHY AANNTETDRM RSSSNVDPRT LREVYLRPFQ RVVEDAQPWT VMCAYNRING VYASEDRWLL TDVLRGEWGF EGAVVSDWGA VRDRVAAVSA GLDLEMPGGG DTDADVVAAV EAGGLDPAVV ELAATRVAAL AAKGVAGRRS DVVLDVDAHH ALAREVAARC VVLLKNDGAV LPLAPGSSVA VIGAFAQAPR FQGGGSSHVN PTRVDVPLDE IRRHAPSATF AAGFTTDGSG DAAALRAEAV AAAGAAESAV LFLGLAADQE SEGFDREHIE IPAEQVELLA AVLQAQPRTA VVLSHGGALR LAPLAGAPAL LDGALLGQAG GAAVADVLFG AVNPSGRLAE TAPTRLEDTP AFLNFPGERS QVYYGEGLHV GYRWYDARDA AVGFPFGHGL SYTSFEHRDL TVSASEAGIT ASVAVVNTGE RAGREVVQFY VSVPGSSVTR PVRELKGFAS LDLEPGAEGR VEVLLRAADL SYWDVAADRW VLESGEYEVT VGASSRDLRA SATVAVTGDE TPVPFTPEST LGEVLADAAG AEAFGGLLTG VFGSSESSED ALGMDMAKMM ASIPLERFVS LSGGKLTRAG LADKLAEVNA AR
|
| |