Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_4612 |
Symbol | |
ID | 8328810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 5493373 |
End bp | 5495688 |
Gene Length | 2316 bp |
Protein Length | 771 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 644945059 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003102291 |
Protein GI | 256378631 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCACCG CTTCCGCGCT GTCGCCGCAG AACCGGGTCG GCCAGGTCAA CCAGCGCCTC AAGGGCTGGG AGGCGCTGCG CTGGGTCGAC GGGGCGCCTC GGGTGACCGA CGTGCTCAAG CGGGAGGTCG ACCGGTTCGG CGGCCTCGGC GCGATCTACG GCGTGCTGCG CGCCGACCCG TGGTCGGAGG TGAACTGGCG CAACGGGATT CCGCCCGAGC GCAGCGCGGA GGCGTGCGCG GCGGTGCAGG AGTACGTGAC GCGCAACGGT TGTGGCGTGC CGGTGCTGTT CGTCGAGGAG GTCCCGCACG GGTTGCAGGC GCTCGGCGGG ACGACGCTGC CGGTGAACCT CGCGCTCGGG GCGGGCATGG ACGCCGGGCT GACCGAGGAG CTGGCGGCGG CGGTGGCCGC CGAGGTCCGG GCCAGGGGCA CGCACGTGGC GCTGGTGTCG GGGCTGGACG TGCTGCGCGA TCCGCGCTGG GGGCGCGCCG AGGAGTGCTT CGGCGAGGAC CCGGCGCTGG CCGCGCTGCT GGTGGCCGCG ACCGTGCGCG GGATGCAGGG GACGGCTCCG GGTCCGATCG GCGCGGGGCG CGTCGCGGTG GTGGCCAAGC ACCTGGCGGC CCAGGGCGCC GGGATCGGCG GGCGCAACGG TTCCGGCGCG CCGATCGGGC CCAGGGAGCT GGGCGAGGTC CACCTGCGGC CCGCGCACGC GGCGGCGCGC GCCGGGGTGG CCGGGTTCAT GGCCGCCTAC AACGACGTCG ACGGCGTGCC GTGCACCGGG AACCGGGAGC TGCTGACGGG CGTGCTGCGG GAGGACTGGG GCTGGGACGG GATCGTCATG GCCGACGGCA CCGCGATCGA CCGGTTGCGC GACAGCACCC CCGACCCGGC GGCCGCGGCG GCGCTGGCCC TGCGCGCCGG GGTCGACCTG AGCCTGTGGG ACGAGGCGTT CACGCACCTC GGGGAGGCGC TGGACCGGGG GCTGGTGGCG GAGGCCGAGC TGGACCGCGC GGTCGACCGG GTGCTCGCGC TCAAGCGGCG GGTCGGGCTG CTGGACGAGC CGGCGGCGTC CGGGCCTGCG GCGTCGGGGC CGGCGGCGTC GGTGCCGGTG GCGTCCGGGC CGGTGGCGTC GGGGCCGGTG GTGGACCTGC CCGCGTCGCG TAACGTGGCC CGGCTGGTCG ACCGGGCCGC CCGGCAGGCC GTGGTCCTGG TGCGGGACGA CGGGGTGCTG CCGCTCGACC CGTCCGGCGT GGTCGCGCTG ATCGGCCCGA ACGCCGACGA CCTGGACGCG CAGCTCGGCG ACTACACCCC GCCTCGGCCC GCCGACGACC CCGGCGCGTC GACCGTGCGC TCGGCGCTCG TGGCGCGGCT GGGCGAGGAG CGGGTGCCGC ACGCGCCCGG CTCGCGGGTG CGCAGCGCGC TCGGCCCGGA CGCGCTCGCG GCGGCGCGGG ACGCGGTCGA CCGCGCCGAC GTCTGCGTGC TGGTGCTCGG CGGCACGAGC AAGCGGAGCT ACGACGACGA GTTCGCCGAC AACGGGGCGG TCGCGGAGTC GGCGGCGGAC ACGACCAACG GGGAGGGCGT CGACCTGGCC TCGATCGCGC TTCCCCTGCC GCAACTGGAA CTCGCGCGTG CGGCCCGGTC GAGCGGCAAG CCGGTCGTGG CCGTGGTCGT CGACGGCCGA CCGCGCGCGC TCACCGAACT GGCCGGGCTG GTGGACGCGC TGCTCGTGGT GCCCTTCCCC GGCCCGAGCG GGGGTGCGGC GGTCGTGGGC GCGCTGCTCG ACGGCACCGC GTCGGGTCGG CTGCCCGCGT CGTTCCCGGT GGCCGACGGG GTCTTCCCGG TGGCGCACGA CGAGCGGGTG GAGACCGCGC GCGGGTACGC CGACCAGCGG CGCCCGGTGG GCATCCCCTT CGGCAGCGGT TCGCCGCCCT CGGTCACCAC GCGGGTCCGC GAAGGCGAGC ACCGGATTTC CGCCGCCGCG CTGGAGAGCG GCGGTTCGCT GCGGGTCGCG GTGGAGGTGG TGAGCACCGG CGGGCCGCGA TCCGTGGCCG TGCCGCTCTA CGGCCGCCGC CACGAGCTGG GCGTGCGCCC GCGCCGCCGG ACCCTGCTGG CCGTGCGGCG CGTGCTGTGC GAGCCGGGCG AATCCGTGGT GGAGTTCGCG CTGGGCCTGG ACGAGCTGGG CTCGTGGGCC ACCGGGCGGC CGGTCGCGCT CCCGGTGGAG ATCGGCGCGT GGAGCGGCGA CGAGGTGGAC GAACCGGCTG ACGCGGTGCG GATCAGCGTC ACCGACGAGG GAGGGAGCAC GCTGTGGCGA CGGTGA
|
Protein sequence | MITASALSPQ NRVGQVNQRL KGWEALRWVD GAPRVTDVLK REVDRFGGLG AIYGVLRADP WSEVNWRNGI PPERSAEACA AVQEYVTRNG CGVPVLFVEE VPHGLQALGG TTLPVNLALG AGMDAGLTEE LAAAVAAEVR ARGTHVALVS GLDVLRDPRW GRAEECFGED PALAALLVAA TVRGMQGTAP GPIGAGRVAV VAKHLAAQGA GIGGRNGSGA PIGPRELGEV HLRPAHAAAR AGVAGFMAAY NDVDGVPCTG NRELLTGVLR EDWGWDGIVM ADGTAIDRLR DSTPDPAAAA ALALRAGVDL SLWDEAFTHL GEALDRGLVA EAELDRAVDR VLALKRRVGL LDEPAASGPA ASGPAASVPV ASGPVASGPV VDLPASRNVA RLVDRAARQA VVLVRDDGVL PLDPSGVVAL IGPNADDLDA QLGDYTPPRP ADDPGASTVR SALVARLGEE RVPHAPGSRV RSALGPDALA AARDAVDRAD VCVLVLGGTS KRSYDDEFAD NGAVAESAAD TTNGEGVDLA SIALPLPQLE LARAARSSGK PVVAVVVDGR PRALTELAGL VDALLVVPFP GPSGGAAVVG ALLDGTASGR LPASFPVADG VFPVAHDERV ETARGYADQR RPVGIPFGSG SPPSVTTRVR EGEHRISAAA LESGGSLRVA VEVVSTGGPR SVAVPLYGRR HELGVRPRRR TLLAVRRVLC EPGESVVEFA LGLDELGSWA TGRPVALPVE IGAWSGDEVD EPADAVRISV TDEGGSTLWR R
|
| |