Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_3216 |
Symbol | |
ID | 8327406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 3748226 |
End bp | 3750175 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644943730 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_003100970 |
Protein GI | 256377310 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGTCAC CCACCCACGG GCGCGGACGG CGGTGGCTGG CCGCGCTCGC GGCCGTCCCG CTGGTCGCCG CCACCCTCGC GGCCTCCGCG CTCGCCCCGC CCGCCGCCTC CGCCGCACCG CCCGCCAAGG ACTGGCTGCA CGTCCAGGGC AACCGGATCG TCGACGCGGC GGGCAACCGC GTCCAGCTCA CCGGAGCCAA CTGGTTCGGC TTCAACGCCA CCGAGCGCGT CTTCCACGGC CTGTGGTCGG CCAACATCAC CGAGATCACC AAGGCGATGG CCGATCGTGG CGTCAACCTG GTGCGCGTGC CCGTGTCCAC CCAGCTGCTG CAGGAGTGGA AGGAGGGCAG GACCGTCGCC AAGCCGAACA TCAACGACTA CGCGAACCCC GAGCTGGCGG GGATGAACAA CCTCCAGATC TTCGACTTCT GGCTGCGGCT GTGCGAGCGG TTCGGGCTCA AGGTGCTGCT GGACGTGCAC AGCGCCGAGG CCGACAACTC CGGCCACGTG CACCCCGTCT GGTACAAGGG CTCGGTCACG CCGGAGGTGT TCTACTCCAC CTGGGAGTGG GTCACCGCGC GGTACGAGGA CAACGACACG ATCATCGCCG TCGACGTCAA GAACGAGCCC CACGGCGGTC CCGGCGACTC GCCGCGCGCC AAGTGGGACG GCTCGACCGA CGTCGACAAC TGGAAGCACA CCTGCGAGAC CGCGGGCCGC CGCATCCTCG CGATCAACCC CGAACTGCTG GTGCTGTGCG AGGGCAACGA GGTCTACCCG CGCCCCGGCA AGGGCTGGGA CGCGCCGGAC ACCAACCCCG ACCGCACGCC CAACTACTTC CACACCTGGT GGGGCGGCAA CCTGCGCGGC GTCGCCGAGC ACCCGGTGAA CCTGGGCGCG AACCAGGACC AGCTCGTCTA CTCCCCGCAC GACTACGGCC CCCTGGTGTT CAACCAGCCG TGGTTCGACA AGCCGTTCAC CAAGGAGTCG CTGATCACCG ACGTGTGGCG GCCCAACTGG CTGTACCTGC ACGAGCAGGG CGCCGCGCCG CTGCTGATCG GCGAGTGGGG CGGGCGGCTC GGCGAGGACG AGCGGCAGGA CCGGTGGATG GCCGCGCTGC GCGACCTGAT CGTGGAGGAG GGGCTGCACC AGACGTTCTG GGCGCTCAAC CCGAACTCCG GCGACACCGG CGGGCTGCTG CTCGACGACT GGAAGACCTG GGACGCGGCC AAGTACGCGC TGCTCAAGCC CGCGCTGTGG CAGCACGCAG GCAAGTTCGT GAGCCTGGAC CACCAGGTGC CGCTCGGCGG CGAGGGCTCC ACCACCGGCA TCAGCCTCGC CCAGCGCTAC GGCGACGGCG GCGGCCCCGG CGACACCGCG GCCCCGACCG CGCCGACCGG GCTGGCGATC GGCACGACCA CCGCCTCGTC GGTCGCGCTG AGCTGGCAGG CCGCGACCGA CGACGTCGGC GTCACCGGGT ACGACGTGTA CCGGGGCGGC GCGAAGGTCG GCACGAGCGC CACGACCTCC TACGTGGACA CCGGGCTCAG CGGCGGCACC ACCTACAGCT ACTCGGTGCG GGCCAGGGAC GCGGCGGGCA ACACCTCGGC GGCCTCGGCC TCCCGCAGCG CGACGACCCC GCCGGGCGGT GGTGGCGGTG ACGCCGGGTG CGCGGCGGTG CTGCGCGTGG TCAACAGCTG GCAGGGCGGC TACCAGGGGG AGGTCACCGT GACCAACTCC GGGACGGCGG CCACGCGGGG CTGGAAGGTC GCGCTGACGA CCGCGTCCGG CACCACGATC TCCAGCGTGT GGAACGGGAC CTACGCCTCG GGGACGGTCG TGAACGCCGC GCACAACGGC GCGCTGGCCC CGGCGGGCAG CACCACGTTC GGGCTGACCG GGACGGGGCA GGCGACCGGG GTGACGATCA CCGGCTGCAC GGCTTCCTGA
|
Protein sequence | MLSPTHGRGR RWLAALAAVP LVAATLAASA LAPPAASAAP PAKDWLHVQG NRIVDAAGNR VQLTGANWFG FNATERVFHG LWSANITEIT KAMADRGVNL VRVPVSTQLL QEWKEGRTVA KPNINDYANP ELAGMNNLQI FDFWLRLCER FGLKVLLDVH SAEADNSGHV HPVWYKGSVT PEVFYSTWEW VTARYEDNDT IIAVDVKNEP HGGPGDSPRA KWDGSTDVDN WKHTCETAGR RILAINPELL VLCEGNEVYP RPGKGWDAPD TNPDRTPNYF HTWWGGNLRG VAEHPVNLGA NQDQLVYSPH DYGPLVFNQP WFDKPFTKES LITDVWRPNW LYLHEQGAAP LLIGEWGGRL GEDERQDRWM AALRDLIVEE GLHQTFWALN PNSGDTGGLL LDDWKTWDAA KYALLKPALW QHAGKFVSLD HQVPLGGEGS TTGISLAQRY GDGGGPGDTA APTAPTGLAI GTTTASSVAL SWQAATDDVG VTGYDVYRGG AKVGTSATTS YVDTGLSGGT TYSYSVRARD AAGNTSAASA SRSATTPPGG GGGDAGCAAV LRVVNSWQGG YQGEVTVTNS GTAATRGWKV ALTTASGTTI SSVWNGTYAS GTVVNAAHNG ALAPAGSTTF GLTGTGQATG VTITGCTAS
|
| |