Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_4021 |
Symbol | |
ID | 8328214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 4720433 |
End bp | 4721644 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 644944493 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003101730 |
Protein GI | 256378070 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0221723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGCAGC ACGGCAGGGA CCCCCGGCGC AGCCCGGCGC GGCGCGGCGC GTCCGCCGCC AAGCTCGGGT CCGTCGCGCT CGGCGCGGTG CTCGCCGCCT CCGTGCTGAC CGGCGTCGCG CTCAACCGGG ACGACACCGT CACCGGCGCC GCCGTCCCCG AGGGCGGGAC CACCGAGACC GGCGAGACCA CGACGACCAC GACCTCCACC CCGCCGCCCG ACCCGTGCGC CCCCGTCCTG GCCGGGCTGA CCCCGCGCGC CGGGCTCGCC CAGCTGCTCC AGGTCGGCGT CAACCCGCGC GGCCCGCAGG ACGCGCTGTC CATCGTCGGC TCCGAGCAGG TCGGCGGCAT CTTCGTCGGC GGCGACGACG TCGGCCTGCT GTCGGGGGAC GCGCTGGCCG CCGTGCACGC CGCCTCCACG CTGCCGCTCA CCGTCTCGGT GGACGACGAG GGCGGCCGGG TGCAGCGGAT CGACGCGCTC GACGGCGACA TCCCCAGCGC CCGCACCATG ACCCGCACCC TGTCCACCGA GCAGGTCCGC GAGCTGGCGC GCAAGCGCGG CGAGGCGATG AAGGCGCGCG GCGTCAACAC CGACCTCGCC CCCGTGCTCG ACCTGACCTC CCAGGCCGCG AACACCGTGA TCGGCGACCG CTCGTTCAGT GTCGACCCGG CCACCGCCGT CTCCTACGCC GAGGCGTTCG CCGAGGGCCT GCGCCAGGCC GGGGTCGTCT CGGTGGTCAA GCACTTCCCC GGCCACGGCA ACACCTCCGG CGACTCGCAC CTCGGCTCGG TCACCGCGCC CCCGCTCGCC CAGCTGCGCG CCCACGACCT CGCGCCCTAC CGGGAGCTGC CCAGGTTCGG CGAGGACGTG CAGGTCATGG TCGGCCACAT CGCCGTCCCC GACCTGACCG GCGGCCTGCC CGCGAGCCTG AGCCCGGCCG CCTACGAGCT GCTGCGCGGC GAGTTCGCGT TCGACGGCCT GGTCATGACC GACGACCTGG GCGCGATGCG CGCGGTGACC GACCTGGCCG ACCTGCCCGA CGCGGTGCTG CGCGCGCTGG TCGCGGGCGC GGACGTGGCG CTGTGGTCGT CCGGCGGCCG GGTCGGCGAG GTGCTCGACC GGCTGCAGGC CGCCGTCGCG AGCGGCGAGC TGAGCGCCGA GCGGGTGGAC CGCTCGCTGC GCCGCGTGCT CAAGTCCAAG CACCTCTGCT AG
|
Protein sequence | MEQHGRDPRR SPARRGASAA KLGSVALGAV LAASVLTGVA LNRDDTVTGA AVPEGGTTET GETTTTTTST PPPDPCAPVL AGLTPRAGLA QLLQVGVNPR GPQDALSIVG SEQVGGIFVG GDDVGLLSGD ALAAVHAAST LPLTVSVDDE GGRVQRIDAL DGDIPSARTM TRTLSTEQVR ELARKRGEAM KARGVNTDLA PVLDLTSQAA NTVIGDRSFS VDPATAVSYA EAFAEGLRQA GVVSVVKHFP GHGNTSGDSH LGSVTAPPLA QLRAHDLAPY RELPRFGEDV QVMVGHIAVP DLTGGLPASL SPAAYELLRG EFAFDGLVMT DDLGAMRAVT DLADLPDAVL RALVAGADVA LWSSGGRVGE VLDRLQAAVA SGELSAERVD RSLRRVLKSK HLC
|
| |