Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_1531 |
Symbol | |
ID | 6263345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 1623255 |
End bp | 1624691 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642612018 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001876415 |
Protein GI | 187251933 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000631966 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 5.75811e-19 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAATCCTG CAAGAATAGT GCACCCCGGC TTTTGGTTTG GGAAAACAGA TATTGAAGAC GCCAGAAAAT GGGCAAAAAT GGGCGTAGGC GGATTTTGCG TCTACGGTGG AACCAGAGAA GAGATTGAGA CATTTTGTAA AGAAATGAGG GGCCTTTCCC CTTACGCGGA AATTTTTATT TCAGCCGATT ATGAAGACGG CCTTGGAAGG TGGATAAAAG GGGCCGAGCT TTTGCCTTCT AACATGGCTA TAGGCGCCTC CGGAGAGGAG GAACTTGCCA TGAAAAAAGG GTTAATCACC GCCAGGCAGG CAAGAAGCAT CGGAATTAAC TGGATTTTTG CCCCTGTGGT TGATTTAGCT TCGGACCCGG AAAACCCTAT AGTAAATACC CGCGCTTTCG GAAAAGACCC TATGCTTGTG ACGCGTTTGG CCATGGCTTT TATGTCAGGA TTATCGCAAG GCGGAACTTT AAATACTTTA AAACATTTTC CCGGACACGG GGACACGTCA AAAGATTCTC ACTTAGAACT ACCTTTTATC AGCAAATCTT TTGACAAGCT TTTTGATTCC GATTTAGTTC CCTATAAAAC ATTGTTAAAG TTTGCTGACT CAATTATGGT TGGACATCTT CTTATCCCAG CCATAGACGA TGAAAACCCG TCTTCTTTAT CGGAAAAAAC AATACGCGGA ATTTTAAGGC AAAAACTTAA TTATAAAGGA TGTGTTGTTA CCGACGCTCT TTTAATGAAA GCCATCGGCG ACCAAAAAGA AGCCGCTTTA AAAGCTTTAA AAGCGGGCGC GGATATCTTG CTTGCACCTT CAGACCCTTA TGAAATAATA GATTATTTAA ACCAGTTAAT TAAAGAAGAT TACACCTGGA AAGAACATTT TATCAACGCA GTGGCCACGC AAGAAATTCT GCTTACAAAA AACCGGAAAG TGGAAATAAG AACTCCGGAA TATGCGTTTT TTAAATCTTC TTATTCAATG GACGCGGCGC CTAGATGTAT AACAGAGTTC GGAGAAGAGA ATGTTTTAAA AAAAGAAAAT TCTTTGTCTT ATATGGAAAT AGATTGTAAA AGCGATTTTG AAAGCACTCC TTTTGCCAAA CAGCTTAAAG CTAACGGTTT TAAACTGGCC CCTTATACAG GCGGGGAATG TAAAAATTTG CTTATAGTTT CTTTCTCCGG CTACGCTTCT TTTAAAGGCT TTGCTAATTT TACAAAAGAG CAAAAGAAAA CAGTGGAAAA CGCCTTAACA AAAGCCAAGA ACAGCGCTTT TGTTTCCTTC GGCAGCCCTT TTGTGCACAG TGATTTTAAA ACAAAAGCAC AGTACCATTT GCTTGCGTAC TGTGCTAATG AGGACTTTCA AATTTTTTGC GCCGACGCGC TTTGCGGTAA AGCCAAAGTT ACTGGCAAAG CTCCTATTGA AATTTAG
|
Protein sequence | MNPARIVHPG FWFGKTDIED ARKWAKMGVG GFCVYGGTRE EIETFCKEMR GLSPYAEIFI SADYEDGLGR WIKGAELLPS NMAIGASGEE ELAMKKGLIT ARQARSIGIN WIFAPVVDLA SDPENPIVNT RAFGKDPMLV TRLAMAFMSG LSQGGTLNTL KHFPGHGDTS KDSHLELPFI SKSFDKLFDS DLVPYKTLLK FADSIMVGHL LIPAIDDENP SSLSEKTIRG ILRQKLNYKG CVVTDALLMK AIGDQKEAAL KALKAGADIL LAPSDPYEII DYLNQLIKED YTWKEHFINA VATQEILLTK NRKVEIRTPE YAFFKSSYSM DAAPRCITEF GEENVLKKEN SLSYMEIDCK SDFESTPFAK QLKANGFKLA PYTGGECKNL LIVSFSGYAS FKGFANFTKE QKKTVENALT KAKNSAFVSF GSPFVHSDFK TKAQYHLLAY CANEDFQIFC ADALCGKAKV TGKAPIEI
|
| |