Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0085 |
Symbol | |
ID | 6263945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 90391 |
End bp | 92016 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 642610546 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001874988 |
Protein GI | 187250506 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.841393 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0151072 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC TATTTTTTGT TGTATTTTTT ACGGCTGTTT TTGCCCACGG GGCTGTGCCT GATTTTGATT CCCTTACATT GGACCAGAAA TTGGGACAGA CGCTTGTTGC TTTTGTCGAT ACGGACAATG CTCATAAATA CCAAAGTGCT ATAGAAAAAG GGTTGGTTGG CGGTGTGCTT GTGCAATGGG GCAACTACTC TTTAGAGCAA ACGACCGAGC TTGCGGCCAA ATTACAAAGC TGGGCGGCCA AATCCCCGCA TAAAATACCT TTATTAATTT CAATAGATTA TGAAGGCGGC ACGGTTTATA CTCCGGTTAC TTTAGGTTTT GAATATCTTC CGCCAAACAT GATGATAGCC GCCGCTAATG ATGAGGAAGC AGCGGCCAGA ATATTTTATC TTGCCGGCCT TGAACTTAGA AAAGCGGGTA TACATATAAA CTTTTCGCCC GTGGTTGACG TTAACATTAA TCCGGGCAAC CCTATAATAG GGGTACGCTC TTTCGGCTCC TCGCCGGAAT TAGTGGGACG TATGGGCGCG GCTGTTGTAA GTGGGCTTAG CGCGGCGAAC GTAATGTCCG TGGCGAAACA TTTTCCCGGC CACGGCAACA CTGTTTTGGA TTCTCATTAC AGCCTTCCTG TTTTAAACAT AACAAAAAAA GAAATGCAGG ATGTTCATCT GGCTCCATTT AAAAAAGCAA TAGAAGCAGG TGTGCCGGGT ATAATGACGG CTCATATTAT TTATAAAAAT TATGACCCCA AAAATCCCGC CACATATTCC AAAAGGATAT TAAATGATTT ATTGCGTACG GAGATGAAAT TTAAAGGCGT AATTATATCA GACGCGCTTG ATATGAAAGG CGCTACCTTA GACGGCAACA TCGCTTTAAG CGCGGCTAAG ACGCTTGAGG CGGGTTCCGA TATGGCGCTT TTGGGCAGGT TTTTAAACGC GGATAAAACT TTTAATAAAA TTTACGGTTA TGTGGGAACG GAACTTTCAC AAAAAAGAAT TGAAGAAGCT TCTAAAAAAA TACTTGATTT AAAAAAACAA ATGGGTTTGT TTGACGAACA GAAAGAACCT TTTACCTCCA CTTCCAAAGC TTACGCCGCC GCGGCGGAAG TTATAGCCAA AAAATCAGTA ACCGTTTTAA GAAATAAAAA TAATAAAATT CCGTTAAAAG AAGAGTTTGC TAACACTCCG GGTAAAAAAG TGTGCGCCGT GTTTTTTGCC CCTACAAGAT TCGCGGAGGA AATAACTTCG TTTAACAAAC CGTTTTTGGA AAAGGGATGG AAGGTAAATT ATTATAACGC TATTATGAAA CCCACAAGCA AAGATTTAAA ACGCGCAAGA GAATGCGCTA AAGGAGCGGA CCTTTTTGTT ATAGGAACTT TACAGTGGGC GGCAAAACCT TTTTACAAAC AAACAGCCGT AATAGGCACT TTGCTTGAAG AATTTCCCGA CGCGGTTGTT ATATCAACAA TGAGCCCGTA CGAAGTAAAA ACTTACCCCG GCGCTAAAAC TGTTTTATTA ACTTACGGCA TAAGCAAGCA TTCAATGAAG GCGGCGGCGG ACGTGATTGT GGGCAATATC CCCGCGCAGG GTAAGCTGCC CATAGAATTG GAATAA
|
Protein sequence | MKKLFFVVFF TAVFAHGAVP DFDSLTLDQK LGQTLVAFVD TDNAHKYQSA IEKGLVGGVL VQWGNYSLEQ TTELAAKLQS WAAKSPHKIP LLISIDYEGG TVYTPVTLGF EYLPPNMMIA AANDEEAAAR IFYLAGLELR KAGIHINFSP VVDVNINPGN PIIGVRSFGS SPELVGRMGA AVVSGLSAAN VMSVAKHFPG HGNTVLDSHY SLPVLNITKK EMQDVHLAPF KKAIEAGVPG IMTAHIIYKN YDPKNPATYS KRILNDLLRT EMKFKGVIIS DALDMKGATL DGNIALSAAK TLEAGSDMAL LGRFLNADKT FNKIYGYVGT ELSQKRIEEA SKKILDLKKQ MGLFDEQKEP FTSTSKAYAA AAEVIAKKSV TVLRNKNNKI PLKEEFANTP GKKVCAVFFA PTRFAEEITS FNKPFLEKGW KVNYYNAIMK PTSKDLKRAR ECAKGADLFV IGTLQWAAKP FYKQTAVIGT LLEEFPDAVV ISTMSPYEVK TYPGAKTVLL TYGISKHSMK AAADVIVGNI PAQGKLPIEL E
|
| |