Gene Emin_0085 details

Gene Information

Locus tag: Emin_0085
Symbol:
ID: 6263945
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 90391
End bp: 92016
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 42%
IMG OID: 642610546
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001874988
Protein GI: 187250506
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
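
The coordinate and length fields above can be cross-checked with simple arithmetic: the gene length should equal End bp minus Start bp plus 1 (the coordinates are inclusive), and the protein length should be one less than the codon count, assuming the final codon is a stop codon. A minimal Python sketch follows; the constant and function names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 90391
END_BP = 92016
GENE_LENGTH_BP = 1626
PROTEIN_LENGTH_AA = 541


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)  # True True
```

Both checks hold for this record: 92016 - 90391 + 1 = 1626, and 1626 / 3 - 1 = 541.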


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.841393
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0151072
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC TATTTTTTGT TGTATTTTTT ACGGCTGTTT TTGCCCACGG GGCTGTGCCT 
GATTTTGATT CCCTTACATT GGACCAGAAA TTGGGACAGA CGCTTGTTGC TTTTGTCGAT
ACGGACAATG CTCATAAATA CCAAAGTGCT ATAGAAAAAG GGTTGGTTGG CGGTGTGCTT
GTGCAATGGG GCAACTACTC TTTAGAGCAA ACGACCGAGC TTGCGGCCAA ATTACAAAGC
TGGGCGGCCA AATCCCCGCA TAAAATACCT TTATTAATTT CAATAGATTA TGAAGGCGGC
ACGGTTTATA CTCCGGTTAC TTTAGGTTTT GAATATCTTC CGCCAAACAT GATGATAGCC
GCCGCTAATG ATGAGGAAGC AGCGGCCAGA ATATTTTATC TTGCCGGCCT TGAACTTAGA
AAAGCGGGTA TACATATAAA CTTTTCGCCC GTGGTTGACG TTAACATTAA TCCGGGCAAC
CCTATAATAG GGGTACGCTC TTTCGGCTCC TCGCCGGAAT TAGTGGGACG TATGGGCGCG
GCTGTTGTAA GTGGGCTTAG CGCGGCGAAC GTAATGTCCG TGGCGAAACA TTTTCCCGGC
CACGGCAACA CTGTTTTGGA TTCTCATTAC AGCCTTCCTG TTTTAAACAT AACAAAAAAA
GAAATGCAGG ATGTTCATCT GGCTCCATTT AAAAAAGCAA TAGAAGCAGG TGTGCCGGGT
ATAATGACGG CTCATATTAT TTATAAAAAT TATGACCCCA AAAATCCCGC CACATATTCC
AAAAGGATAT TAAATGATTT ATTGCGTACG GAGATGAAAT TTAAAGGCGT AATTATATCA
GACGCGCTTG ATATGAAAGG CGCTACCTTA GACGGCAACA TCGCTTTAAG CGCGGCTAAG
ACGCTTGAGG CGGGTTCCGA TATGGCGCTT TTGGGCAGGT TTTTAAACGC GGATAAAACT
TTTAATAAAA TTTACGGTTA TGTGGGAACG GAACTTTCAC AAAAAAGAAT TGAAGAAGCT
TCTAAAAAAA TACTTGATTT AAAAAAACAA ATGGGTTTGT TTGACGAACA GAAAGAACCT
TTTACCTCCA CTTCCAAAGC TTACGCCGCC GCGGCGGAAG TTATAGCCAA AAAATCAGTA
ACCGTTTTAA GAAATAAAAA TAATAAAATT CCGTTAAAAG AAGAGTTTGC TAACACTCCG
GGTAAAAAAG TGTGCGCCGT GTTTTTTGCC CCTACAAGAT TCGCGGAGGA AATAACTTCG
TTTAACAAAC CGTTTTTGGA AAAGGGATGG AAGGTAAATT ATTATAACGC TATTATGAAA
CCCACAAGCA AAGATTTAAA ACGCGCAAGA GAATGCGCTA AAGGAGCGGA CCTTTTTGTT
ATAGGAACTT TACAGTGGGC GGCAAAACCT TTTTACAAAC AAACAGCCGT AATAGGCACT
TTGCTTGAAG AATTTCCCGA CGCGGTTGTT ATATCAACAA TGAGCCCGTA CGAAGTAAAA
ACTTACCCCG GCGCTAAAAC TGTTTTATTA ACTTACGGCA TAAGCAAGCA TTCAATGAAG
GCGGCGGCGG ACGTGATTGT GGGCAATATC CCCGCGCAGG GTAAGCTGCC CATAGAATTG
GAATAA
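
The GC content field of this record can be recomputed from the gene sequence. The sequence is displayed in space-separated blocks of ten bases, so whitespace has to be removed first. A minimal Python sketch follows; the function names are illustrative, and only the first and last displayed lines are used as sample input, so the printed figure is not the whole-gene value.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last displayed lines of the gene sequence, as sample input only.
sample = """
ATGAAAAAAC TATTTTTTGT TGTATTTTTT ACGGCTGTTT TTGCCCACGG GGCTGTGCCT
GAATAA
"""
seq = join_blocks(sample)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing the full gene sequence to `join_blocks` instead of the sample would give a figure comparable to the GC content field.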
 
Protein sequence
MKKLFFVVFF TAVFAHGAVP DFDSLTLDQK LGQTLVAFVD TDNAHKYQSA IEKGLVGGVL 
VQWGNYSLEQ TTELAAKLQS WAAKSPHKIP LLISIDYEGG TVYTPVTLGF EYLPPNMMIA
AANDEEAAAR IFYLAGLELR KAGIHINFSP VVDVNINPGN PIIGVRSFGS SPELVGRMGA
AVVSGLSAAN VMSVAKHFPG HGNTVLDSHY SLPVLNITKK EMQDVHLAPF KKAIEAGVPG
IMTAHIIYKN YDPKNPATYS KRILNDLLRT EMKFKGVIIS DALDMKGATL DGNIALSAAK
TLEAGSDMAL LGRFLNADKT FNKIYGYVGT ELSQKRIEEA SKKILDLKKQ MGLFDEQKEP
FTSTSKAYAA AAEVIAKKSV TVLRNKNNKI PLKEEFANTP GKKVCAVFFA PTRFAEEITS
FNKPFLEKGW KVNYYNAIMK PTSKDLKRAR ECAKGADLFV IGTLQWAAKP FYKQTAVIGT
LLEEFPDAVV ISTMSPYEVK TYPGAKTVLL TYGISKHSMK AAADVIVGNI PAQGKLPIEL
E
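
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. A minimal Python sketch follows, assuming the NCBI table 11 amino-acid assignments, which coincide with the standard code (the two tables differ only in their permitted start codons, which this sketch ignores). The function name and the short input are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first base varies slowest); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three displayed blocks of the gene sequence.
print(translate("ATGAAAAAAC TATTTTTTGT TGTATTTTTT"))  # MKKLFFVVFF
```

The output matches the first ten residues of the protein sequence, and the final six bases of the gene (GAATAA) translate to E followed by a stop codon, consistent with the last residue shown.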