Gene Emin_1531 details

Gene Information

Locus tag: Emin_1531
Symbol:
ID: 6263345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1623255
End bp: 1624691
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 41%
IMG OID: 642612018
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001876415
Protein GI: 187251933
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
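
The start, end, and length fields above agree with each other, and the relationship can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1623255, 1624691)
print(length, protein_length(length))  # 1437 478
```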


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000631966
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 5.75811e-19
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAATCCTG CAAGAATAGT GCACCCCGGC TTTTGGTTTG GGAAAACAGA TATTGAAGAC 
GCCAGAAAAT GGGCAAAAAT GGGCGTAGGC GGATTTTGCG TCTACGGTGG AACCAGAGAA
GAGATTGAGA CATTTTGTAA AGAAATGAGG GGCCTTTCCC CTTACGCGGA AATTTTTATT
TCAGCCGATT ATGAAGACGG CCTTGGAAGG TGGATAAAAG GGGCCGAGCT TTTGCCTTCT
AACATGGCTA TAGGCGCCTC CGGAGAGGAG GAACTTGCCA TGAAAAAAGG GTTAATCACC
GCCAGGCAGG CAAGAAGCAT CGGAATTAAC TGGATTTTTG CCCCTGTGGT TGATTTAGCT
TCGGACCCGG AAAACCCTAT AGTAAATACC CGCGCTTTCG GAAAAGACCC TATGCTTGTG
ACGCGTTTGG CCATGGCTTT TATGTCAGGA TTATCGCAAG GCGGAACTTT AAATACTTTA
AAACATTTTC CCGGACACGG GGACACGTCA AAAGATTCTC ACTTAGAACT ACCTTTTATC
AGCAAATCTT TTGACAAGCT TTTTGATTCC GATTTAGTTC CCTATAAAAC ATTGTTAAAG
TTTGCTGACT CAATTATGGT TGGACATCTT CTTATCCCAG CCATAGACGA TGAAAACCCG
TCTTCTTTAT CGGAAAAAAC AATACGCGGA ATTTTAAGGC AAAAACTTAA TTATAAAGGA
TGTGTTGTTA CCGACGCTCT TTTAATGAAA GCCATCGGCG ACCAAAAAGA AGCCGCTTTA
AAAGCTTTAA AAGCGGGCGC GGATATCTTG CTTGCACCTT CAGACCCTTA TGAAATAATA
GATTATTTAA ACCAGTTAAT TAAAGAAGAT TACACCTGGA AAGAACATTT TATCAACGCA
GTGGCCACGC AAGAAATTCT GCTTACAAAA AACCGGAAAG TGGAAATAAG AACTCCGGAA
TATGCGTTTT TTAAATCTTC TTATTCAATG GACGCGGCGC CTAGATGTAT AACAGAGTTC
GGAGAAGAGA ATGTTTTAAA AAAAGAAAAT TCTTTGTCTT ATATGGAAAT AGATTGTAAA
AGCGATTTTG AAAGCACTCC TTTTGCCAAA CAGCTTAAAG CTAACGGTTT TAAACTGGCC
CCTTATACAG GCGGGGAATG TAAAAATTTG CTTATAGTTT CTTTCTCCGG CTACGCTTCT
TTTAAAGGCT TTGCTAATTT TACAAAAGAG CAAAAGAAAA CAGTGGAAAA CGCCTTAACA
AAAGCCAAGA ACAGCGCTTT TGTTTCCTTC GGCAGCCCTT TTGTGCACAG TGATTTTAAA
ACAAAAGCAC AGTACCATTT GCTTGCGTAC TGTGCTAATG AGGACTTTCA AATTTTTTGC
GCCGACGCGC TTTGCGGTAA AGCCAAAGTT ACTGGCAAAG CTCCTATTGA AATTTAG
 
Protein sequence
MNPARIVHPG FWFGKTDIED ARKWAKMGVG GFCVYGGTRE EIETFCKEMR GLSPYAEIFI 
SADYEDGLGR WIKGAELLPS NMAIGASGEE ELAMKKGLIT ARQARSIGIN WIFAPVVDLA
SDPENPIVNT RAFGKDPMLV TRLAMAFMSG LSQGGTLNTL KHFPGHGDTS KDSHLELPFI
SKSFDKLFDS DLVPYKTLLK FADSIMVGHL LIPAIDDENP SSLSEKTIRG ILRQKLNYKG
CVVTDALLMK AIGDQKEAAL KALKAGADIL LAPSDPYEII DYLNQLIKED YTWKEHFINA
VATQEILLTK NRKVEIRTPE YAFFKSSYSM DAAPRCITEF GEENVLKKEN SLSYMEIDCK
SDFESTPFAK QLKANGFKLA PYTGGECKNL LIVSFSGYAS FKGFANFTKE QKKTVENALT
KAKNSAFVSF GSPFVHSDFK TKAQYHLLAY CANEDFQIFC ADALCGKAKV TGKAPIEI
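
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, under some assumptions: the codon table is built from the standard amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, and this record starts with ATG), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Pasting the full gene sequence into `translate` and `gc_percent` should allow comparison with the protein sequence and GC content listed in this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
# These assignments are the standard ones, which table 11 also uses.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from this record.
print(translate("ATGAATCCTG CAAGAATAGT GCACCCCGGC"))  # MNPARIVHPG
```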