Gene Emin_0018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0018 
Symbol 
ID6263543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp17734 
End bp19191 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content38% 
IMG OID642610481 
Productalpha amylase catalytic region 
Protein accessionYP_001874923 
Protein GI187250441 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAACAA ATCCTCATAT TCTTGAAATA AATACAGCAG TTTGGCTCAA CAAACTGAGT 
GAAAAATATG GCAAAAAAAT TCCGCTTTGC CATGTTCCTG ACGAGGAACT TGATAAGTTT
AAAAAATACG GCTTTGACGC TGTTTGGCTT ATGGGCATTT GGAAACGCAG CCCGGAAGCC
CGGGAAGTAG CACAAAATAA TGAGGAAATA AAAAGAGCGA TAGCTGCTTT TAATCCCAAT
TATGATAAAA AGTATATCGC AGGGTCTCCT TACGCAATTT ATGACTATAC TATAGACCCT
GATTTGGGCA CTGAGGACGG GCTTAAAAAC TTTAGGCAGC GTTTAAACGA AAGAGGAGTA
GCGCTTCTTT TAGATTTTGT GGGTAATCAT TTTGCAATTG ACCATCCGCA AACCTTAGAA
AATCCCGATT TTTTTATTAA CACCGGCATG GAAGCGCCCG CTCAAAATCC GGAATGGTTT
TTCCGAACGG AAAAAGGCGT TTTTATAGCC CATGGGAGGG ACCCTTACTT TCCCCCGTGG
ACGGATACTG CGCAGCTTAA CTATTTTAAT CCTAAGACAC AGGAATATAT GTTGCAATGT
TTGGAAAGGA TATCTTCTTT CTGCGACGGA GTAAGGTGCG ACATGTCAAT GCTTTCATTA
AACAAAGTAA TGAAAGACAC TTGGGGAAAC TACCTTAAGT ATGATTATCC TAAAGAAGAA
TTTTGGACAA AAGCTGTAAG TAAAATCAAA AATATAAATC CGTTGTTTTC CTTTATTGCC
GAAGTTTATT GGGGCCTTGA GTGGGATATA CAGGAAATGG GTTTTGACTA TACTTACGAT
AAAGTTTTGT ATGACAGATT AAGATTTTCC ACAGCTGAAG CTATAGAAGC GCACTTAGGG
GCGGAACACT TATTTCAAAT GCGTTCTATA CGTTTTATTT CTAACCACGA TGAAGAGTCC
GCGTTAAAAG CTTTTGGTAA GGAAAAATCT TTAGCGGCCG CGGCTATAAT TTCAACAATT
CCCGGCGCAA AAATGTTTAG CTTAGACCAA ATTTTGGGGC ATAAAGAAAA GATTCCCGTA
CAATATACTT TGGAATCTGA AAAAGATGAT GAGGAGATAA TATCTTTTTA CCAAAAACTT
TTAAGTATTA TTAACCATCC TTCTTTTCAC GGCGGACAAT GGACGGTTAA AAAAGTTCTG
TCAGTAAACG ACAGTCTTAC TTACAAAAAT GTTTTGTGCT GCAGCTGGGT TCAAGGCATT
GAGCATAAAA TTGTTGTTAT AAATTATTCA AACGCCGAAG CTGTTTGTAA GGTTACTTTA
AAACGTTTTA AATTTAAAGA ACTTAAAGAT GACATCGCGG GAAAACCGGT TGATATTCCC
GTTGAAGAAG CATATAAAAA CGGGATTACG CTACAACTTA AACCATACGA GATTAAAATA
TACGGTACTA CCATTTAA
 
Protein sequence
MRTNPHILEI NTAVWLNKLS EKYGKKIPLC HVPDEELDKF KKYGFDAVWL MGIWKRSPEA 
REVAQNNEEI KRAIAAFNPN YDKKYIAGSP YAIYDYTIDP DLGTEDGLKN FRQRLNERGV
ALLLDFVGNH FAIDHPQTLE NPDFFINTGM EAPAQNPEWF FRTEKGVFIA HGRDPYFPPW
TDTAQLNYFN PKTQEYMLQC LERISSFCDG VRCDMSMLSL NKVMKDTWGN YLKYDYPKEE
FWTKAVSKIK NINPLFSFIA EVYWGLEWDI QEMGFDYTYD KVLYDRLRFS TAEAIEAHLG
AEHLFQMRSI RFISNHDEES ALKAFGKEKS LAAAAIISTI PGAKMFSLDQ ILGHKEKIPV
QYTLESEKDD EEIISFYQKL LSIINHPSFH GGQWTVKKVL SVNDSLTYKN VLCCSWVQGI
EHKIVVINYS NAEAVCKVTL KRFKFKELKD DIAGKPVDIP VEEAYKNGIT LQLKPYEIKI
YGTTI