Gene Emin_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0021 
Symbol 
ID6263897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp21085 
End bp23445 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content42% 
IMG OID642610484 
Productglycoside hydrolase family protein 
Protein accessionYP_001874926 
Protein GI187250444 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATATT TATGTATACA TGGACATTTT TACCAACCTC CTAGAGAAAA CGCCTGGACA 
GACGCCATTG ATTACCAGGA AAGCGCCTTT CCTTATCATG ACTGGAACGA TCGCATTTGC
GCGGAGTGTT ACCAACCTAA CACCAAATCA AGGGTTTTAG ACGGGGACAA AAACCTTATA
GCTTTGGTTA ATAATTATTC TTCAATAAGT TTTAACTTCG GCCCCACTCT TTTATCCTGG
ATGGAGGAAA AGCAGCCCGA TGTTTACCGC GCCATTTTAG AGGCGGACAG ATTAAGTATG
AAAAAGTTTA ACGGCCACGG CAGTGCCTTA GCGCAGGTTT ATAACCATAT GATAATGCCG
CTAGCCAATG AACGCGACAA ACGCACGCAG ATAATTTGGG GCATAGAGGA CTTTAAAAAA
AGATTCCAAC GTTTTCCCGA AGGAATGTGG CTTGCAGAAA CCGCTTGCGA CACTCCTACT
CTTGAACTGC TTGCCGAACA GGGTATAAAA TTTACCATTT TGGCTCCCGG CCAATGCGCA
AAAACAAGAA AAATCGGTGA TAAAAACTGG CTTGAAACAC CTAACGCCTC AGTAGATCCC
AAAAAACCTT ATTTATGCAA TTTGCCCTCA GGTAAAACAA TTACTTTATT TTTTTATGAC
GGGCCGATAT CACAGGGCAT CGCTTTTAGC GACACTTTAA AAAGCGGGGA AAATTTTGCC
GCCAAATTAA TGGGCGCGTT TACGGACGCT TCTAAAAAAG ACACGGAACT AGTGCACATA
GCCACGGACG GGGAAACCTA CGGGCATCAT CAAAAATTCG CGGATATGGC TCTTGCCTAC
TGTCTTAAAC AGGTTGAAGA TAAAAGCCTT GCCAAAATAA CAATTTACGG CGAGTTTTTA
GAAAAATTCC CCCCAAAATA CGAAGCGAAA ATAAATGAAA ATTCTTCCTG GAGCTGCTTT
CACGGCGTTG AAAGATGGCG AAGCAACTGC GGATGCAACA GCGGCATGCA CCAAGGCTGG
AACCAAAAGT GGCGAGCTCC TTTAAGAGCG GCGCTTGATT TAATAAGAGA ATCTTTTATT
AAAACCTTTG AAACAAAAGG AAGCGAGTTT TACCATGACG TTTGGGACGC GAGAAACGCC
TACATCTCTT TTGTGCTTGA CCGCAAACCC GAAATGCTTG AAAGTTTTTT TAAAGCAAAA
GGAAAAGAAC GAGTTTGGCA AGACAGGCAA ACTGCCGTTG ATTTAATGGA AATGCAGCAT
AACGCCATGC TTATGTATAC AAGCTGCGGC TGGTTTTTTG ACGAGATAAG TGGCATTGAA
ACTGTGCAAA TAATGCAATA CGCGGCCAAA GCGATAGAGC TTCACAAAAA TATTAACGGA
ATTGATTTGG AAGCTGATTT TATAAACAAA CTTTCCGAGG CGCAAAGCAA TATTGCGGAA
CTCGGCAACG GCGGTAATAT TTACGAGCGT TTTGTAAAAA CCGCCGCTTT CACAAGCCAA
AAAGCCGCCG TGCACTACGC GCTTACTCGC ACGTTTAACC ATGAAGATAT TAAGGAAATT
TATTCTTACA CCGTATCCGA CAGCCAGTTA AAGGAAATAG AAACGGGCAC TACAAAAGCC
GTTACGGGCA AAGCGGTGTT TACTTCTAAA ATAGACTTCC GCAGCAGTGA AGAATTTTTT
GTATTGCTTT ATTCGCAAGA TTATAGAATT GTGTGTTTTT GCTCCGAAAA ACCTTTAATA
ACGTTTGAGG AGATAAAAAA TATTATAGAA AACAATTCTA CTGACGAAGC CATAGAACAA
CTGCAAAAGA ATATTCCGTC CTCTTTTACT CTTTCCGACC TCGTAAAAGA CGCGCAGAGG
GTAGTTTCTT CCAAAATACT AAAAAAGCTA CACGCAACAA CCACCGAGGC TTTTGACACC
GTGTTTAACT CGCAGTATCC GCTTTTAAGA GAGTTAAAAT ATATAGGAAC GCCGATACCA
AAGCCGTTTT TTTATGTGGC TATTTTTGTG CTGCTTGAGG ATTTAAAGGA AGAAATTTCT
TCCGGCAATA CAGACCCCGC CAGAATTGAG GAAATGCTTG AGGACTGCCG TAGTTTAAAT
ACAGAAATTA ATTTCGCTCC CGTAAAAGAT ATGGCTCAAA AAAAATTACA AAAAATCTGC
CTGGAGTTTA AAGAAAACCC GGACAGGGAA CACGCCCTTG CTGTTATTGA GCTGCTTTCT
ATACTGCAAA GCACACCTTT CGCGCCGGAT GTTTTCTTTG GGCAGGAGGA CGTTTTTACA
GCGCTTAAAA GCCTGCCAAA AGTTACCAGG GACGAATCCG CCTTTAAGGT TTTAGCAAGA
AAAATGAAAG TAAGAATATA A
 
Protein sequence
MKYLCIHGHF YQPPRENAWT DAIDYQESAF PYHDWNDRIC AECYQPNTKS RVLDGDKNLI 
ALVNNYSSIS FNFGPTLLSW MEEKQPDVYR AILEADRLSM KKFNGHGSAL AQVYNHMIMP
LANERDKRTQ IIWGIEDFKK RFQRFPEGMW LAETACDTPT LELLAEQGIK FTILAPGQCA
KTRKIGDKNW LETPNASVDP KKPYLCNLPS GKTITLFFYD GPISQGIAFS DTLKSGENFA
AKLMGAFTDA SKKDTELVHI ATDGETYGHH QKFADMALAY CLKQVEDKSL AKITIYGEFL
EKFPPKYEAK INENSSWSCF HGVERWRSNC GCNSGMHQGW NQKWRAPLRA ALDLIRESFI
KTFETKGSEF YHDVWDARNA YISFVLDRKP EMLESFFKAK GKERVWQDRQ TAVDLMEMQH
NAMLMYTSCG WFFDEISGIE TVQIMQYAAK AIELHKNING IDLEADFINK LSEAQSNIAE
LGNGGNIYER FVKTAAFTSQ KAAVHYALTR TFNHEDIKEI YSYTVSDSQL KEIETGTTKA
VTGKAVFTSK IDFRSSEEFF VLLYSQDYRI VCFCSEKPLI TFEEIKNIIE NNSTDEAIEQ
LQKNIPSSFT LSDLVKDAQR VVSSKILKKL HATTTEAFDT VFNSQYPLLR ELKYIGTPIP
KPFFYVAIFV LLEDLKEEIS SGNTDPARIE EMLEDCRSLN TEINFAPVKD MAQKKLQKIC
LEFKENPDRE HALAVIELLS ILQSTPFAPD VFFGQEDVFT ALKSLPKVTR DESAFKVLAR
KMKVRI