Gene Emin_0715 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0715 
Symbol 
ID6263853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp788965 
End bp790254 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content43% 
IMG OID642611187 
Producthistidinol dehydrogenase 
Protein accessionYP_001875607 
Protein GI187251125 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0000000324572 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAATAT ATAATTATGC AACTTTAACT AAAAAAGAAA TTAAAGCTTT GTCCGAGCGG 
TCTTATCTGA TGCCTGAAAA ATTAAAACAG GCTGTTTGGG AAACGGGCCT TCAGGTTATG
AAAAGGGGGG ATAAAGCTAT AAAAGAATTA ACCGCCGCTT TTGACGGCGT TACTCTTTCG
TCCTTAAAAG TAAGCGTGAA AGAATTTAAA CAAGCCGAAA AACTTGTAAG CAACGAGTTA
AAAGACGCAA TCAAAACAGC ATCTGAAAAT ATAGCTAAAT TCCATAAATC GCAGTTAAAA
ACAAAAGAGC CTGTTATAGA AACATCCAAA GGCATTGAAT GCTGGCGTGA ATTTAAGGCT
TTGGGTACTG CGGGGCTTTA TGTTCCGGGC GGCAGCGCGC CTTTATTTTC AACGGTGCTT
ATGCTTGCTG TGCCGGCTAA AATAGCAGGT TGTAAAAAAG TTTATATTTG CACGCCTCCT
TGTAAAAACG GGCTTATAGC GCCGGAAATT TTGTTTGCCG CTAAAACCGC GGGAGTGGAT
GAGGTTTACA AAATAGGCGG CGCCCAGGCT GTTTTTGCCA TGGCTTACGG CACCCAGACA
GTGCCCAAGG CGGATAAAAT TTTCGGCCCG GGCAACCAGT ATGTAACGCA GGCCAAAATG
GAAGTTTCTT CTTTTACGGC TATAGATATG CCCGCCGGCC CTTCAGAAGT GCTTATAATA
GCGGAGGAAA GCACAAACGC CTCTTACGCG GCGGCCGACA TTTTAAGCCA GGCGGAACAC
GGGCCTGATT CACAGGCTGT TTTAGCGTGT TCAAGCAAAA ATAAAATAAA AGAAATTATA
GCGGAAGTGG ACGCCCAGCT TAAAAAATTG GGAAGAAAAA GCATTGCCGC TAAAGCTTTG
GGCAAAAGCT TTATAATGCA AACCCGTAAT ACTAAAGAAT CAGTCGAGTT TTCCAATATT
TACGCGCCTG AGCATTTGAT ATTAAATTTT AAGGATTGGA AAAAATATTT AACTTCTGTT
CAAAACGCGG GCTCTGTATT TTGCGGAACA TTGGCTACAG AGTCTTTTGG CGATTACGCC
AGCGGCACTA ACCACACTTT ACCGACATCG GGCTTCGCCA AAAGTTTTGG CGGATTAAAT
ACACAAAGTT TCGGTAAATG GATTACTTAC CAAACAGTTT CTGCCCAAGG CTTGCGCGGG
CTTGGTAAAA CGGTTGAAAT TATGGCCGAG GCCGAAGGCC TTACGGCGCA CAAAAACGCC
GTTACTATAA GGATTGAAAA TGAAAAATAA
 
Protein sequence
MKIYNYATLT KKEIKALSER SYLMPEKLKQ AVWETGLQVM KRGDKAIKEL TAAFDGVTLS 
SLKVSVKEFK QAEKLVSNEL KDAIKTASEN IAKFHKSQLK TKEPVIETSK GIECWREFKA
LGTAGLYVPG GSAPLFSTVL MLAVPAKIAG CKKVYICTPP CKNGLIAPEI LFAAKTAGVD
EVYKIGGAQA VFAMAYGTQT VPKADKIFGP GNQYVTQAKM EVSSFTAIDM PAGPSEVLII
AEESTNASYA AADILSQAEH GPDSQAVLAC SSKNKIKEII AEVDAQLKKL GRKSIAAKAL
GKSFIMQTRN TKESVEFSNI YAPEHLILNF KDWKKYLTSV QNAGSVFCGT LATESFGDYA
SGTNHTLPTS GFAKSFGGLN TQSFGKWITY QTVSAQGLRG LGKTVEIMAE AEGLTAHKNA
VTIRIENEK