Gene Emin_0716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0716 
Symbol 
ID6263186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp790244 
End bp791317 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content39% 
IMG OID642611188 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_001875608 
Protein GI187251126 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.661536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0000000238989 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAATA AAAAAACGCT TTTTATTGAC AGGGACGGCA CCTTGATTTT TGAGCCGATT 
GCCACAAAGC AAATAAATTC GCTTGATGAA ATGTTTTTTA CAAAAGGCGT TATCAGCGCT
TTAAAACGTT TTAAACAGGC GGGGTACAGC CTTGTTATCG TTACCAACCA GGACGCTTTG
GGTACACCTG AAAACCCTCG TAAAGTATAT GAGAACATTA ATAATAAAAT GTTCGCAATT
TTTGCTTCCG AAGACATTTT TTTTAACGCT GTTTTAGAAT GCCCGCATAA TAAAACGGAC
GGCTGCGCCT GTCGTAAACC AAAAACTAAA TTGGGCGTTA ATTACATAAA AAATAATCCC
GTGGATTTAG AAAATTCTTA TATGATAGGG GATAGAGACA CCGACGTTGA ATTTGGTGAA
AACCTTGGCA TAAAAAGTTT TAAACTTACT AAAAAATTAG GCTGGGCGGA AATAGCGGGT
GAAATTTTAG ATAAGCCCCG CAAAGCCCAA GTTATAAGAA AAACCAAAGA AACAAACATA
AAATTAAACC TTAATCTTGA CGGTAAAGGA CAAACAAAAA GTAATACGGG TATTGAATTT
TTTGACCACT GCCTTGACCA GCTTGGCAAG CACGGCGGGT TTGATTTGCA AATAAAATGC
AAAGGCGATT TATGCGTGGA CGAACATCAC ACCGTTGAAG ACACGGCGCT CGCGCTGGGG
CAAGCCTTTA AAACGGCGCT CGGCGATAAG CGCGGTATAG AGCGTTATGC CTGGGAAAGA
ATTTTAGTTA TGGATGACGC TAAAGTTGAA ATAAGCATAG ATATTTCAAA CAGGCCTTAC
CTTGTTTTTA AAGGCAAGTT TGACCGTGAA TACGCAGGCA AAATGCCCAC GGAACTTGTG
GAACACTTTT TTGAAAGTTT TGTGTCCGCC TCAGGCATTA ATATGAATAT AAAAATTGAG
GGTAAAAACA CCCACCATAA AATTGAGGCG TGCTTTAAAG CGTTTGCTAG AGTTTTGAGA
GATGCCGTTA AAATAACAGG TACTAAAGTG TCCTCAACAA AGGGAATTTT ATGA
 
Protein sequence
MKNKKTLFID RDGTLIFEPI ATKQINSLDE MFFTKGVISA LKRFKQAGYS LVIVTNQDAL 
GTPENPRKVY ENINNKMFAI FASEDIFFNA VLECPHNKTD GCACRKPKTK LGVNYIKNNP
VDLENSYMIG DRDTDVEFGE NLGIKSFKLT KKLGWAEIAG EILDKPRKAQ VIRKTKETNI
KLNLNLDGKG QTKSNTGIEF FDHCLDQLGK HGGFDLQIKC KGDLCVDEHH TVEDTALALG
QAFKTALGDK RGIERYAWER ILVMDDAKVE ISIDISNRPY LVFKGKFDRE YAGKMPTELV
EHFFESFVSA SGINMNIKIE GKNTHHKIEA CFKAFARVLR DAVKITGTKV SSTKGIL