Gene Emin_0022 details


Gene Information

Locus tag: Emin_0022
Symbol:
ID: 6263619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 23670
End bp: 24998
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 42%
IMG OID: 642610485
Product: PhoH family protein
Protein accession: YP_001874927
Protein GI: 187250445
COG category: [T] Signal transduction mechanisms
COG ID: [COG1875] Predicted ATPase related to phosphate starvation-inducible protein PhoH
TIGRFAM ID:
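
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(23670, 24998)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the values agree with the listed Gene Length (1329 bp) and Protein Length (442 aa).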


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA TATATGTTTT GGATACTAAT GTGTTGCTGC ATGACGTACA GGCAATGGAA 
GCGTTTGAAG ATAATGAAAT AATTGTTCCC ATAGTTGTTA TAGAGGAGCT TGATAATTTT
AAAACCCATT CGGACGAAAG GGGCAAAAAC GCCAGAATAG TATCGCGCGC TTTAGATTCT
TACAGGGAAA AAGGTCGCCT TAGCGAAGGC GTACCCACAA ACAGCGGCGG AACGTTAAGA
ATAGAAATGG AAAGGGCCAA CGTGCTTCCT ACTGGGTTTG TTTTTAATAA ATCAGACAAC
GGTATTTTAA ATATAGCCTA CTCTCTTAAA TTAAAAGAAG AAACCCGCAA AACCAACAAA
AAACCCGTTA TTATAGTTAC AAAAGACACA AATTTACGCT TAAAATCTGA GGCTTTGGGT
ATTGAAGCGC AAGATTTTAT AACGGATAAA ATTAACTTTA GCGAACTTTA TACCGGCGTT
GCGGAAGTGG AGACGGATGC ATCCGTAATA GACGCTCTTT ATAAAAACAA AACCGTGCCG
TTACCTGCGG CCGGCACATA TTATCCAAAC CAATTTATAA TTTTTAAATC TAATGACGGA
AGCAAAAAAT CGGCTATCGG CCGCGTGGGC AACAATGGTG AGCCAAACGT AAAACTTTTA
TCTCAAACAG AGCCTGTGGC ATGGGGTATA AAACCTTTAA ATAAGGAACA GCGTTTTGCC
ATGGAACTTT TGTTAGACGA CAGTTTAGAC ATTGTTACCT TAGTCGGTGC GGCAGGCACG
GGTAAAACCT TAATTACGTT AGCTACCGGT TTACAAAGAA CTTTAGACGA AGAAAAATAC
AGAAGGCTTG TTGTTTGCCG TTCCATAGTT CCTGTAGGTA AAGATATCGG CTTTTTACCG
GGCACAAAAG AAGAAAAACT TGAAGTATGG ATGGGCGCTA TTTATGATAA TATGGCTTTC
CTGGCCGACC GCAGGAACCC CGATGAAGGC GAGGAAAAGG CTAAATATAT TTTAGATTCC
GGTAAAGTTG AAATCGCTTC AATTACGCAT ATAAGAGGCC GCAGCTTGCC GCAACAATAT
ATGATTGTTG ATGACGCGCA AAACCTTACG CCGCATGAAA TGAAAACCAT TTTAACCCGC
GCGGGCGAAG GCACCAAAGT CGTTGTCACG GGCGATCCTT ACCAAATTGA CACGCCTTAT
TTGGACGCGG AATCAAACGG CCTTACCTAT TTAGTTGACA GGTTTAAAGG ACAAAAAAAC
CACGGACACG TAACATTTAC AAAAACCGAA CGCAGCCGCC TGGCGGACTT GGCGTCGCAT
CTGCTTTAA
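
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs light cleaning before any calculation. The following is a minimal sketch, assuming the sequence has been pasted into a Python string; `clean_sequence`, `gc_fraction`, and `looks_like_complete_cds` are hypothetical helper names. The stop codons checked (TAA, TAG, TGA) are those of translation table 11, and only the ATG start codon is tested, although table 11 allows other start codons.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Simplified check: ATG start, stop codon at the end, length divisible by 3."""
    seq = clean_sequence(raw)
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Small illustrative input, not the full gene sequence.
example = "ATGCGGCCAT\nGA"
print(len(clean_sequence(example)), looks_like_complete_cds(example))
```

Applied to the full gene sequence, the result of `gc_fraction` can be compared with the GC content listed in the record (42%).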
 
Protein sequence
MKKIYVLDTN VLLHDVQAME AFEDNEIIVP IVVIEELDNF KTHSDERGKN ARIVSRALDS 
YREKGRLSEG VPTNSGGTLR IEMERANVLP TGFVFNKSDN GILNIAYSLK LKEETRKTNK
KPVIIVTKDT NLRLKSEALG IEAQDFITDK INFSELYTGV AEVETDASVI DALYKNKTVP
LPAAGTYYPN QFIIFKSNDG SKKSAIGRVG NNGEPNVKLL SQTEPVAWGI KPLNKEQRFA
MELLLDDSLD IVTLVGAAGT GKTLITLATG LQRTLDEEKY RRLVVCRSIV PVGKDIGFLP
GTKEEKLEVW MGAIYDNMAF LADRRNPDEG EEKAKYILDS GKVEIASITH IRGRSLPQQY
MIVDDAQNLT PHEMKTILTR AGEGTKVVVT GDPYQIDTPY LDAESNGLTY LVDRFKGQKN
HGHVTFTKTE RSRLADLASH LL