Gene Acid345_4432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4432 
Symbol 
ID4073343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5262724 
End bp5264784 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content59% 
IMG OID637986470 
Productmetal dependent phosphohydrolase 
Protein accessionYP_593506 
Protein GI94971458 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.206968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC CAGTTCCGAC GCAAGAGATG AACGCTGCCG CGCGCCCGGC CGCGATCCTG 
AGTTACTTCC CCAACGATCC GGAAGGCGAG CGCTTGCTCC GCGCCATTGC GGAGGCCGAG
CTCGTGTCGT TCGATACCCT CGACGAAGAA TTCCAGGAAG CGATTGTGAT CGTCTGCGTG
AACCCAGTGC GGCTCTCGCA GAATATTGCG CGTGCGCGGC GCGAAAACGT GCGCGTGATT
GCCATCTCCA ATGAGCGCTA CGAAGAGCTA GAGATGGACT CGATCGTTCA CGCCTACTTG
CCACCAAATA CTCCCGCTGC AATTTTGCGT CGCACGGTGG ATAGCGCCGT GGCACACGTC
ATGCTCTGGG ACGGTTTCCA GATGCTCGAC GACCGCTTTG CCGGCCTGAC CCGCGAGATC
CACGAACTCA ATCGCATTGG CGCCGCACTC AGCGCGGAAC ACGACACCAA TAAGCTGCTC
GATTTGATCC TGACCAAATG CCGCGAGCTT ACCCGTGCAG ACGCCGGATC GCTCTACCTT
GTGGAACAAG AGCGCGCGAC GGAGCCTCCA GCGCCAATCG ATCCCACGCA TGTATCCACG
CACGGCCCCA AGGCCGCCGA AGTTCCCGCG CCCGATGTGG AGTCGGCCAG CGGCCGCAAG
GTCCTGCGCT TCAAGCTGGC GCAGAACGAC AGCGTCTCGA TTCCCTTCCG CGAAGTCACG
ATTCCGATCA GCGAAAAGTC CATCGCGGGC TATGTCGCGT TGCGCGGCGA GATCGTAAAC
CTGCGCGACG CTTACGATCT GCCGCACGAA GTTCCGTACG CGATCAACCG CAAGTTCGAC
GAAGACTCCG GCTATCGCAC CTGCTCGATT CTCGCGGTGC CAATGCGCGA CCAGAAAGAA
GAGATCGTCG GCGTCATTCA GCTCATCAAC GCCAAGAATT TTGCGGACGC GCGCCTCGAC
TCGCCGCTTG CCGTCGCCCA TGAAGTGATT CCGTTCACCA AGCACCAGCA GGAGATTATT
GCGTCGTTGG CGAGCCAGGC CGCCGTAGCG TACGAGAACA GCCAGTTATA CGCGAGCATC
CAGAGACTCT TCGAAGGCTT CGTTAAGGCG AGCGTAACCG CAATTGAAGC TCGCGATCCA
ACTACGTCGG GCCATTCGTT CCGCGTCGCG AACTTGACCG TCGCACTGGC CGAAACTGTA
GATCGCTGCG AAGACTCGCT CTTTGGCGAC ATTACCTTCA CCCGTTCGGA GATGAAGGAG
ATCCGCTACG CTTCGCTGCT GCATGACTTC GGCAAGGTCG GTGTGCGCGA AGAAGTTCTT
ATCAAAGCGA AGAAGCTCTA CCCCGGGCAG CTCGACTTGA TCCAGCAGCG CTTCGAGTAC
GTGAAGCGCA CGGTTGAGAA CGAGAACCTG CAGTCGCGGG TGAATTACCT GCTCGAGAAA
TCGCGTGACG AGTATCTCGC GAAGCAGACG GAGTACGACG GCGAGCTCAA GCAGAAGCTT
GAGCAACTCG ATCTCTATTA CCAGACGATC GTTGCCGCCA ACGAGCCCAC GGTGCTACCC
GAGGGCAGCT TCGACAGTTT GAAAGGAATC ATGCGCACTG CGTTCCACGC CTACAGTGGC
GACGAACAAC CGCTGCTGCG CGAAGACGAA GTGCTTCTGC TCTCCATCCG CAAAGGCTCG
CTCGACGAAA GCGAACGCCT GCAGATCGAG TCGCACGTGG TGCACACCTA CAACTTCCTC
AGCAAAATTC CGTGGACGAA AGAGATCAGG CACATCCCCA CCATCGCCCG CGGCCACCAC
GAAAAACTCA ACGGCCTCGG CTACCCATTC AAACTCTCGG CGCCGACGAT CCCAATCCAA
ACCCGCATGA TGACCATCTC GGACATCTTC GACGCGCTCT CCGCGAGCGA CCGTCCGTAC
AAGAAAGCCG TCAGTCAGGA ACGCGCGCTG CAAATCCTCG GCTTCGCAGT AAAAGACGGC
GAAGTGGATG GCGCGCTGCT CAAACTGTTC ATCGACGGCA AAGTGTTCGA GCGGTGGAAG
GTCGAACCGT TCCCGTACTA G
 
Protein sequence
MSTPVPTQEM NAAARPAAIL SYFPNDPEGE RLLRAIAEAE LVSFDTLDEE FQEAIVIVCV 
NPVRLSQNIA RARRENVRVI AISNERYEEL EMDSIVHAYL PPNTPAAILR RTVDSAVAHV
MLWDGFQMLD DRFAGLTREI HELNRIGAAL SAEHDTNKLL DLILTKCREL TRADAGSLYL
VEQERATEPP APIDPTHVST HGPKAAEVPA PDVESASGRK VLRFKLAQND SVSIPFREVT
IPISEKSIAG YVALRGEIVN LRDAYDLPHE VPYAINRKFD EDSGYRTCSI LAVPMRDQKE
EIVGVIQLIN AKNFADARLD SPLAVAHEVI PFTKHQQEII ASLASQAAVA YENSQLYASI
QRLFEGFVKA SVTAIEARDP TTSGHSFRVA NLTVALAETV DRCEDSLFGD ITFTRSEMKE
IRYASLLHDF GKVGVREEVL IKAKKLYPGQ LDLIQQRFEY VKRTVENENL QSRVNYLLEK
SRDEYLAKQT EYDGELKQKL EQLDLYYQTI VAANEPTVLP EGSFDSLKGI MRTAFHAYSG
DEQPLLREDE VLLLSIRKGS LDESERLQIE SHVVHTYNFL SKIPWTKEIR HIPTIARGHH
EKLNGLGYPF KLSAPTIPIQ TRMMTISDIF DALSASDRPY KKAVSQERAL QILGFAVKDG
EVDGALLKLF IDGKVFERWK VEPFPY