Gene Acid345_3335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3335 
Symbol 
ID4070297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3955864 
End bp3957699 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content60% 
IMG OID637985357 
Productmetal dependent phosphohydrolase 
Protein accessionYP_592410 
Protein GI94970362 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGACT CCGAAATCGC GACTGGGTTG AAGCGCTTCC TTCCGCAGCG CATCCCCATT 
CTGTACCTGA TCCTCGGGGC GCTCCTCGCC GTGAGCGTCA TCCCGATGTA CTTTTATGCG
CGGGTCGTCG CGATCAACCG TGACCGGTTG AAGACCAACG AAATGCTTCT GCAGAACACC
GTCACGCGTT CACTCGCTGA CGACGCCTCG CAGCGCCAGC GCAATCTGCA GATGATGCTC
CAGAACCTTT CGACCGCCGT GCAGATAAAC AGCGGCGGCA ATCTCGACGA CGCCACGATA
GCTTCGCCCG AAATGCGCGC GCTGCTTGAG AACTTTGTCT CCAGCGGCGG CGACGTGGCG
TATGCCACGA TCCTGAACAG CGAAGCCAAG GGCGTGACCG CCGGCCGCAT TGTTCCCGAC
GAATTCCTGC AGCGCGAAAT GAAGAAGGGC TTCGACCTGG CCAAGGAAGG GCAAGCCTAT
ACAGGGCAGG CCCTGCAAGT GGGCACCGGC GACGATACCC ACACCGTGAT GCTGGTGACC
CGTCCGATCA TGATCGGCGA TACGTTCATC GGACTGATCT CGGCCGTAGT TGACCTCGAT
TACCTGCTCA ACCGGTTGCA GGAAGTAAGC CATGGCGGCC TGATTGCCTA CGTGGTGGAT
AGCCACGGCC GCCTGATTGC CGGCGCCGAG CGCAGCTACG CCATCGGCCA GGACATGACG
TCGATTGAAC TGGTAAAAGC ATTCGTGGAA GAAGGCGGAC GTCTCGCCGC GACCCACGAA
TTCAACATGA GCGTTGAGGG CAAGAAAATC GAGATGCTGG GCACGTACAG CCCGGTGCAA
TCGTTGAACT GGGCGGTGAT CGCGCAGAAG CAGACCAGCG AGGCCTACCA GGGCATTTAC
GAGATGCAGC GCAGCGCACG CCTGCTGGCA ATTCTCGCGG TGCTGATGTC GCTGGTGATT
TCGATCTGGG CGGCGCGCCG CATCACCACG CCTCTCGACG TGCTCACGCA ATCCAGCCGC
GCCATTGCGC GCGGAGATTT CAGCCGTCGC GTGGAACTGG TGACGCGAAC GGAAATCGGC
GAACTCGCGA ATACGTTCAA CAGCATGACG GACGAAATCG AACGGCACAT TGAAGACCTG
AAGCGTGCCG CCGAAGAGAA CCGTCAACTG TTCTTGAGCT CGATCCAAAT GCTCGCCGGC
GCAGTCGACG AAAAGGACCC GTACACTCGC GGTCACTCCG ATCGCGTGAC GCGCTATTCC
GTTCTCATCG CGACCGAGAT GGGCTTAAGC ACTGAAGAAG TCGAGAAGAT CCGCATTTCG
GCGCAGCTTC ACGACGTGGG TAAGATCGGT ATCGAAGACC GCATCTTGAA AAAGCCGGGC
GCGCTCACAC CGGAAGAGTT CGAGATCATG AAGACCCACA CGACCAAGGG CGCCATCATC
CTGCGTCCGG TCACGCAATT GGCCGACATG ATCCCGGGCA TCGAACTCCA CCACGAATCG
CTCGACGGCC GCGGCTATCC CTACGGACTG AAGGGCGACC AGATCCCGCT GATGCCGCGC
ATCATCATGG TGGCCGATAC CTTCGATGCC ATGACCACCA ACCGTCCCTA CCAGGCGGCT
GCCGATCCGG AGTACGTGGT GCGCATCATC AACTCGCTGG CGAACACCAA GTTCGATCCT
CGCTGTGTAG CGGCATTTAC CGCGGTCTTC CAACGCGGGC AGGTGCACGT GAAGCGCGAT
GTGCCGATGC CTGCTGTCGC CATGGCGGCT GCTGCACCTC TTCCCGTTGC CGGCCACCGC
GAGGCTGCAC TGCTCGTGGA TACCGAGCGC ATCTAG
 
Protein sequence
MHDSEIATGL KRFLPQRIPI LYLILGALLA VSVIPMYFYA RVVAINRDRL KTNEMLLQNT 
VTRSLADDAS QRQRNLQMML QNLSTAVQIN SGGNLDDATI ASPEMRALLE NFVSSGGDVA
YATILNSEAK GVTAGRIVPD EFLQREMKKG FDLAKEGQAY TGQALQVGTG DDTHTVMLVT
RPIMIGDTFI GLISAVVDLD YLLNRLQEVS HGGLIAYVVD SHGRLIAGAE RSYAIGQDMT
SIELVKAFVE EGGRLAATHE FNMSVEGKKI EMLGTYSPVQ SLNWAVIAQK QTSEAYQGIY
EMQRSARLLA ILAVLMSLVI SIWAARRITT PLDVLTQSSR AIARGDFSRR VELVTRTEIG
ELANTFNSMT DEIERHIEDL KRAAEENRQL FLSSIQMLAG AVDEKDPYTR GHSDRVTRYS
VLIATEMGLS TEEVEKIRIS AQLHDVGKIG IEDRILKKPG ALTPEEFEIM KTHTTKGAII
LRPVTQLADM IPGIELHHES LDGRGYPYGL KGDQIPLMPR IIMVADTFDA MTTNRPYQAA
ADPEYVVRII NSLANTKFDP RCVAAFTAVF QRGQVHVKRD VPMPAVAMAA AAPLPVAGHR
EAALLVDTER I