Gene Acid345_4294 details


Gene Information

Locus tag: Acid345_4294
Symbol:
ID: 4071867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5101827
End bp: 5102954
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 55%
IMG OID: 637986327
Product: metal dependent phosphohydrolase
Protein accession: YP_593368
Protein GI: 94971320
COG category: [R] General function prediction only
COG ID: [COG3481] Predicted HD-superfamily hydrolase
TIGRFAM ID:
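
The listed lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is an untranslated stop codon; both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5101827
end_bp = 5102954

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # matches the listed Gene Length
print(protein_length, "aa")  # matches the listed Protein Length
```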


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.298288
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAGT TCTACATCTG TAATTGCGTC CAGCAGGAAA ACAAAGTCAT CACCTCGCAG 
TTCGTCGTTG TCTCCAAACA GATCAAGGCG AAGAAGAACG GTGAGCCCTA CCTCGCGCTC
ACCTTGGGCG ACCGTTCCGG TTCGGTGGAC GCGAAGATGT GGGACAACGT CGAGGAGTTC
CTCAACGCCT TCGAGCAGGA TGACTTCGTC AAAGTGAAGG GGCTCATCAA CAAGTACAAC
AACCGCTGGC AGCTCACCAT TCACAAAATC CGCAAGATGG GCGATAGCGA AGTCGATTTC
TCCGATTACT TGCCGAAGAC ATCCAAGGAC ATCGATCAGC TCTGGGCGAA GCTCGGCGAA
TTTGTCGAGA CCATGCAGCA ACCGCAGCTC AAGCAGTTGC TAAAGCTGTT CATGGCAGAC
GAGGAAATTT CGAAAGCCTA TCGCGACGCA CCCGCCGCCA AGACGCTGCA TCATGCCTAC
ATCGGTGGCT TGCTCGATCA CGTCGTGTCA TTGTTCTCGA GCTGCGATCT CATCGTGCGC
AATTATCCGA TGGTGAATCG CGACCTGCTC TTCACCGGCG TCTTCCTGCA CGACATCGGC
AAGATCCACG AGCTCACCTA CGCGCGTTCG TTCGGCTACT CTACCGCCGG CCAGCTGCTC
GGGCACATGA TCATTGAGCT CGAAATGTTG CAAAAGAAGC TCGCGCTCAT TCCAGATTTT
CCGCCCGAAC TGAAGATCCT GATCGAGCAC CTCATCATCA GCCATCACGG CGAATACGAG
TTCGGATCGC CGAAGCTGCC GATGTTCCCC GAGGCCCTCA TGCTCCACTA CATGGACGAC
CTCGATTCCA AAATGGAAGC CATGCGTGCG CAGTTCGAGC GTGAATCCGA GAACGAAAGC
CCGTGGACGA GCTATAACAG TTCGCTCGCG CGCCCATTGC TTAACTCACG AAAGTTCCTC
GAGAAGCCCA ATCCGGTTGA AGAACCCGAG CCTGCGCCCG AAGAAAAGCC AGAGTTCGAG
ACAGAAACAA TGGCCGCTAC GGCCGCTGCG ACTGAACCGG ATAGCGTCTC TCAGAAGAGC
CTGCTCGATC TCCAGTCACA CTTTGCCGCA AAGAAGAATT CCATCTAG
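
A GC content figure like the one listed under Gene Information can be recomputed from a sequence laid out in 10-base groups as above. This is a minimal sketch, not the database's own method: the helper name `gc_content` is hypothetical, and the sample string is only the first row of the gene sequence, so its result will differ from the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop the spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First row of the gene sequence only; paste all rows to get the full-gene value.
first_row = "ATGAAAGAGT TCTACATCTG TAATTGCGTC CAGCAGGAAA ACAAAGTCAT CACCTCGCAG"
print(f"{gc_content(first_row):.1f}%")
```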
 
Protein sequence
MKEFYICNCV QQENKVITSQ FVVVSKQIKA KKNGEPYLAL TLGDRSGSVD AKMWDNVEEF 
LNAFEQDDFV KVKGLINKYN NRWQLTIHKI RKMGDSEVDF SDYLPKTSKD IDQLWAKLGE
FVETMQQPQL KQLLKLFMAD EEISKAYRDA PAAKTLHHAY IGGLLDHVVS LFSSCDLIVR
NYPMVNRDLL FTGVFLHDIG KIHELTYARS FGYSTAGQLL GHMIIELEML QKKLALIPDF
PPELKILIEH LIISHHGEYE FGSPKLPMFP EALMLHYMDD LDSKMEAMRA QFERESENES
PWTSYNSSLA RPLLNSRKFL EKPNPVEEPE PAPEEKPEFE TETMAATAAA TEPDSVSQKS
LLDLQSHFAA KKNSI