Gene Acid345_2866 details

Gene Information

Locus tag: Acid345_2866
Symbol:
ID: 4070385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3409383
End bp: 3410681
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 60%
IMG OID: 637984884
Product: metal dependent phosphohydrolase
Protein accession: YP_591941
Protein GI: 94969893
COG category: [T] Signal transduction mechanisms
COG ID: [COG2206] HD-GYP domain
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
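
The coordinate and length fields in this record are mutually consistent, which a short Python sketch can confirm. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the gene is a stop codon (the gene sequence in the Sequence section ends in TAG). The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3409383
END_BP = 3410681
GENE_LENGTH_BP = 1299
PROTEIN_LENGTH_AA = 432


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 3410681 - 3409383 + 1 gives 1299 bp, and 1299 / 3 = 433 codons, of which 432 encode amino acids.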


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.128247
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCATTTGG CGGAGATCAA GAACAAATTT CGTAAGCTGT TCACGACCTT CGAGGCGCTC 
AGCGACCTCG GGCCGGCGCT CACGGCAGAA CGCGATTTCG CGCAAACTGC CGACGAACTT
TTGCGCCTGC TCATGGACGC CATCGGCGCT CGCGAAAGCG CGCTCTTCCG CTTTTGCGAT
AAGCCCGCTG TGCTCACCTC GGTCGCATCT CGCGGCTTCC TTTCCTTCCC AAGTCCGGCA
GTCATTCCGT TGCTTCCCTC ACACGTACAC GCGCTGACCA CGGCGCGCGG GCCACAACTT
CTTGCAGCGG AATCTCGACG TACGCTACTC AGTTCGAACG GCAACTTCCC GCCAGATTTG
ATTCAACTGG CTGCGCCTTT GCGCGTCGGG CAGAAGCTCG TCGGAATGCT GGCATTCGGC
CAACCTGATG AATCGCATTA CGGCGAAGAA GAAGTGCATG GCCTGGCGAT GTTCGCGCAC
TATGTCGCGC TCGCCGTCCA GAACAGCGGA CTGACCGAGT CACTCTCCAA TCGTGTTGCG
GAGAACCTGA AACTCATGGC TTCGGTGCAT TCGTTCTACG ACAACGCGCT CGAGGCCTTC
GCCGTCGCGA TTGACGCCAA GCACATCAAT ATCCGCGGAC ACTCGATTCG TGTCGGCCGC
TATGCTGCCT TCATCGGCGA AGCCATGGGC ATGGGAGCTT CGGAAGCTTC CGCCCTGCGC
GCTTCCGGCT ATCTGCACGA TATCGGCAAG GTCGCGGTGG ACCGTCGTCT GTTCGGCAAG
CCCAGCGCGT TGAACGAAGC TGAGTTCAAA GAAATGGCCG ATCACACCAT CGTCGGCAGC
GAGATCGTCT CCGGCGTGCA GTTCCCATGG CCGCAGGTGG GCGAAGTCGT TCGATCGCAT
CACGAGCGTC TTGACGGATC CGGCTATCCC GACCATCTGC GCGGCGACGA ACTTGCAAAA
CATGTGCGCA TCATGGGCTT GGCCGATGCA TTCGATGCCA TGACCAGCGA GCGTCCGTAC
CGGCAGCCGC TCTCCATCGG CGAAGCGTTG ACCGAAGTGG TGAAAATGTC GCCCACGCAC
TTTGATCCCG AGACGGTGCA GGCGCTGCTG GTGCAGGTCC GGCGCGATGC AGTTGCTTCG
TGCAGTCCAA AGCTCAGCGC CGCCTGGATA AAATCGCAGC CGGACAAGCC CAAGTTCCTC
GACGACCGCG TGATGTGCGC CATTGCCCCA CCGGACGTGG ACCAGTTGGC CGCCGTCCTG
CACCACAAGA CAACGCGCAA CCGGGTTTAC TCAAACTAG
 
Protein sequence
MHLAEIKNKF RKLFTTFEAL SDLGPALTAE RDFAQTADEL LRLLMDAIGA RESALFRFCD 
KPAVLTSVAS RGFLSFPSPA VIPLLPSHVH ALTTARGPQL LAAESRRTLL SSNGNFPPDL
IQLAAPLRVG QKLVGMLAFG QPDESHYGEE EVHGLAMFAH YVALAVQNSG LTESLSNRVA
ENLKLMASVH SFYDNALEAF AVAIDAKHIN IRGHSIRVGR YAAFIGEAMG MGASEASALR
ASGYLHDIGK VAVDRRLFGK PSALNEAEFK EMADHTIVGS EIVSGVQFPW PQVGEVVRSH
HERLDGSGYP DHLRGDELAK HVRIMGLADA FDAMTSERPY RQPLSIGEAL TEVVKMSPTH
FDPETVQALL VQVRRDAVAS CSPKLSAAWI KSQPDKPKFL DDRVMCAIAP PDVDQLAAVL
HHKTTRNRVY SN