Gene Acid345_1178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1178 
Symbol 
ID4072919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1461484 
End bp1463103 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content57% 
IMG OID637983188 
Productmetal dependent phosphohydrolase 
Protein accessionYP_590255 
Protein GI94968207 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTTTA CGACCGCAGT AGTGAAGAGC AAGGTAGCGC GTCGCATCCT GTTGTTGTTC 
ATCGGTTGCG CCCTGTTGCC GGTCACGGTG CTGGCTGTGT TGTCGTTTTA CCAAGTCTCC
TCACAGCTTC GCGAACAGAG CCTGAAACAA TTGCAGCACT CGAGTAAAAG CGAAGGAATG
GCCATCTTTC ACAACCTCGA GATGGCAGAT ACAGATCTAC AACTTCTCGT GTCCGAGGCG
GAACCTTCGT CGAAGCTATC GGCGAGCGAT CTCTTGGCGA AACACTTCTT GCGCGTGGGG
GCATTCGGTC CCTCTCGAGA GGCGCAGGCG GTCCTCGGCA ATTCCATTTC GATGCCGGAC
CTGACGGAGC AGGAACTACA CCACCTCGGA ACCGGAAAGG CCTTGCTGCG AAGCATCCCT
TGCCGAGTTA ACGCCGGCAT GGAGTGTCTG CTGATGCTGC GCAAGAGCAA GAATACGAAT
CTCGTGGTGG CCGGTGAACT CGACCTCGTA TTCGTGTTCG AGGCGCGGAC GATTCCCCCG
GGGCTCGACA TGTGTGTATT TTCGGCAGAC CGGGCACCGA TCTTCTGCCC CGCCGGGGAT
GCGGCGCTCC AGTCGAATGC GGCGGGGCGG TTCGACCACA GTTCCGACCT TTTTTCCTGG
CGGAGTGGAT CGGAGACCTA TGACGCGGCA TATTGGAAGC TGCTGATTAC GCCGAGCTTC
TTCCAGCAGT CATGGACGAT TGTAGTGAGC GCCAACCGTC AAGACGTTCT CGCCCCCATG
GCACGCTTTC GCCAGTCGTT CCCATTGCTC ATCCTGTTAG TGTTCTGGGT CGTTTTGCTG
CTGAGCCTGG TCCAGATCCG CCGCACGCTC GGTCCGCTCG AGAGGCTCGA AGAAGCCACC
AAACAGATTG CGGGACGAGA CTTCAAAATC TCGCTTGATG TGCACAGCGG CGACGAGTTC
GAGAACCTCG CAGACTCCTT CAATGAGATG GCGCGGCAAC TGGAGATGCA GTTCAAAGCG
ATGGAGGAAC TCCATTGGGG GACTCTCACA GCGCTTGCCC GAGCCATTGA CGCTAAATCC
GATTGGACGG CCGGTCATTC CGAACGCGTA ACGGTGCTTG CAACTTCGAT TGGACGGCGG
ATGGGACTCT CGCCCGAAGA ACTCAACATC ATGCACATGG GTGGCTTGTT GCATGACATC
GGGAAAATTG GAACGCCGCC CGATGTCTTG GACAAACCTG GCAAACTCAC CCCCGACGAG
ATGCTCACCA TGATGAGCCA CGTGCAAATC GGCGTGAGAA TCCTGGAGCC CATCGCAGGT
TTCCGCAAAG CGCTCTCGAT CGTGTCACAG CATCACGAGT GGTACGACGG TACCGGCTAC
CCCAACAAGC TGAAGGGTGA GGAGATCAGT CTTCATGCCC GCATTTTCGC CGTGGCCGAC
TGCTTCGATG CCCTGACTTC CGATCGCCCT TACCGTAAGG GTCTTCCCAC CGTGCAGACG
CTGGCGATCA TGAAGAGCCA ATCAGGAACC CACTTCGATC CCGCCGTCCT AATCGTCTTT
TTGGAGATGA TGGCCGAAAA AGAGGAGACT CAGCTTGTCG CAAGTACCAT CACAAACTGA
 
Protein sequence
MQFTTAVVKS KVARRILLLF IGCALLPVTV LAVLSFYQVS SQLREQSLKQ LQHSSKSEGM 
AIFHNLEMAD TDLQLLVSEA EPSSKLSASD LLAKHFLRVG AFGPSREAQA VLGNSISMPD
LTEQELHHLG TGKALLRSIP CRVNAGMECL LMLRKSKNTN LVVAGELDLV FVFEARTIPP
GLDMCVFSAD RAPIFCPAGD AALQSNAAGR FDHSSDLFSW RSGSETYDAA YWKLLITPSF
FQQSWTIVVS ANRQDVLAPM ARFRQSFPLL ILLVFWVVLL LSLVQIRRTL GPLERLEEAT
KQIAGRDFKI SLDVHSGDEF ENLADSFNEM ARQLEMQFKA MEELHWGTLT ALARAIDAKS
DWTAGHSERV TVLATSIGRR MGLSPEELNI MHMGGLLHDI GKIGTPPDVL DKPGKLTPDE
MLTMMSHVQI GVRILEPIAG FRKALSIVSQ HHEWYDGTGY PNKLKGEEIS LHARIFAVAD
CFDALTSDRP YRKGLPTVQT LAIMKSQSGT HFDPAVLIVF LEMMAEKEET QLVASTITN