Gene Acid345_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1033 
Symbol 
ID4069857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1296934 
End bp1298451 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content62% 
IMG OID637983040 
ProductPpx/GppA phosphatase 
Protein accessionYP_590110 
Protein GI94968062 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID[TIGR03706] exopolyphosphatase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.35316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.768311 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACCT TCGCCGCTGT CGATATTGGA TCGAACTCCG TCCGGCTCAA GATAGCTGCT 
CTCAACCGTC GGCGACTGGA AACCCTTTTT GAAGATCGGG AAGTGACGCG CCTGGGCGAG
TCTGTGTTCC GCGCCGGATT GCTGGATCCG CGCGCGATGG AACAGACCGT AAAGGTGCTG
CGGCGCTTTC ATCGCGCGGT GCAGCAGCAC GGGGCAGATC GCGTCCGAGT GGTGGCAACG
AGCGCGCTAC GCGATGCGCG CAACGGCAAC GCGTTCCTGC AGTGGGTGCG GGCATCGACT
GGCTGGCAGT GTGAGGTGAT CTCGGGGCTG GAAGAAGGCC GCCTGATTCA CCTTGGCGTG
ATGGCCGGAA GTCGGATTAA GAGCTCGCCG ATGCTGCTGA TTGACCTTGG CGGCGGCAGT
TGCGAATTGA CCATCTCGGT GAAGGAGCAG ATCGAGAAAA TCGTCAGCTT GCCCCTGGGT
GCGGTGCGGC TCACAAAGCA GTTTCTCGAG CACGATCCGC CGAAGAAGAA AGAGCTAAAG
GAACTGCGGG CGTTCATCGC AGAGGAGATC GGGCGCGTCG CGAAGCAAAT GTTGCAGGCG
AAGGTAAAAA TGACGGTAGC AACATCGGGC ACGCCCGCGG CGCTCTCCGA CATGTGGGCA
GCGCGCGAAC GCAAGCATAC GACCACCGTT CCGCGGGCTG GACTGCTGGA GTTGACGCAC
GAGTTGAGTC GCATGACGCT GGCGCAGCGC CGGACGGTCC AGGGAGTTGG CACTCGACGA
GCGGAGATCA TCATTGCCGG CGCGGTTGTC TTCTCGGAGT TGCTGACGCA CTTGAAGCTG
GGAAGCTTTC GATATCTGCC CCTGGGATTG CGCGATGGGA TGCTGGCGCA GATGGCGGCG
GAGCACGACC AGCGCGCCGA TCTCCGGACG CGGTTGGTGG CAGAGCGAGA GAAGTCGGTG
TATGACCTTG GCACGCATTT TGGAGTTGAT CATCGGCATG CCGAACGCGT GCGCGATCAT
GCGGTGCGGT TGTTCCAGGC GTTGAAACCG GTGCATGGAT TGCCCTCGCA GTACGAGCAG
TGGGTGGCGG CAGCGTCCAT GCTGGCCGAG GTGGGATCGT TCATCAATCG CTCAGGACGA
CATCGGCATA CTTACTACGT GATCTCGAAT TCGGAAATTT TCGGTTACAC GGTGCAGCAG
CGCAGGGTCA TCGCGGCGAT CGCGCGGTTC GTGGGCGGTT CGAAGCCGAC ACTGCAGAGC
CGGCAACTCC GGGTGCTGTC GCCACAAGAC CGGCCTTTGA TTCCGCGGGC GGTGCTGCTG
TTGAGAATGG CCCGCGCGCT GGAACAGGGA CGTCGTGGGG CAGTGAAGGG AATCAAGGCG
CGAGTGGAAG CGGATCGCGT GCTGCTGGCA GTGGATGAGC GGTCCACGGG CGCGGAACTG
GAGATCTGGG CGCTGCGCAA AGAGCGCGCT TACTTCCGCG AAGTTTTTGG CAGGGATTTG
CTGTGCGCGG AACCGTAG
 
Protein sequence
MPTFAAVDIG SNSVRLKIAA LNRRRLETLF EDREVTRLGE SVFRAGLLDP RAMEQTVKVL 
RRFHRAVQQH GADRVRVVAT SALRDARNGN AFLQWVRAST GWQCEVISGL EEGRLIHLGV
MAGSRIKSSP MLLIDLGGGS CELTISVKEQ IEKIVSLPLG AVRLTKQFLE HDPPKKKELK
ELRAFIAEEI GRVAKQMLQA KVKMTVATSG TPAALSDMWA ARERKHTTTV PRAGLLELTH
ELSRMTLAQR RTVQGVGTRR AEIIIAGAVV FSELLTHLKL GSFRYLPLGL RDGMLAQMAA
EHDQRADLRT RLVAEREKSV YDLGTHFGVD HRHAERVRDH AVRLFQALKP VHGLPSQYEQ
WVAAASMLAE VGSFINRSGR HRHTYYVISN SEIFGYTVQQ RRVIAAIARF VGGSKPTLQS
RQLRVLSPQD RPLIPRAVLL LRMARALEQG RRGAVKGIKA RVEADRVLLA VDERSTGAEL
EIWALRKERA YFREVFGRDL LCAEP