Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1033 |
Symbol | |
ID | 4069857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1296934 |
End bp | 1298451 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637983040 |
Product | Ppx/GppA phosphatase |
Protein accession | YP_590110 |
Protein GI | 94968062 |
COG category | [F] Nucleotide transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0248] Exopolyphosphatase |
TIGRFAM ID | [TIGR03706] exopolyphosphatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.35316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.768311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACCT TCGCCGCTGT CGATATTGGA TCGAACTCCG TCCGGCTCAA GATAGCTGCT CTCAACCGTC GGCGACTGGA AACCCTTTTT GAAGATCGGG AAGTGACGCG CCTGGGCGAG TCTGTGTTCC GCGCCGGATT GCTGGATCCG CGCGCGATGG AACAGACCGT AAAGGTGCTG CGGCGCTTTC ATCGCGCGGT GCAGCAGCAC GGGGCAGATC GCGTCCGAGT GGTGGCAACG AGCGCGCTAC GCGATGCGCG CAACGGCAAC GCGTTCCTGC AGTGGGTGCG GGCATCGACT GGCTGGCAGT GTGAGGTGAT CTCGGGGCTG GAAGAAGGCC GCCTGATTCA CCTTGGCGTG ATGGCCGGAA GTCGGATTAA GAGCTCGCCG ATGCTGCTGA TTGACCTTGG CGGCGGCAGT TGCGAATTGA CCATCTCGGT GAAGGAGCAG ATCGAGAAAA TCGTCAGCTT GCCCCTGGGT GCGGTGCGGC TCACAAAGCA GTTTCTCGAG CACGATCCGC CGAAGAAGAA AGAGCTAAAG GAACTGCGGG CGTTCATCGC AGAGGAGATC GGGCGCGTCG CGAAGCAAAT GTTGCAGGCG AAGGTAAAAA TGACGGTAGC AACATCGGGC ACGCCCGCGG CGCTCTCCGA CATGTGGGCA GCGCGCGAAC GCAAGCATAC GACCACCGTT CCGCGGGCTG GACTGCTGGA GTTGACGCAC GAGTTGAGTC GCATGACGCT GGCGCAGCGC CGGACGGTCC AGGGAGTTGG CACTCGACGA GCGGAGATCA TCATTGCCGG CGCGGTTGTC TTCTCGGAGT TGCTGACGCA CTTGAAGCTG GGAAGCTTTC GATATCTGCC CCTGGGATTG CGCGATGGGA TGCTGGCGCA GATGGCGGCG GAGCACGACC AGCGCGCCGA TCTCCGGACG CGGTTGGTGG CAGAGCGAGA GAAGTCGGTG TATGACCTTG GCACGCATTT TGGAGTTGAT CATCGGCATG CCGAACGCGT GCGCGATCAT GCGGTGCGGT TGTTCCAGGC GTTGAAACCG GTGCATGGAT TGCCCTCGCA GTACGAGCAG TGGGTGGCGG CAGCGTCCAT GCTGGCCGAG GTGGGATCGT TCATCAATCG CTCAGGACGA CATCGGCATA CTTACTACGT GATCTCGAAT TCGGAAATTT TCGGTTACAC GGTGCAGCAG CGCAGGGTCA TCGCGGCGAT CGCGCGGTTC GTGGGCGGTT CGAAGCCGAC ACTGCAGAGC CGGCAACTCC GGGTGCTGTC GCCACAAGAC CGGCCTTTGA TTCCGCGGGC GGTGCTGCTG TTGAGAATGG CCCGCGCGCT GGAACAGGGA CGTCGTGGGG CAGTGAAGGG AATCAAGGCG CGAGTGGAAG CGGATCGCGT GCTGCTGGCA GTGGATGAGC GGTCCACGGG CGCGGAACTG GAGATCTGGG CGCTGCGCAA AGAGCGCGCT TACTTCCGCG AAGTTTTTGG CAGGGATTTG CTGTGCGCGG AACCGTAG
|
Protein sequence | MPTFAAVDIG SNSVRLKIAA LNRRRLETLF EDREVTRLGE SVFRAGLLDP RAMEQTVKVL RRFHRAVQQH GADRVRVVAT SALRDARNGN AFLQWVRAST GWQCEVISGL EEGRLIHLGV MAGSRIKSSP MLLIDLGGGS CELTISVKEQ IEKIVSLPLG AVRLTKQFLE HDPPKKKELK ELRAFIAEEI GRVAKQMLQA KVKMTVATSG TPAALSDMWA ARERKHTTTV PRAGLLELTH ELSRMTLAQR RTVQGVGTRR AEIIIAGAVV FSELLTHLKL GSFRYLPLGL RDGMLAQMAA EHDQRADLRT RLVAEREKSV YDLGTHFGVD HRHAERVRDH AVRLFQALKP VHGLPSQYEQ WVAAASMLAE VGSFINRSGR HRHTYYVISN SEIFGYTVQQ RRVIAAIARF VGGSKPTLQS RQLRVLSPQD RPLIPRAVLL LRMARALEQG RRGAVKGIKA RVEADRVLLA VDERSTGAEL EIWALRKERA YFREVFGRDL LCAEP
|
| |