Gene Information |
Locus tag | Apar_0296 |
Symbol | |
ID | 8413144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 342146 |
End bp | 343813 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645021863 |
Product | putative manganese-dependent inorganic pyrophosphatase |
Protein accession | YP_003179318 |
Protein GI | 257784101 |
COG category | [C] Energy production and conversion |
COG ID | [COG1227] Inorganic pyrophosphatase/exopolyphosphatase |
TIGRFAM ID | |
|
|
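The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only a sanity check, not part of the record. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no residue to the protein. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 342146
end_bp = 343813
gene_length_bp = 1668
protein_length_aa = 555

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in 3 bp codons; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codons = span // 3
expected_protein_length = codons - 1

span_matches = span == gene_length_bp
in_frame = span % 3 == 0
protein_matches = expected_protein_length == protein_length_aa

print(span, codons, expected_protein_length)  # 1668 556 555
print(span_matches, in_frame, protein_matches)  # True True True
```

Under these assumptions the three fields agree: 1668 bp is 556 codons, of which 555 code for amino acids.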
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.079543 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAAG CAATTCGTAA AGTAAATATC ATTGGCCATT TGAACCCAGA TACAGACAGC ATTTGTGCTG CAATTTCTTA TGCGTATCTC AAAAATCAGA TTGACAATCC TATCTATGAA GCTCGTCGTG CTGGTTCTCT TAATCGTGAG ACCGCTTTTG TCTTGAATCA CTTTGGCTTT GAAGAGCCAC AACTTATTAC TACCGTTACT CCTCAGATTA AAGATGCAGA GATTCAGACT CAGCCAAAAG TTGATGCTGA GATGAGTCTC TATTCCGCTT GGCAGCTTAT GCAAAACGTC AAGCTTGATA CGCTCTGCGT TACTGATGAA GAAGAAGAAC TTACTGGTCT GATTGCAGTT AAAGACATTG CAAATGCAAA CATGAGCCTT TCCGAGCCAA ACTTACTTTC CAAGGCAAAG ACAAGCTATG CAAACATTGT TTCAACGCTT GGTGGAACCA TGGTTCTTGG TGATCCCCAG GGCGTTGTTA AGCAGGGTAA CATTCGCGTT GGTACTAGTG CTTCAGCCCT TTCAGAGATT GTTGACGCTG GCGATATCAT TGTTGTTGCT GGCAATCATG AAAATCAAAC CATCGCAGTT GAGCACGGTG CTTCCTGCCT TATTGTCTCT TGCGATGCTC CAATTGCTCA AGACGTCATT GACCTAGCTA AAAAGCGTGA TTGCGCTATT ATCAGCACTC CTCGTGATAC CTTTGAGGTA GCTCGTCTAC TCATCATGTC TATGCCTGTA CGCGAGAAGA TGCTCACCGA TGACATTCTC AAGTTCAGCG TTAACACGGC AATTGATGAT GCACGCAAAG CTATGACCAA CTCGAGACAC CGCTTCTTTC CTGTTATCAA TGAAAACGGT ACCTTTGCTG GCCTAATCAG CGGTCCTGGT CTTTTAAATC CTCGCAAGAA GCATGTCATT CTTGTTGACC ACAACGAGCG CACTCAAGCT GTCGATGGTC TTGAGCAGGC CGAAATTATG GAGATTGTTG ATCACCACCG CATTGGCTCC ATTGAGACTT CTAATCCAAT TACCTTTAGA AATGTTCCTG TTGGCTGCAC CTGTACCATC ATCTATGGTC TCTACCATGA ATACGGTATT GATATTCCTA AAAACATTGC TGGTCTTATG CTCTCTGCAA TTCTTTCTGA CACCCTTGCA TTCCGCTCCC CTACATGCAC CGAGCGCGAT ATTGTTGCAG GTAAGAAACT AGCGGAAATC TGTGGAGAAG ACATTGACTC CTACTCTGAG CAGATGTTTG ATGCTGGTGC AGACCTTACT GGCCGTACCG CAGAGGAAGT CTTCCATGGT GACTACAAGG TATTCAGCCG TGGCGGCGTA AAGTTTGGTG TTGGTCAAGG TTCCTTCATG ACCGAGACTA GCCGTAAGGC TGCAGAGGAA CTTGTTGGAC CATTTTTGGA AACCGCCGCT AAATCAGAAG AACTTCCAAT GGTCTTCTAT ATGTTCACCG ACGTTAAGAG CCAGGTCACC GAGATGCTCT TCTATGGTGC TAATGCTGCT AATGTCATCG AGAGAGCTTT TAACGTAAAG GTTGACGGCA ACATTGCTGT TCTTCCTGGT GTTGTTAGCC GTAAGAAGCA GGTTGTTCCA TCACTGATGG CAACACTTCA AACGCTTGCC GAGGAAGCCG CCAACTAA
|
Protein sequence | MAEAIRKVNI IGHLNPDTDS ICAAISYAYL KNQIDNPIYE ARRAGSLNRE TAFVLNHFGF EEPQLITTVT PQIKDAEIQT QPKVDAEMSL YSAWQLMQNV KLDTLCVTDE EEELTGLIAV KDIANANMSL SEPNLLSKAK TSYANIVSTL GGTMVLGDPQ GVVKQGNIRV GTSASALSEI VDAGDIIVVA GNHENQTIAV EHGASCLIVS CDAPIAQDVI DLAKKRDCAI ISTPRDTFEV ARLLIMSMPV REKMLTDDIL KFSVNTAIDD ARKAMTNSRH RFFPVINENG TFAGLISGPG LLNPRKKHVI LVDHNERTQA VDGLEQAEIM EIVDHHRIGS IETSNPITFR NVPVGCTCTI IYGLYHEYGI DIPKNIAGLM LSAILSDTLA FRSPTCTERD IVAGKKLAEI CGEDIDSYSE QMFDAGADLT GRTAEEVFHG DYKVFSRGGV KFGVGQGSFM TETSRKAAEE LVGPFLETAA KSEELPMVFY MFTDVKSQVT EMLFYGANAA NVIERAFNVK VDGNIAVLPG VVSRKKQVVP SLMATLQTLA EEAAN
|
| |
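The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA, even though the record lists the strand as "-"), and it relies on translation table 11 assigning the same amino acid to each codon as the standard code. The `translate` helper and the constant names are illustrative and do not come from any particular library. Only the first three and last two blocks of the gene sequence are used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first, second, third base).
# Table 11 differs from the standard code in its permitted start codons,
# not in these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    Any trailing partial codon is dropped. Non-ACGT characters raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in the record.
head = "ATGGCAGAAG CAATTCGTAA AGTAAATATC"
# Last two blocks of the gene sequence in the record (18 bases, in frame
# if the full sequence is 1668 bp as stated).
tail = "GAGGAAGCCG CCAACTAA"

print(translate(head))  # MAEAIRKVNI
print(translate(tail))  # EEAAN
```

Both fragments match the corresponding ends of the protein sequence in the record: the first ten residues (MAEAIRKVNI) and the last five (EEAAN). Passing the full gene sequence to `translate` would be the natural next step for checking all 555 residues.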