Gene Apar_0296 details


Gene Information

Locus tag: Apar_0296
Symbol:
ID: 8413144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 342146
End bp: 343813
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 46%
IMG OID: 645021863
Product: putative manganese-dependent inorganic pyrophosphatase
Protein accession: YP_003179318
Protein GI: 257784101
COG category: [C] Energy production and conversion
COG ID: [COG1227] Inorganic pyrophosphatase/exopolyphosphatase
TIGRFAM ID:
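
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The snippet below is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 342146
end_bp = 343813

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon,
# which is assumed not to be counted in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1668 0 555
```

Under those assumptions the result agrees with the reported 1668 bp and 555 aa.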


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.079543
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGAAG CAATTCGTAA AGTAAATATC ATTGGCCATT TGAACCCAGA TACAGACAGC 
ATTTGTGCTG CAATTTCTTA TGCGTATCTC AAAAATCAGA TTGACAATCC TATCTATGAA
GCTCGTCGTG CTGGTTCTCT TAATCGTGAG ACCGCTTTTG TCTTGAATCA CTTTGGCTTT
GAAGAGCCAC AACTTATTAC TACCGTTACT CCTCAGATTA AAGATGCAGA GATTCAGACT
CAGCCAAAAG TTGATGCTGA GATGAGTCTC TATTCCGCTT GGCAGCTTAT GCAAAACGTC
AAGCTTGATA CGCTCTGCGT TACTGATGAA GAAGAAGAAC TTACTGGTCT GATTGCAGTT
AAAGACATTG CAAATGCAAA CATGAGCCTT TCCGAGCCAA ACTTACTTTC CAAGGCAAAG
ACAAGCTATG CAAACATTGT TTCAACGCTT GGTGGAACCA TGGTTCTTGG TGATCCCCAG
GGCGTTGTTA AGCAGGGTAA CATTCGCGTT GGTACTAGTG CTTCAGCCCT TTCAGAGATT
GTTGACGCTG GCGATATCAT TGTTGTTGCT GGCAATCATG AAAATCAAAC CATCGCAGTT
GAGCACGGTG CTTCCTGCCT TATTGTCTCT TGCGATGCTC CAATTGCTCA AGACGTCATT
GACCTAGCTA AAAAGCGTGA TTGCGCTATT ATCAGCACTC CTCGTGATAC CTTTGAGGTA
GCTCGTCTAC TCATCATGTC TATGCCTGTA CGCGAGAAGA TGCTCACCGA TGACATTCTC
AAGTTCAGCG TTAACACGGC AATTGATGAT GCACGCAAAG CTATGACCAA CTCGAGACAC
CGCTTCTTTC CTGTTATCAA TGAAAACGGT ACCTTTGCTG GCCTAATCAG CGGTCCTGGT
CTTTTAAATC CTCGCAAGAA GCATGTCATT CTTGTTGACC ACAACGAGCG CACTCAAGCT
GTCGATGGTC TTGAGCAGGC CGAAATTATG GAGATTGTTG ATCACCACCG CATTGGCTCC
ATTGAGACTT CTAATCCAAT TACCTTTAGA AATGTTCCTG TTGGCTGCAC CTGTACCATC
ATCTATGGTC TCTACCATGA ATACGGTATT GATATTCCTA AAAACATTGC TGGTCTTATG
CTCTCTGCAA TTCTTTCTGA CACCCTTGCA TTCCGCTCCC CTACATGCAC CGAGCGCGAT
ATTGTTGCAG GTAAGAAACT AGCGGAAATC TGTGGAGAAG ACATTGACTC CTACTCTGAG
CAGATGTTTG ATGCTGGTGC AGACCTTACT GGCCGTACCG CAGAGGAAGT CTTCCATGGT
GACTACAAGG TATTCAGCCG TGGCGGCGTA AAGTTTGGTG TTGGTCAAGG TTCCTTCATG
ACCGAGACTA GCCGTAAGGC TGCAGAGGAA CTTGTTGGAC CATTTTTGGA AACCGCCGCT
AAATCAGAAG AACTTCCAAT GGTCTTCTAT ATGTTCACCG ACGTTAAGAG CCAGGTCACC
GAGATGCTCT TCTATGGTGC TAATGCTGCT AATGTCATCG AGAGAGCTTT TAACGTAAAG
GTTGACGGCA ACATTGCTGT TCTTCCTGGT GTTGTTAGCC GTAAGAAGCA GGTTGTTCCA
TCACTGATGG CAACACTTCA AACGCTTGCC GAGGAAGCCG CCAACTAA
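
A GC content figure such as the one in the Gene Information fields is typically the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming plain A/C/G/T input with optional whitespace; `gc_fraction` is a hypothetical helper name, and the example runs only on the first 60 bases of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)

# First line of the gene sequence above (60 bases).
first_line = "ATGGCAGAAG CAATTCGTAA AGTAAATATC ATTGGCCATT TGAACCCAGA TACAGACAGC"
print(f"{gc_fraction(first_line):.0%}")  # 40%
```

Passing the full 1668-base sequence to the same function would give the gene-wide figure.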
 
Protein sequence
MAEAIRKVNI IGHLNPDTDS ICAAISYAYL KNQIDNPIYE ARRAGSLNRE TAFVLNHFGF 
EEPQLITTVT PQIKDAEIQT QPKVDAEMSL YSAWQLMQNV KLDTLCVTDE EEELTGLIAV
KDIANANMSL SEPNLLSKAK TSYANIVSTL GGTMVLGDPQ GVVKQGNIRV GTSASALSEI
VDAGDIIVVA GNHENQTIAV EHGASCLIVS CDAPIAQDVI DLAKKRDCAI ISTPRDTFEV
ARLLIMSMPV REKMLTDDIL KFSVNTAIDD ARKAMTNSRH RFFPVINENG TFAGLISGPG
LLNPRKKHVI LVDHNERTQA VDGLEQAEIM EIVDHHRIGS IETSNPITFR NVPVGCTCTI
IYGLYHEYGI DIPKNIAGLM LSAILSDTLA FRSPTCTERD IVAGKKLAEI CGEDIDSYSE
QMFDAGADLT GRTAEEVFHG DYKVFSRGGV KFGVGQGSFM TETSRKAAEE LVGPFLETAA
KSEELPMVFY MFTDVKSQVT EMLFYGANAA NVIERAFNVK VDGNIAVLPG VVSRKKQVVP
SLMATLQTLA EEAAN
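
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact string form of the standard genetic code; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not taken from the source database.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence above.
first_line = "ATGGCAGAAG CAATTCGTAA AGTAAATATC ATTGGCCATT TGAACCCAGA TACAGACAGC"
last_line = "TCACTGATGG CAACACTTCA AACGCTTGCC GAGGAAGCCG CCAACTAA"
print(translate(first_line))  # MAEAIRKVNIIGHLNPDTDS
print(translate(last_line))   # SLMATLQTLAEEAAN
```

Both outputs match the start and end of the protein sequence listed above; the last line can be translated on its own because the lines before it total 1620 bases, a multiple of three.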