Gene Information |
Locus tag | Apar_1359 |
Symbol | |
ID | 8414250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1533427 |
End bp | 1536363 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 645022962 |
Product | DNA polymerase III, epsilon subunit |
Protein accession | YP_003180374 |
Protein GI | 257785157 |
COG category | [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family; [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative |
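The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; both assumptions agree with the numbers in this record but are not stated by it. The function names `span_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 1533427
END_BP = 1536363
GENE_LENGTH_BP = 2937
PROTEIN_LENGTH_AA = 978


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one stop codon is included."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Because the record lists the gene as unspliced ("Is gene spliced | No"), no intron handling is needed in this check.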
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.838712 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAGCTTG CAGAGAGGGC TGAGCATGAG TCGTTTGATG TCTTAGAGGA TGATATTGTT GTTTTGGATA CTGAGACAAC AGGTTTGTCC TTTAAGAAAT GCTCCCTTAT TGAGATTTCT GCTGCCAAAT TGAGCGGAAG AGAGATCATA GAGCGCTTTC AGACTTTTGT TGATCCTGGA TGCCCAATTC CGGAAGAGAT TACAACCCTG ACTTCAATCA CCGACGAAGA CGTTAAGGGC GCGCCAAGTG CAAAAGAAGC CGTTGCTGCA CTTGCAGAGT TTGTTGGCGG ACTTCCTGTT TTGGCTCACA ACGCCACTTT TGACCGTACG TTTATTGAAC GAGTGCCCGG CGGAACTTCG GTCTCCGATA CTTGGATTGA TACGCTGTCA CTCTCACGTA TAGCGCTACC GAGACTCTCT TCGCACAAGC TGTCGAGCAT GGCTGAAGCG TTTGGCACCA TGAAGGTCAC GCACCGCGCA AGTGACGACG TAGACGCCCT TTGCGGCATG TGGCGTATCT TGCTTTTGGG ACTTATGAAT CTGCCCCGCG GTCTTCTCGC CAAACTGGCA TCAATGCACG ATAACGTGGA GTGGAAATTC AGGCCAATCT TTTCGTATTT GTCGCAGATA AAAGAAAAAG AAGTTGTTCA GCGTGGCATC GCAAGGAAAG ATGCAACAGG TGCAGAACTT GCAGATGCGG AGATTTCGGG CACGTTTTTC TCGCTTAAGG ACATTCGTAG TCAGCTTGTT GCAGATGCAA AAACAAAGGC GAGAAGGGAC GCGGACGACC CAGAAACGCC TGCTATGCTG CCGATTTCTA AAGACGAGAT TCACAGGGCG TTTGCTAAAC CGGGCGTTGT CTCACAGATG TATGACAAGT TTGAGACGCG CAGCGAGCAG GTAAGCATGT CTGTCGAAGT CAGAAACGCT CTGGTAACTT CATCGCATAG AGAACTTGAA GCAGGAACCG GTATTGGCAA GTCGATTGCA TACCTGTTGC CTGAGGCGTT ATTTGCACAG AAAAATGATG TTACTGTAGG TATTGCTACA AAAACGAATG CGCTGACCGA TCAGCTTGTT ACTCATGATC TCCCTGCACT TGCAAGAGCG CTGCCAAATG GGCTGAGTTT TTGTAGCTTA AAAGGATACG AGCACTATCC ATGCTTGCAC CGTGTTGATA GAGCAGCTCT TGAGGAGCTG CCGTTGACGT TGATTGATCA GGAAGGCCGT TCTAGCAATA GCGTTGCGTC AGATATGCTG ACCGCAATTG CAGTGATTTA CGCGTATGCG TGTCAGTCGG CCGATGGTGA CTTGGATGCA TTGGGTATTC GTTGGCGCTC GGTTCCGCGA GAAATGGTGA CTATTAAAGC AGCAGAGTGC CTACGCTCAA AGTGCCCATA TTATCCTCAT GAGTGTTTTG TCCATGGAGC TCGAAAGCGC GCAGGATCTT CAGATGTTGT GGTGACTAAT CACTCTCTGC TACTAAGAAA TGTTGCTGCT GACGGCAAAA TTCTTCCTCC TATAAGGCAT TGGGTCATTG ACGAGGCTCA TGGATTTGAA GCCGAGGCTC GCCATCAATG GGCAATAGAA ATTTCTGCTA AGGAAATGAG AAACGGCTTT GAGCTGCTTG GAGGAATTAA GTCCGGGGCT ATTCACGCTG CTATGGTGGG CGCCGCCAAT CTTGAAGATT CTACGCTGCT TACTGGTCTT CTCACACGTT CTGCTGCTGC AGTTCAACGG GCTATGGCAG CAATGGGCAA CCTCATGGTT GCCGTGCATG AACTGGCTCC GTTGGCTAAA 
AGTGATGGTG GCTATAATTC CCTTCAGCTT TGGATTAACG ATGAAGTGAG AGAAACAAAG GAATGGAAAG AGTTTTTGGA GACCGCTTCT GTTGCTCTTT CTGCTCTTGA AGAAGCAGCA CTTAGAATAG GAAAGACCAC CGAGGCTCTT ACTGCATCAG CTCCAAACCT TGCGAGCAAT TTGAGCGAGT CGGGAATGTT TCTTAGCACC CTTCTGGAGT CATTGAAGCT TATTTGTGAT GGAACGGACA AGAGTTACGT CTATTCGGCA AAGTTGACTC GACTAAAGCG TGATATTGGC TCTGAAGCCC TTGTGGCCGA GAAGCTTGAT ATTGGAGCAG AGTTGGCGGA AAAGTGGCTT CCTGAGACGC ATTCCGTTGT ATTTACTTCG GCAACTATTG CTGTTGGAGA CGATTTTTCT CATTTTGAGC ATGCTGTTGG TCTGGATAGG GGTTCCTTTG AGCACAAGAG TTTGCATTTA GACTCTAGCT TTGACTACGA GAATCACATG GGCGTCTTTG TGGCCGAAGA TATGCCTACA CCAACTGATC CAGGGTATTT GGATGCTCTG GAAAAGTTGC TGTTTGACGT CCATGTTCAG ATGGGCGGTT CGGTGTTGAC GCTCTTTACT AACAGGCGCG ATATGGAGCG TCTTTACGAA GCGCTGGAAC CACGTTTGAG CGAGTATGGC CTCACTCTTG CTTGCCAGGA GCGATCTTCT TCCGCACGGC GCATTCGTGA AAAGTTCCTA GCCGAAAAGA ACCTCTCGCT GTTTGCTCTT AAATCATTTT GGGAGGGATT TGATGCTGCG GGGGACACGC TTAGGTGCGT GGTGATCCCT AAACTTCCGT TCGCAAGCCC TAATGAGCCA CTGGTCAAAG AGCGTGAGGT GCGAGAAGAC CGTGCGTGGT GGCGTTATTC TCTGCCAGAG GCAGTAATTG CAACAAAACA GGCAGCTGGT CGTCTCATCA GGAGTGCTGA GGACAAGGGC GTTTTAGTAC TTGCTGATTC AAGACTGGTA TCTAAGCGAT ATGGCAGTTC GTTTTTGAAA TCGTTACCTA ACAAGAACTA TCAATGTGTC TCAACAAAGA ACATCTCTGG ACAGATTGCT AAGTGGCGAG AAGAACACGA CGCGTAG
|
Protein sequence | MELAERAEHE SFDVLEDDIV VLDTETTGLS FKKCSLIEIS AAKLSGREII ERFQTFVDPG CPIPEEITTL TSITDEDVKG APSAKEAVAA LAEFVGGLPV LAHNATFDRT FIERVPGGTS VSDTWIDTLS LSRIALPRLS SHKLSSMAEA FGTMKVTHRA SDDVDALCGM WRILLLGLMN LPRGLLAKLA SMHDNVEWKF RPIFSYLSQI KEKEVVQRGI ARKDATGAEL ADAEISGTFF SLKDIRSQLV ADAKTKARRD ADDPETPAML PISKDEIHRA FAKPGVVSQM YDKFETRSEQ VSMSVEVRNA LVTSSHRELE AGTGIGKSIA YLLPEALFAQ KNDVTVGIAT KTNALTDQLV THDLPALARA LPNGLSFCSL KGYEHYPCLH RVDRAALEEL PLTLIDQEGR SSNSVASDML TAIAVIYAYA CQSADGDLDA LGIRWRSVPR EMVTIKAAEC LRSKCPYYPH ECFVHGARKR AGSSDVVVTN HSLLLRNVAA DGKILPPIRH WVIDEAHGFE AEARHQWAIE ISAKEMRNGF ELLGGIKSGA IHAAMVGAAN LEDSTLLTGL LTRSAAAVQR AMAAMGNLMV AVHELAPLAK SDGGYNSLQL WINDEVRETK EWKEFLETAS VALSALEEAA LRIGKTTEAL TASAPNLASN LSESGMFLST LLESLKLICD GTDKSYVYSA KLTRLKRDIG SEALVAEKLD IGAELAEKWL PETHSVVFTS ATIAVGDDFS HFEHAVGLDR GSFEHKSLHL DSSFDYENHM GVFVAEDMPT PTDPGYLDAL EKLLFDVHVQ MGGSVLTLFT NRRDMERLYE ALEPRLSEYG LTLACQERSS SARRIREKFL AEKNLSLFAL KSFWEGFDAA GDTLRCVVIP KLPFASPNEP LVKEREVRED RAWWRYSLPE AVIATKQAAG RLIRSAEDKG VLVLADSRLV SKRYGSSFLK SLPNKNYQCV STKNISGQIA KWREEHDA
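The sequence fields above are printed in space-separated groups of ten, so they need to be normalized before any computation. The sketch below shows one way to do that, then computes GC content and checks that a coding sequence begins and ends with codons allowed by translation table 11. The helper names (`normalize`, `gc_percent`, `has_table11_ends`) are hypothetical, and the demo uses only the first and last few bases of the gene sequence rather than the full record. The start and stop codon sets are those of NCBI translation table 11, where TTG (the first codon of this gene) is a permitted initiation codon.

```python
# Initiation and termination codons of NCBI translation table 11.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def has_table11_ends(seq: str) -> bool:
    """True if the length is a multiple of 3 and the first and last codons
    are a table 11 start codon and stop codon, respectively."""
    s = normalize(seq)
    return len(s) % 3 == 0 and s[:3] in TABLE11_STARTS and s[-3:] in TABLE11_STOPS


if __name__ == "__main__":
    first_group = "TTGGAGCTTG"  # first ten bases of the gene sequence
    tail = "CGCGTAG"            # last seven bases of the gene sequence
    print(normalize(first_group)[:3] in TABLE11_STARTS)
    print(normalize(tail)[-3:] in TABLE11_STOPS)
    print(gc_percent(first_group))
```

To apply the same checks to the whole gene, pass the full "Gene sequence" field to `gc_percent` and `has_table11_ends`; the result can then be compared with the "GC content" value reported in the Gene Information table.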