Gene Apar_1359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1359 
Symbol 
ID8414250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1533427 
End bp1536363 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content49% 
IMG OID645022962 
ProductDNA polymerase III, epsilon subunit 
Protein accessionYP_003180374 
Protein GI257785157 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
[TIGR01407] DnaQ family exonuclease/DinG family helicase, putative 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.838712 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAGCTTG CAGAGAGGGC TGAGCATGAG TCGTTTGATG TCTTAGAGGA TGATATTGTT 
GTTTTGGATA CTGAGACAAC AGGTTTGTCC TTTAAGAAAT GCTCCCTTAT TGAGATTTCT
GCTGCCAAAT TGAGCGGAAG AGAGATCATA GAGCGCTTTC AGACTTTTGT TGATCCTGGA
TGCCCAATTC CGGAAGAGAT TACAACCCTG ACTTCAATCA CCGACGAAGA CGTTAAGGGC
GCGCCAAGTG CAAAAGAAGC CGTTGCTGCA CTTGCAGAGT TTGTTGGCGG ACTTCCTGTT
TTGGCTCACA ACGCCACTTT TGACCGTACG TTTATTGAAC GAGTGCCCGG CGGAACTTCG
GTCTCCGATA CTTGGATTGA TACGCTGTCA CTCTCACGTA TAGCGCTACC GAGACTCTCT
TCGCACAAGC TGTCGAGCAT GGCTGAAGCG TTTGGCACCA TGAAGGTCAC GCACCGCGCA
AGTGACGACG TAGACGCCCT TTGCGGCATG TGGCGTATCT TGCTTTTGGG ACTTATGAAT
CTGCCCCGCG GTCTTCTCGC CAAACTGGCA TCAATGCACG ATAACGTGGA GTGGAAATTC
AGGCCAATCT TTTCGTATTT GTCGCAGATA AAAGAAAAAG AAGTTGTTCA GCGTGGCATC
GCAAGGAAAG ATGCAACAGG TGCAGAACTT GCAGATGCGG AGATTTCGGG CACGTTTTTC
TCGCTTAAGG ACATTCGTAG TCAGCTTGTT GCAGATGCAA AAACAAAGGC GAGAAGGGAC
GCGGACGACC CAGAAACGCC TGCTATGCTG CCGATTTCTA AAGACGAGAT TCACAGGGCG
TTTGCTAAAC CGGGCGTTGT CTCACAGATG TATGACAAGT TTGAGACGCG CAGCGAGCAG
GTAAGCATGT CTGTCGAAGT CAGAAACGCT CTGGTAACTT CATCGCATAG AGAACTTGAA
GCAGGAACCG GTATTGGCAA GTCGATTGCA TACCTGTTGC CTGAGGCGTT ATTTGCACAG
AAAAATGATG TTACTGTAGG TATTGCTACA AAAACGAATG CGCTGACCGA TCAGCTTGTT
ACTCATGATC TCCCTGCACT TGCAAGAGCG CTGCCAAATG GGCTGAGTTT TTGTAGCTTA
AAAGGATACG AGCACTATCC ATGCTTGCAC CGTGTTGATA GAGCAGCTCT TGAGGAGCTG
CCGTTGACGT TGATTGATCA GGAAGGCCGT TCTAGCAATA GCGTTGCGTC AGATATGCTG
ACCGCAATTG CAGTGATTTA CGCGTATGCG TGTCAGTCGG CCGATGGTGA CTTGGATGCA
TTGGGTATTC GTTGGCGCTC GGTTCCGCGA GAAATGGTGA CTATTAAAGC AGCAGAGTGC
CTACGCTCAA AGTGCCCATA TTATCCTCAT GAGTGTTTTG TCCATGGAGC TCGAAAGCGC
GCAGGATCTT CAGATGTTGT GGTGACTAAT CACTCTCTGC TACTAAGAAA TGTTGCTGCT
GACGGCAAAA TTCTTCCTCC TATAAGGCAT TGGGTCATTG ACGAGGCTCA TGGATTTGAA
GCCGAGGCTC GCCATCAATG GGCAATAGAA ATTTCTGCTA AGGAAATGAG AAACGGCTTT
GAGCTGCTTG GAGGAATTAA GTCCGGGGCT ATTCACGCTG CTATGGTGGG CGCCGCCAAT
CTTGAAGATT CTACGCTGCT TACTGGTCTT CTCACACGTT CTGCTGCTGC AGTTCAACGG
GCTATGGCAG CAATGGGCAA CCTCATGGTT GCCGTGCATG AACTGGCTCC GTTGGCTAAA
AGTGATGGTG GCTATAATTC CCTTCAGCTT TGGATTAACG ATGAAGTGAG AGAAACAAAG
GAATGGAAAG AGTTTTTGGA GACCGCTTCT GTTGCTCTTT CTGCTCTTGA AGAAGCAGCA
CTTAGAATAG GAAAGACCAC CGAGGCTCTT ACTGCATCAG CTCCAAACCT TGCGAGCAAT
TTGAGCGAGT CGGGAATGTT TCTTAGCACC CTTCTGGAGT CATTGAAGCT TATTTGTGAT
GGAACGGACA AGAGTTACGT CTATTCGGCA AAGTTGACTC GACTAAAGCG TGATATTGGC
TCTGAAGCCC TTGTGGCCGA GAAGCTTGAT ATTGGAGCAG AGTTGGCGGA AAAGTGGCTT
CCTGAGACGC ATTCCGTTGT ATTTACTTCG GCAACTATTG CTGTTGGAGA CGATTTTTCT
CATTTTGAGC ATGCTGTTGG TCTGGATAGG GGTTCCTTTG AGCACAAGAG TTTGCATTTA
GACTCTAGCT TTGACTACGA GAATCACATG GGCGTCTTTG TGGCCGAAGA TATGCCTACA
CCAACTGATC CAGGGTATTT GGATGCTCTG GAAAAGTTGC TGTTTGACGT CCATGTTCAG
ATGGGCGGTT CGGTGTTGAC GCTCTTTACT AACAGGCGCG ATATGGAGCG TCTTTACGAA
GCGCTGGAAC CACGTTTGAG CGAGTATGGC CTCACTCTTG CTTGCCAGGA GCGATCTTCT
TCCGCACGGC GCATTCGTGA AAAGTTCCTA GCCGAAAAGA ACCTCTCGCT GTTTGCTCTT
AAATCATTTT GGGAGGGATT TGATGCTGCG GGGGACACGC TTAGGTGCGT GGTGATCCCT
AAACTTCCGT TCGCAAGCCC TAATGAGCCA CTGGTCAAAG AGCGTGAGGT GCGAGAAGAC
CGTGCGTGGT GGCGTTATTC TCTGCCAGAG GCAGTAATTG CAACAAAACA GGCAGCTGGT
CGTCTCATCA GGAGTGCTGA GGACAAGGGC GTTTTAGTAC TTGCTGATTC AAGACTGGTA
TCTAAGCGAT ATGGCAGTTC GTTTTTGAAA TCGTTACCTA ACAAGAACTA TCAATGTGTC
TCAACAAAGA ACATCTCTGG ACAGATTGCT AAGTGGCGAG AAGAACACGA CGCGTAG
 
Protein sequence
MELAERAEHE SFDVLEDDIV VLDTETTGLS FKKCSLIEIS AAKLSGREII ERFQTFVDPG 
CPIPEEITTL TSITDEDVKG APSAKEAVAA LAEFVGGLPV LAHNATFDRT FIERVPGGTS
VSDTWIDTLS LSRIALPRLS SHKLSSMAEA FGTMKVTHRA SDDVDALCGM WRILLLGLMN
LPRGLLAKLA SMHDNVEWKF RPIFSYLSQI KEKEVVQRGI ARKDATGAEL ADAEISGTFF
SLKDIRSQLV ADAKTKARRD ADDPETPAML PISKDEIHRA FAKPGVVSQM YDKFETRSEQ
VSMSVEVRNA LVTSSHRELE AGTGIGKSIA YLLPEALFAQ KNDVTVGIAT KTNALTDQLV
THDLPALARA LPNGLSFCSL KGYEHYPCLH RVDRAALEEL PLTLIDQEGR SSNSVASDML
TAIAVIYAYA CQSADGDLDA LGIRWRSVPR EMVTIKAAEC LRSKCPYYPH ECFVHGARKR
AGSSDVVVTN HSLLLRNVAA DGKILPPIRH WVIDEAHGFE AEARHQWAIE ISAKEMRNGF
ELLGGIKSGA IHAAMVGAAN LEDSTLLTGL LTRSAAAVQR AMAAMGNLMV AVHELAPLAK
SDGGYNSLQL WINDEVRETK EWKEFLETAS VALSALEEAA LRIGKTTEAL TASAPNLASN
LSESGMFLST LLESLKLICD GTDKSYVYSA KLTRLKRDIG SEALVAEKLD IGAELAEKWL
PETHSVVFTS ATIAVGDDFS HFEHAVGLDR GSFEHKSLHL DSSFDYENHM GVFVAEDMPT
PTDPGYLDAL EKLLFDVHVQ MGGSVLTLFT NRRDMERLYE ALEPRLSEYG LTLACQERSS
SARRIREKFL AEKNLSLFAL KSFWEGFDAA GDTLRCVVIP KLPFASPNEP LVKEREVRED
RAWWRYSLPE AVIATKQAAG RLIRSAEDKG VLVLADSRLV SKRYGSSFLK SLPNKNYQCV
STKNISGQIA KWREEHDA