Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_1104 |
Symbol | |
ID | 8413977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1246780 |
End bp | 1249518 |
Gene Length | 2739 bp |
Protein Length | 912 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 645022693 |
Product | DNA polymerase I |
Protein accession | YP_003180123 |
Protein GI | 257784906 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATA CTACTATGCC AGTTAATACT GCAGATCAAA ACAACGCCAC ATCAGCCAAA GAAGCCGCAG TTAAACCACG TAGAACTATT GCCGTTATTG ACGGAAACTC GCTTATGCAT CGAGCATTTC ATGCAATTCG TCAGCCTATG GCGGCCCCAG ATGGAACGCC AACTGGTGCG TTGTTTGGCT TCTTTAACAT GTTTATCAAG CTTGTTGAGT CCTTCTCGCC AGATGGTGTC ATTTGTGCGT TTGATAAGGG CAAACCTCAG ATTCGTATTG ATATGTTGCC TCAGTACAAA GCTCAGCGTC CTCCTATGGA TCCTGCTTTA CACGCACAGT TTCCTGTAGT CAAAGAGCTG CTCAAGACGC TGGATGTTCC CGTCTGTCAG CTTGAAGGCT GGGAGGGTGA CGATATTTTA GGTACCCTTG CTCGTCGTGG AGAGGCAGAG GGTTACCAGA TGCTGCTCTT TACAGGCGAC CGTGATATGT ACCAGCTCTC CACAGAAAAC GTCAAGATTG TCTCTACGCG CAAGGGTGTT TCCGATGTCT CAATTATGAC TCCAGAGACC GTAGCTGACC TTTATGCAGG TATTACGCCA GAGCTTGTCC CAGATTTCTA TGGACTTAAG GGAGATTCTT CCGACAACAT TCCTGGTGTT CCGGGAATTG GCCCCAAGAA AGCTGCTGCT CTCATTGTGG AGTATGGTTC GCTTGATGAG GTTATTGCTC ATGCCGATGA GGTTAAGGGC AAGGTGGGAG AAAACCTTCG CGCCCACATT GATGACGCGC TACTCTCCCG CAAGGTTGCT ACTATTCGTA CCGATGCACC GCTTGATATC AACCTGGATG ACGCTAAGTT CCCAACATTT GATCCAAATG ACGTGGTCAA GGCATTTTCG GCCTTGGGCT TTACAGGCAT GACCTCAAGG CTTGCTCGTA TGGCTGGTGG TTCTGCTTCT GGTGTAACCG CACAAACTGC TTCTGCTACG CCTGCGCTTC CTATTCCGTC AGAATTTCTG CAGGCAGGAG ACGCGTCTGC TGCTTTGACC GCTGCTCTTG AGAATCAGGA GTGGATTGGT GTTGCAGAGG ATTCTGCCGC CGACACTGGC GCACTCTTTA GTCTTTCTCA CACACTTTGG GTGTCTACAG AAAAGGCGCT TCTCAAATTT GAGGGAGAAA ACGCAACCGC GGCGCTGTTG CGTATTTTGC GAGAAGCTCG TGTTGCTGCA GACGATGTCA AGGCACTGCT TCATGAGGTA AGCCCTGTTG ATTCATCCGA GCCACAGAAA CTGGATATTG AGACCATTGA TTCTGACCGC CTCTTTGATA CGGGCGTTGC TGCGTATCTG CTTGAGTCTG ATCGTTCAAG TTTTGATATC AACGGCCTGG TTGAGTTGTA TTTTGGCTCT GAGCTTCCTG AGCCTACAGA TGAGATTCCT GCCGCAGCAT TAAGAGCTGT AGCTGCTAGG GCTCTTGTGC CTGTGCTCAC TAAAAAGCTT GAAGATGATG GTTCCCTGGA GCTCTTTACT TCTCTAGAGA TGCAACTTTT GCCTGTACTT GCTGCGATGG AGCGTCGCGG TCTCTACGTT GATCCTCAAA AGCTTGCCGA ACAGTCCGCT CAGCTTGGTG AAGACATTGC GCAACGAGTG GCAAGTATTC ACCAGACAGC AGGCGAGGAT TTCAACTTGG ATTCTCCAAG TCAGCTTTCT CATATCTTGT TTGACGTGCT CAAGCTTCCA ACTTATGGTC TCAAAAAGAC TCGTACTGGC TTCTATTCCA CCAACGCAAA AGTTCTTGGA GAGCTGGCAC AGGAGTATCA GATTGTTGCT GATGTTCTTG AGTATCGCGA ACGCGCCAAA ATCAAATCTA CCTATCTAGA TGCTCTTCCT TCACTTATTC GTGGTGATAA GCGTATCCAT ACCACGCTCA ATCAGACCGT TACTGCAACG GGCAGGCTTT CTAGCTCTGA TCCAAACTTG CAGAACATTC CTACTCGTTC TGAGCTGGGA CATCGTGTTC GCACCGCATT CACTGTTCCC GAGGGCAGCG TTTTTCTCGC CTGTGACTAC TCTCAGATTG AGCTTCGCCT GCTCGCTCAT CTTTCTGCTG ATGAGCACCT AGTAGCAGCA TTCAACTCTG GCGCTGACTT CCATGCAGCA ACTGCCGCAC GTGTCTTTGG CGTTCCTGTT GAAGAGGTAA CCCCAGCACT TCGTAGTCGT GCTAAGGCAG TCAACTTTGG CATTGTCTAT GGTCAGCAGG CCTTTGGTCT GGCAACGTCT CTTAAGATTT CTCGCAAGGA AGCGCAAGAG ATGATTGACC GTTATTTTGA TGCCTATCCT GGTGTTCGTG CGTATTTAGA TGACTCCGTC CGCACGGCTC ACGAGTGTGG TTACGCTATT ACCATGTATG GTCGTAAGCG TCACATTAGA GAGTTTAACC AATCAAATCG CCAGCTTATT GCCTTTGGCG AGCGCACCGC CATGAATCAT CCTATGCAGG GAAGTGCTGC AGATATTATC AAGATTGCAA TGATTAATGT CGAGAAACGC TTGCGTGATG AAGGTCTTAC GTCAAAACTG ATTCTTCAGA TCCACGACGA ACTTGACTTG GAAGTTCCAG AGTCAGAGAT TGAGACAGTT TCTACGCTGG TAAAAGAGAC TATGGAAGAC GTTGTTACCC TGCGTGTTCC TTTGATTGCT GATGTCAGCT ATGGTTCCAA TTGGGCAGAG GCAAAGTAA
|
Protein sequence | MSDTTMPVNT ADQNNATSAK EAAVKPRRTI AVIDGNSLMH RAFHAIRQPM AAPDGTPTGA LFGFFNMFIK LVESFSPDGV ICAFDKGKPQ IRIDMLPQYK AQRPPMDPAL HAQFPVVKEL LKTLDVPVCQ LEGWEGDDIL GTLARRGEAE GYQMLLFTGD RDMYQLSTEN VKIVSTRKGV SDVSIMTPET VADLYAGITP ELVPDFYGLK GDSSDNIPGV PGIGPKKAAA LIVEYGSLDE VIAHADEVKG KVGENLRAHI DDALLSRKVA TIRTDAPLDI NLDDAKFPTF DPNDVVKAFS ALGFTGMTSR LARMAGGSAS GVTAQTASAT PALPIPSEFL QAGDASAALT AALENQEWIG VAEDSAADTG ALFSLSHTLW VSTEKALLKF EGENATAALL RILREARVAA DDVKALLHEV SPVDSSEPQK LDIETIDSDR LFDTGVAAYL LESDRSSFDI NGLVELYFGS ELPEPTDEIP AAALRAVAAR ALVPVLTKKL EDDGSLELFT SLEMQLLPVL AAMERRGLYV DPQKLAEQSA QLGEDIAQRV ASIHQTAGED FNLDSPSQLS HILFDVLKLP TYGLKKTRTG FYSTNAKVLG ELAQEYQIVA DVLEYRERAK IKSTYLDALP SLIRGDKRIH TTLNQTVTAT GRLSSSDPNL QNIPTRSELG HRVRTAFTVP EGSVFLACDY SQIELRLLAH LSADEHLVAA FNSGADFHAA TAARVFGVPV EEVTPALRSR AKAVNFGIVY GQQAFGLATS LKISRKEAQE MIDRYFDAYP GVRAYLDDSV RTAHECGYAI TMYGRKRHIR EFNQSNRQLI AFGERTAMNH PMQGSAADII KIAMINVEKR LRDEGLTSKL ILQIHDELDL EVPESEIETV STLVKETMED VVTLRVPLIA DVSYGSNWAE AK
|
| |