Gene Information |
Locus tag | Apar_0549 |
Symbol | |
ID | 8413403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 634907 |
End bp | 635935 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 645022122 |
Product | DNA polymerase III, epsilon subunit |
Protein accession | YP_003179571 |
Protein GI | 257784354 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family |
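The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS length includes the terminal stop codon. The helper names `span_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 634907
end_bp = 635935
gene_length_bp = 1029
protein_length_aa = 342


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete, unspliced CDS.

    Assumes the CDS length includes one stop codon, which is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the computed values with the values listed in the record.
print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Under these assumptions the listed gene length (1029 bp) and protein length (342 aa) agree with the coordinates: 343 codons, of which the last is the stop codon.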
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.860226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000115657 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTTTCT GGTATCTAGT GTTTCTCGTA GTTGTGTTCC TACTTTTCAA AAAGCGGCGC AATAAAAAAC AAAATTCGAC AAATGGTTTT TACCCAACAC CTCCTGCTGG TGCACAAAAT ATCCCATTGC CTGTATCCCC TCAAGCACCA AAAGTTAAAA CTAAACCAAG TAATGTTCCT TTAAAAAGAA CGCGCTGGGC AGATTTTGAT GTCTCAAAAT ATCCTGAATC ATATGTAGTA GTAGATCTTG AAACAACAGG GCTAGACGTC CATTACTGCG AAATCATTGA AATTGCCGCT CTTAAAGTAG TAGACGGAAA AATTACAGAA GAATTTAGTT CGCTCATTCA TCCTCCAAGA GAGATACCAT CTGGCGCAAC TGCAATCAAC CATATAACTA ATCACATGGT AAAGAATGCG CCAACGCTCG ATAAAGTTAT CCCGCAATTT GATACGTTCG TTAAAGGATT TCCTCTAATC GGTCATAACT CTCTTAGATA TGACGCGATT GTTCTCGAGG AGAATTTCTT TAGACGCGAC TTTTTATGCG ATTATGTTTG GTATGACACA TACAAATTGG CTAGACAAAT CTTAGAGCCA CCATACAAAC TGATAAATAT TGCCAAAAGA CTTAACGTTA AACAACATGG CAAAGCTCAC AGGGCTCTCG CGGACTGTTA TATGACTTAT GGCATCTACG AAAAGATGAG AGAAATTTCT ATAGCAACAA CGGAAAACGT AAAATGTATT GAGAAATACA CCGATAAAAA CACTGAAAGC ACAAAGCTTT CTGGAACCGT CTTTTGCTTG ACAGGTGTTC CGTGCTGTAT GCCTAAAAGC GATTTTCTAA AAATGCTAAT TACAAATGGG GCAACCTTGA GCGAAAGAGT AACTCTCAAA ACTAATTATT TGATTGATTG CTCTGGAGAC GAAACCACAA AAATTAAAAC AGCTAGGAAG TATGCCGACC GAACTGGCAT CAAAATTATA AGTGAGCAAC AAATGCTCGA AATGTTAAAA CAAAGCTAA
|
Protein sequence | MAFWYLVFLV VVFLLFKKRR NKKQNSTNGF YPTPPAGAQN IPLPVSPQAP KVKTKPSNVP LKRTRWADFD VSKYPESYVV VDLETTGLDV HYCEIIEIAA LKVVDGKITE EFSSLIHPPR EIPSGATAIN HITNHMVKNA PTLDKVIPQF DTFVKGFPLI GHNSLRYDAI VLEENFFRRD FLCDYVWYDT YKLARQILEP PYKLINIAKR LNVKQHGKAH RALADCYMTY GIYEKMREIS IATTENVKCI EKYTDKNTES TKLSGTVFCL TGVPCCMPKS DFLKMLITNG ATLSERVTLK TNYLIDCSGD ETTKIKTARK YADRTGIKII SEQQMLEMLK QS
|
| |
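The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction and check for a terminal stop codon. It is an illustration only: the function names are hypothetical, the demo string is just the first three blocks of the gene sequence, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# Demo input: the first three 10-base blocks of the gene sequence in this record.
demo = clean_sequence("ATGGCTTTCT GGTATCTAGT GTTTCTCGTA")
print(len(demo))                # 30
print(demo.startswith("ATG"))   # True
print(gc_fraction(demo))        # GC fraction of the 30-base excerpt only
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` and `ends_with_stop` would allow the listed GC content and gene length to be checked against the sequence itself.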