Gene Apar_0549 details

Gene Information

Locus tag: Apar_0549
Symbol:
ID: 8413403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 634907
End bp: 635935
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 38%
IMG OID: 645022122
Product: DNA polymerase III, epsilon subunit
Protein accession: YP_003179571
Protein GI: 257784354
COG category: [L] Replication, recombination and repair
COG ID: [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases
TIGRFAM ID: [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
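
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes one terminal stop codon; both assumptions are consistent with the values listed here (1029 bp, 342 aa). The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T characters.

    Spaces and line breaks are ignored, so a sequence pasted in
    blocks of ten bases can be passed in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    gene_len = span_length(634907, 635935)
    print(gene_len)                           # 1029
    print(expected_protein_length(gene_len))  # 342
```

Passing the gene sequence from the Sequence section to gc_fraction gives a value that can be compared against the GC content field.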


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.860226
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000115657
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGGCTTTCT GGTATCTAGT GTTTCTCGTA GTTGTGTTCC TACTTTTCAA AAAGCGGCGC 
AATAAAAAAC AAAATTCGAC AAATGGTTTT TACCCAACAC CTCCTGCTGG TGCACAAAAT
ATCCCATTGC CTGTATCCCC TCAAGCACCA AAAGTTAAAA CTAAACCAAG TAATGTTCCT
TTAAAAAGAA CGCGCTGGGC AGATTTTGAT GTCTCAAAAT ATCCTGAATC ATATGTAGTA
GTAGATCTTG AAACAACAGG GCTAGACGTC CATTACTGCG AAATCATTGA AATTGCCGCT
CTTAAAGTAG TAGACGGAAA AATTACAGAA GAATTTAGTT CGCTCATTCA TCCTCCAAGA
GAGATACCAT CTGGCGCAAC TGCAATCAAC CATATAACTA ATCACATGGT AAAGAATGCG
CCAACGCTCG ATAAAGTTAT CCCGCAATTT GATACGTTCG TTAAAGGATT TCCTCTAATC
GGTCATAACT CTCTTAGATA TGACGCGATT GTTCTCGAGG AGAATTTCTT TAGACGCGAC
TTTTTATGCG ATTATGTTTG GTATGACACA TACAAATTGG CTAGACAAAT CTTAGAGCCA
CCATACAAAC TGATAAATAT TGCCAAAAGA CTTAACGTTA AACAACATGG CAAAGCTCAC
AGGGCTCTCG CGGACTGTTA TATGACTTAT GGCATCTACG AAAAGATGAG AGAAATTTCT
ATAGCAACAA CGGAAAACGT AAAATGTATT GAGAAATACA CCGATAAAAA CACTGAAAGC
ACAAAGCTTT CTGGAACCGT CTTTTGCTTG ACAGGTGTTC CGTGCTGTAT GCCTAAAAGC
GATTTTCTAA AAATGCTAAT TACAAATGGG GCAACCTTGA GCGAAAGAGT AACTCTCAAA
ACTAATTATT TGATTGATTG CTCTGGAGAC GAAACCACAA AAATTAAAAC AGCTAGGAAG
TATGCCGACC GAACTGGCAT CAAAATTATA AGTGAGCAAC AAATGCTCGA AATGTTAAAA
CAAAGCTAA
 
Protein sequence
MAFWYLVFLV VVFLLFKKRR NKKQNSTNGF YPTPPAGAQN IPLPVSPQAP KVKTKPSNVP 
LKRTRWADFD VSKYPESYVV VDLETTGLDV HYCEIIEIAA LKVVDGKITE EFSSLIHPPR
EIPSGATAIN HITNHMVKNA PTLDKVIPQF DTFVKGFPLI GHNSLRYDAI VLEENFFRRD
FLCDYVWYDT YKLARQILEP PYKLINIAKR LNVKQHGKAH RALADCYMTY GIYEKMREIS
IATTENVKCI EKYTDKNTES TKLSGTVFCL TGVPCCMPKS DFLKMLITNG ATLSERVTLK
TNYLIDCSGD ETTKIKTARK YADRTGIKII SEQQMLEMLK QS