Gene Apre_1289 details

Gene Information

Locus tag: Apre_1289
Symbol:
ID: 8398079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1386195
End bp: 1388249
Gene Length: 2055 bp
Protein Length: 684 aa
Translation table: 11
GC content: 42%
IMG OID: 644995634
Product: DNA topoisomerase I
Protein accession: YP_003153033
Protein GI: 257066777
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
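
The coordinate and length fields above are mutually consistent, which the short Python sketch below checks. It assumes that the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1386195
end_bp = 1388249

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: every codon encodes one residue except a final stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2055 0 684
```

Under these assumptions the computed values agree with the listed gene length (2055 bp) and protein length (684 aa).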


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.781177
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCTAAGA ACCTAGTAAT AGTAGAGTCT CCAACCAAGG CCAGATCCAT CTCAAAGATG 
CTCGGAAGAA ACTACAAGGT AATGGCAACA GTAGGCCACC TCCGTGATCT TCCAAAGAGC
AAGTTTGGAG TGGATATAGA AAATAACTTT GAACCAGAAT ATATCAAGGT TAGGGGACGA
GCAAAGACTA TAAATGAACT AAAAAAAGAA GCGAAAAAGG CAGAAAATGT CTACCTTGCG
ACAGACCCGG ATAGGGAAGG AGAGGCCATA AGCTGGCATT TACAATTTCT CTTAGACCTT
GACCCTGAAG CGAAAAACAG GGTAGAGTTC CACGAGATAA CCAAAGAAAA TGTCAAGAAC
GCCATCAAAA ACCCAAGGAA AATCGACCAG AACCTAGTTG ACGCTCAACA GGCAAGGCGA
GTAATGGATA GGATTGTAGG TTACGAGATA AGCCCAATCC TCTGGAAGAG GGTCAAGGCA
GGTTTATCTG CAGGTCGTGT CCAATCAGTT GCCCTAAAGC TTATAGTAGA TAAGCAAAAG
GAAATCGACG ATTTTGTTCC AGAAGAATAC TGGACTATAA CAGCCCACCA CAAGGAAGGT
AGGGAGAAAT TCGACTCAGA ATTCTATGGA CAAATCAACA AGAAAATAAA AATAAGCAAT
GAAAACGGAG CGGATAAGGT CCTAAATAAA ATCGATAAGG ATAAGTTTGA AGTTGTAAAA
ATCACCAAGA CTAAAAAGAG AAGAAAGCCT CAAAAGCCTT ACACAACCTC AACCCTCCAA
CAAGATGCCT CCAATAGATT AGGGTTTTCT ACAAGATTTA CCATGCAGCT AGCCCAACAG
CTCTTTGAGG GTATAGATGT AGGAGATGGA AGTGTGGGTC TTATTACCTA TATGAGAACT
GACGCTAACA GGATCTCTAA GGAGATCGTA GGCGAAGCCC TCTCATATAT TAAGGAAAAA
TACGGACCGG AATATGCTGG CAAGGGAAAT ACCTACGGGG GCAAGAAAAA GGGCAGCCAA
GATGCCCACG AGGCCATAAG ACCTACCTCT ATTAGGAGAA ATCCTCTAGA GATTAAGGAA
TACCTAACAG ATCAACAATA TAAGCTATAC AAGATGATTT GGGAAAGAGT CGTAGCAAGC
CAGATGACAG ATTACGAATT CCTATCAACC CAAGTCCTAT TCGACAACAA TTCCCTAATC
TTTAAGACAA ACGGGAAAAT CACCCTATTT GAAGGTTTCA ATAAATTGGG AGCAAATAAA
GAAAACGAAA ATATCCTACC AGAGCTTAAG GAAGGGGATG TGATAAGTGC TGAGTCAATC
GATAAGGACC AACACTTCAC TAAGCCTCCA GCAAGATATA CTGAGGCAAG TCTTGTAAAG
ACCCTAGAAG AATTCGGCAT AGGTAGACCT TCAACCTATT CTGCTACCAT CAACCAAATC
ATCTCAAGAA ACTACGTAGA ACTTGAAGGA AGATCAATCT TCCCAACAGA TCTAGGAAAA
ACCGTAAATA CCTTCCTCCA AGAAAACTTT GACGATGTAA TAAACGTAGA GTTCACCAGG
GAAATGGAAG ATGCCTTGGA TAATATCGCA GAAGGAGATA GATTCTGGAA AGAAACATTA
AAATCCTTCT ACAAGGACTT CGAAAAAGAC ATGAAGGGTG TCAAAAAGGA CGGCAAGGAC
TACAAGGTAA GAGATGAAAT CTTAGAAGAA AAATGCCCAA AATGCGGAAA GCCTCTTGCC
ATCAAACACG GAAGAAACGG GAAATTCATA GGCTGTACCG GCTTTCCAGA TTGTAACTTT
ACCAAATCAA TAGTAAAATC AACCGGAGTC AAATGCCCAG AATGTGAAGA CGGAACGATA
ATAGAAAAAG TCAGCAAAAG AGGCAAGAGA TTCTACGGCT GTGACAACTA CCCAAAATGC
GACTTTGCCC TATGGGACCC ACCAACAGGA GAAAAATGTC CAGAATGCGG CTCTCTCCTA
ATCCACAAGA AAAACAGGTC CACAGACGAA ATAAAATGCT CCTCCTGTGA CTATGTCAAA
GAAAAGAGGA GATAA
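
A GC content figure such as the one listed for this gene can be recomputed from the nucleotide sequence. The sketch below assumes the usual definition, (G + C) divided by total length, expressed as a percentage; the record does not say how its value was derived or rounded, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    # Drop the spaces and line breaks used to group the displayed sequence.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Last displayed line of the gene sequence, used here only as a short input.
tail = "GAAAAGAGGA GATAA"
print(round(gc_content(tail), 1))
```

Passing the full gene sequence (all lines concatenated) to the same function gives the gene-wide percentage for comparison with the listed value.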
 
Protein sequence
MAKNLVIVES PTKARSISKM LGRNYKVMAT VGHLRDLPKS KFGVDIENNF EPEYIKVRGR 
AKTINELKKE AKKAENVYLA TDPDREGEAI SWHLQFLLDL DPEAKNRVEF HEITKENVKN
AIKNPRKIDQ NLVDAQQARR VMDRIVGYEI SPILWKRVKA GLSAGRVQSV ALKLIVDKQK
EIDDFVPEEY WTITAHHKEG REKFDSEFYG QINKKIKISN ENGADKVLNK IDKDKFEVVK
ITKTKKRRKP QKPYTTSTLQ QDASNRLGFS TRFTMQLAQQ LFEGIDVGDG SVGLITYMRT
DANRISKEIV GEALSYIKEK YGPEYAGKGN TYGGKKKGSQ DAHEAIRPTS IRRNPLEIKE
YLTDQQYKLY KMIWERVVAS QMTDYEFLST QVLFDNNSLI FKTNGKITLF EGFNKLGANK
ENENILPELK EGDVISAESI DKDQHFTKPP ARYTEASLVK TLEEFGIGRP STYSATINQI
ISRNYVELEG RSIFPTDLGK TVNTFLQENF DDVINVEFTR EMEDALDNIA EGDRFWKETL
KSFYKDFEKD MKGVKKDGKD YKVRDEILEE KCPKCGKPLA IKHGRNGKFI GCTGFPDCNF
TKSIVKSTGV KCPECEDGTI IEKVSKRGKR FYGCDNYPKC DFALWDPPTG EKCPECGSLL
IHKKNRSTDE IKCSSCDYVK EKRR
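
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of several permitted start codons and an initiating codon is read as methionine. The sketch below is a minimal illustration and not the database's own pipeline: it builds the codon table from the standard amino-acid assignments (which table 11 shares) and uses the start-codon set as listed for NCBI table 11. `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons as listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        codon = bases[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGCTAAGA ACCTAGTAAT AGTAGAGTCT"))  # MAKNLVIVES
```

The output matches the first ten residues of the protein sequence, and translating the final 15 bases (GAAAAGAGGA GATAA) yields EKRR followed by the TAA stop codon, which matches the end of the protein.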