Gene Apre_1822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1822 
Symbol 
ID8368730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013164 
Strand
Start bp79893 
End bp81614 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content32% 
IMG OID644984746 
ProductDNA topoisomerase type IA central domain protein 
Protein accessionYP_003142397 
Protein GI256821198 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAG TAATAGCAGA AAAACCAAGT GTAGCAGTTA CAATTGCAAA AGTAATTGGA 
GCAAGAACAA GAAAAAACGG ATATTATGAG GGAGGTGGAT ACATTGTTTC TTGGTGTGTT
GGTCATTTAA TTCAAATGGC AAGTCCAGAT AAGATAGACG AAAAATGGAA GAAATGGACA
ATAGAAAATC TTCCAATAAT CCCAGAAGAA TATATTTATG AAGTATCTAA AAGCACTAAA
AAGCAATATG GAGTTTTAAA GAAGCTTTTA AACGATAAGA ACATCGACAC AGTAATAAAT
GCTTGTGATG CTGGAAGAGA GGGAGAACTT ATTTTTAGGC TTGTATATAA TCAAGCTAAA
TGTAAGAAGA AAATTCAAAG ACTTTGGATA TCTTCAATGG AAAACAAAGC TATTGAAGAT
GGCTTTAGAA ATCTTAAAGA CGGAGAAAAC TTTGAAGACT TATATAGATC GGCAAGTGCA
AGAGCAATTG TGGATTGGCT GGTAGGAATG AATTTAAGTA GGCTTTATTC TTGCATTTAC
AAGGAAACAT ATTCAGTCGG TAGAGTACAA ACTCCAACTC TATATTTAAT AGCTAAAAGG
GATAGTGAAA TAAACCTGTT TAAGAAGCAA AAATATTATA CAGTTGACCT ATCTTATGGA
GGATTAAAAC TTGTATCAGA TAGGATTGAT AAAATTGAAG TTGCAGAGCA ACTTTTAAAC
TTGCTAGAAG ATGAAATAGT TATTACAGAG GTAGAAGATA AAGAAATAAG CACAAGACCA
GATAAACCTT ATGATCTCAC TACCTTACAA AGAGAAGCAA ACAAATATTT TGGATATTCA
GCAAATGACA CTTTAAACCT GGCACAAGGC TTGTATGAAA AAAAGCTAAT CACATATCCA
AGAACAGATA GTAGGCATTT AACCGATGAT ATGGTTAATA CTATGAAAGA ATTATTAGAA
GGATTTGAAG AAGATTTTAA AATCAACGAA TCAAACTTTA AGTCTATTTT TAATTCATCT
AAGGTTACAG ACCACTATGC GATTATTCCT ACTATATCAG GCATTGGAAA AGCCAAAGAT
TTATCTGATA AAGAAAGCAA AATCTATAAT CTAATTAAGA ATAAATTACT TGCTTCATGT
TCGGATAATT TAAAGGAATC TAGCAGAAAA ATCAGATATG AATATGATAA ATTTAACTTC
AATGCAAGTG GCAAGACTGT AATCGATGAG GGTTATACCA AGTATCTAAA GCCTTATGGA
AAGGAAAGAC AAGAAAATGA ATTACCAGAT GTAAAGACTG GAGATAAAAT TAAGCTAACT
TCTAAAAATA TATCGGAGAA ATTTACCAAA GCTCCAAGTC ATTATAATGA AGATACACTT
TTAAAGGCTA TGGAGAATGC AGGGATTAAA TCGCTGGATA AAGACATAGA AGTAGAAAGA
AAAGGCTTAG GAACACCAGC TACAAGAGCA GGAATTATTG AAAATCTTAT CCATAAGGAT
CTTATAAGAA GAGATAAGAA AAAATTACTT GTAACAGAAA AAGGCAATAG ACTTGTATCG
ATTGTAGAGG ATAAGTTTAA ATCAGCTGAA ACAACATCTG AATGGGAAAT GAAACTTGCA
AAGATAAGCT CAGGCGAAGT AGATAAAGAA GACTTTTTAA GAGAAATTGA AGATAGTATA
AGGGAGCTTG TAGACAGGTA CAAGAATAAT CTAAATGAAT AA
 
Protein sequence
MKLVIAEKPS VAVTIAKVIG ARTRKNGYYE GGGYIVSWCV GHLIQMASPD KIDEKWKKWT 
IENLPIIPEE YIYEVSKSTK KQYGVLKKLL NDKNIDTVIN ACDAGREGEL IFRLVYNQAK
CKKKIQRLWI SSMENKAIED GFRNLKDGEN FEDLYRSASA RAIVDWLVGM NLSRLYSCIY
KETYSVGRVQ TPTLYLIAKR DSEINLFKKQ KYYTVDLSYG GLKLVSDRID KIEVAEQLLN
LLEDEIVITE VEDKEISTRP DKPYDLTTLQ REANKYFGYS ANDTLNLAQG LYEKKLITYP
RTDSRHLTDD MVNTMKELLE GFEEDFKINE SNFKSIFNSS KVTDHYAIIP TISGIGKAKD
LSDKESKIYN LIKNKLLASC SDNLKESSRK IRYEYDKFNF NASGKTVIDE GYTKYLKPYG
KERQENELPD VKTGDKIKLT SKNISEKFTK APSHYNEDTL LKAMENAGIK SLDKDIEVER
KGLGTPATRA GIIENLIHKD LIRRDKKKLL VTEKGNRLVS IVEDKFKSAE TTSEWEMKLA
KISSGEVDKE DFLREIEDSI RELVDRYKNN LNE