Gene Apar_1093 details

Gene Information

Locus tag: Apar_1093
Symbol:
ID: 8413966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1236350
End bp: 1238863
Gene Length: 2514 bp
Protein Length: 837 aa
Translation table: 11
GC content: 49%
IMG OID: 645022682
Product: DNA topoisomerase type IA central domain protein
Protein accession: YP_003180112
Protein GI: 257784895
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01057] DNA topoisomerase I, archaeal
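
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1236350
END_BP = 1238863


def gene_length(start: int, end: int) -> int:
    """Length of an interval given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues for an unspliced CDS: codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 2514, matches "Gene Length"
print(protein_length(length))  # 837, matches "Protein Length"
```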


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATTTTAG TTGTTACCGA GAAAAACGAC GCAGCGCAGC AGATTGCTCG CCTTCTGTCT 
GAATCAGGTA AGCCTGAGGC AGACAAGGTA TACAACACTC CTGTTTATCG TTTTAGACGC
GGTGGCGAGG ATTATGTTGC TATTGGTCTT CGTGGTCATA TTCTGCAACC TGATTTTCCT
TTAGCTTTGA AGTTTGATAA GGAAGAGGGT TGGTACGGAG AAACTGCAGA AGGGGAGCAC
CTTCCTGCAG ACGTTCCAGA TGGTCTTGAG CGTCCTCCTT ACAAGGTTAA GCGCAAGCCT
TTTATGGCAG ATGGCGTTGA CCTTAAGGGC TGGAAAATTC CCAGCTTGCC GTATTTGATT
TGGGCTCCTG TAGATAAACA ACCTGCAGAA AAAGAGATCA TTCGAGCGCT CAAGAATCTT
GCTAAGAAAG CAGAGTCTGT CATCATCGGT ACAGACTTTG ACCGTGAGGG TGAGCTTATT
GGTTCTGATG CATTGGCACA GGTTCTCTCG GTTAATCCTA ACCTGGCGGT ATTTCGTGCT
CGCTACTCTG CTTTTACTAA GACAGAGATT GAGACTGCCT TTAATAACCT GGTAGATCTT
GACTATAACT TGGCTGCAGC AGGTGAGTCC AGACAGTACA TTGATCTTAT TTGGGGTGCG
GTATTAACCC GTTATTTGAC TACTGCACGC TTTGGTGGTC TGGGCAATAC GCGCTCTGCT
GGTCGTGTTC AGACTCCTAC GCTGGCACTT GTTGTTGAGC GTGAGCGTGA GCGTCTGGCG
TTTAAGCCAG AAGATTATTG GGTCATTTCT GGTGTTGCTT CTCCAGAAGA TGATGACGAA
GCAACGTTCA AGATGGTTCA CCAGACTGCA AGGTTCTGGG ATCAGGCTGA AGCAGATACA
GTTTATGACG TTGTGAAAGA TCAAACGCAG GCAACGGTTG TCGAGATTGA GTCTCGTAGC
AGAAAGCAGC AGCCACCAGC TCCATTTAAC ACTACATCGC TGCAAGCAGC TGCAGCTGCA
GAGGGTATCT CGCCAGCTAG AACTATGCGT TTGGCAGAGT CCTTGTACAT GGATGGCTTA
ATTTCGTATC CGCGTGTTGA TAACACGGTG TATCCAAGTT CTCTTGACCT GAAAGATTGT
GTGCGTACGT TATCTGCAGT TCCTCAGTAT GCACCTACCT GTAAAAAGCT TTTGGGTCAG
CCAAAGCTCC ATGCAACTCG TGGCAAGCAG GAGTCTACCG ACCACCCACC AATTTATCCA
ACGGCAGCGG CAAATCCAGA TTCTTTGCAG CCTGCAGCTT GGAAGCTTTA TAACCTTATT
GCTCGAAGGT TCTTAGCAAC CCTTATGGGT CCAGCTACTA TGAGTGGTAC CAAGATTACA
CTTGATGTTG CTGGTCAGCC TTTTGTTGCA AACGGAACGG TACTGGCAGA GCCAGGCTTT
AGAGAAATTT ATCCTTTTGG CCTTAAGAAG GACGATCAGA TTCCTGATGT AGGCGAGGGA
GAGATTGTCT CTATTAAGTC TTTATCGCTT GACGCTAAGC AGACAGAGCC ACCTGCTCGC
TATAGTCAGG GTAAGCTTAT TCAAGAGATG GAGAAGCGCG GTCTGGGTAC AAAGTCTACG
CGTGCTTCCA TCATTGATAC GCTTTATCAG CGTAAATACC TCAAGAATGA TCCCGTTGAG
CCAAGTCAGC TTGGTATGGC AATTGTCGAT GCTCTTTCGC AATTTGCACC ACACATTACC
TCGCCTGATA TGACGGCTGA GCTTGAATCC GATATGACTA GCGTTGCTGA GGGCAAAGAT
ACGCGTGATG GCGTTGTTAC GCACTCAAGA TCCCTGCTTG CGGGCATGAT TGATGTGCTC
ATTGATCATA AAGAAGATCT GTCTGAGGCC ATTGCGGATG CAGTCACTGC TGACGCAAAG
GTTGGAACCT GTCCTAAGTG TGGCAAGGAT TTGGTTGCCA AGAGCTCTCT AAAGACTCGT
GGAAGCTTTG TTGGCTGCAT GGGTTGGCCA GACTGTGATG TTACGTATCC ACTGCCTCAG
GGACGCGTTG AACCGCTTGA GGGAGAGGCT GGCGTCTGCC CAGAGTGTGG TGCTCCTCGT
GTAAAGGTTC AGCCGTTTAG GCAGCGTGCG TATGAGACTT GCATCAATCC AGCTTGTCCT
ACAAATTCGG AGCCAGATCT TATTGTGGGC GAGTGCCCAA CTTGCAAAGC AAATGGCAAG
CACGGAGACT TGGTGGCTCA TAAGTCAGAG CGCACGGGCA AGCGTTTTAT CCGCTGTACC
AACTACGAGG AGTGCGAAAC AAGTTATCCT CTACCTGCTC GTGGAAAGCT TGAGAAGACC
GATGAGGTTT GTCCAGACTG CGGCGCTCCT ATGGTAGTTG TTACTACTCA GCGTGGTCCT
TGGAAGCTGT GCCCAAATTA CGATTGCCCT GGCAAAGAAA AGAAAACTGC CCCGCGCAGA
GGTGCTCGTA AGAGTACTAA GAGCACCAAA TCCACCAAGG CAGCAAAAAA GTAA
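
The GC content field in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text with the space-separated 10-base blocks shown above. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage: paste the full gene sequence between the quotes and round the
# result to compare it with the "GC content" field.
example = "GGCC AATT"
print(gc_percent(example))  # 50.0
```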
 
Protein sequence
MILVVTEKND AAQQIARLLS ESGKPEADKV YNTPVYRFRR GGEDYVAIGL RGHILQPDFP 
LALKFDKEEG WYGETAEGEH LPADVPDGLE RPPYKVKRKP FMADGVDLKG WKIPSLPYLI
WAPVDKQPAE KEIIRALKNL AKKAESVIIG TDFDREGELI GSDALAQVLS VNPNLAVFRA
RYSAFTKTEI ETAFNNLVDL DYNLAAAGES RQYIDLIWGA VLTRYLTTAR FGGLGNTRSA
GRVQTPTLAL VVERERERLA FKPEDYWVIS GVASPEDDDE ATFKMVHQTA RFWDQAEADT
VYDVVKDQTQ ATVVEIESRS RKQQPPAPFN TTSLQAAAAA EGISPARTMR LAESLYMDGL
ISYPRVDNTV YPSSLDLKDC VRTLSAVPQY APTCKKLLGQ PKLHATRGKQ ESTDHPPIYP
TAAANPDSLQ PAAWKLYNLI ARRFLATLMG PATMSGTKIT LDVAGQPFVA NGTVLAEPGF
REIYPFGLKK DDQIPDVGEG EIVSIKSLSL DAKQTEPPAR YSQGKLIQEM EKRGLGTKST
RASIIDTLYQ RKYLKNDPVE PSQLGMAIVD ALSQFAPHIT SPDMTAELES DMTSVAEGKD
TRDGVVTHSR SLLAGMIDVL IDHKEDLSEA IADAVTADAK VGTCPKCGKD LVAKSSLKTR
GSFVGCMGWP DCDVTYPLPQ GRVEPLEGEA GVCPECGAPR VKVQPFRQRA YETCINPACP
TNSEPDLIVG ECPTCKANGK HGDLVAHKSE RTGKRFIRCT NYEECETSYP LPARGKLEKT
DEVCPDCGAP MVVVTTQRGP WKLCPNYDCP GKEKKTAPRR GARKSTKSTK STKAAKK
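
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the alternative start codons and is read as methionine when it initiates a CDS. The sketch below illustrates this on the first 30 bases of the gene sequence. The function name `translate_table11` is illustrative, and the codon table is built from the standard amino acid assignments, which table 11 shares.

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG order (shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_table11(sequence: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_table11("TTGATTTTAG TTGTTACCGA GAAAAACGAC"))  # MILVVTEKND
```

The output matches the first block of the protein sequence. Passing the full gene sequence should reproduce the whole protein if the record is internally consistent.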