Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BLD_1276 |
Symbol | topA |
ID | 6363902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bifidobacterium longum DJO10A |
Kingdom | Bacteria |
Replicon accession | NC_010816 |
Strand | - |
Start bp | 1469415 |
End bp | 1472501 |
Gene Length | 3087 bp |
Protein Length | 1028 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642680458 |
Product | topoisomerase I |
Protein accession | YP_001955219 |
Protein GI | 189440138 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACCA AACTGGTGAT TGTGGAGTCT CCGACCAAGG CCCACAAGAT CGGTGACTAC CTGGGCAAGG GGTACACCGT CATGGCGTCG GTCGGCCATA TCCGAGACCT CGCGCAGCCC AGCCAGGTGC CCGCGGCAGA CAAGGCCAAG TTCGGCAAGT TCGGCGTGGA CGTCAACGAC GGGTTCAAGC CGTATTACAT TGTCGACGGC GACAAGAAGC GTACGGTCAG CGAACTCAAA TCGGCCCTGA AGAACGCCGA TGAACTCTAT CTGGCGACTG ATGAGGATCG CGAGGGCGAG GCCATCGCGT GGCACTTGGT GCAGACCTTG AAACCCGAGG TGCCGGTCAA GCGCATGGTG TTCCACGAGA TCACCAAGAA CGCCATCAAC GCCTCCCTGA ACAACACGCG CGATGTGGAT GGCAATATGG TCGACGCACA GGAGACCCGC CGCATCCTCG ACCGTCTGTA CGGCTATGAG CTGTCGCCGG TGCTGTGGCG CAAGGTGGGC CCAGGCCTGT CCGCCGGCCG CGTGCAGTCC GTGGCCACCC GCCTGATCGT GGAACGCGAG CGCGAACGCA TGGCGTTCAA GCGCGCACCG TACTGGGATA TCGTCGCCAC GCTGTCCGCG CCTGACGCGC TGGGGGAGCG TGCCGAATTC TCCGCCCGCA TGATCGCGCT CGGCGGCAAG CGACTCGCCG GCTCCAAGGA TTTCGGTGCC GACGGCCAGC TGACCCCGGA CGGGTTCGCG GCGAACGTTC GCCAGCTTGA TGAGGCTGGT GCCACTGCCG TGGCTGAAGC GCTGAGGACC GCCGATTTCA CTGTGATGTC GATGGAAACC AAGCCGTACC GCCGCCGCCC GCAGCCGCCG TTCACCACCT CGACCTTGCA GCAGACCGCC GGCAATCGCC TCGGCATGGG TGCTCGCGTC GTGATGCGCG CGGCCCAAAG CCTGTACGAG AACGGCTACA TCACCTATAT GCGAACCGAT TCGGTGACGC TGTCGCAGGA GGCCATCACC GCCGCCCGCA ACGCCGTCTC CTACCATTTC GGAGACAAGT TCCTCTCCGC CGAGCCCAAG CAGTACGCCA CCAAAACCGC TGGCGCGCAG GAAGCCCACG AGTGTATTCG CCCGGCGGGC TCGCGCTTCC ACGACCCGGA CGAGCTGGCT TCCAAACTTC CCGGCGACCA GCTGCGCCTA TACACGCTGA TCTGGCAGCG CACGCTTGCC TGCCAGATGG CTGACGCCAC CGGTTCCACC GCCACCGTGC GCCTGTCTGC GCCGGCCGGC CCCACCGAGG GCGAGGCCGT GTTCCAGGCG TCCGGCACCG TCATCGAATT CCCCGGCTTT ATGAAGGCCA CGGGGGAGGG GCGCAAGCCC AAGGCCGCCG CTCCCGGCGC TGGGGCTGGC TCGGACCAGG CTGCTGCCGC CGGCAAGACG GACGCCAAGG CCGTCAGGGG TGACGCCTCC GAATCCAACA CCTCCCTGCC GCCGATGAGC GTCGGCCAGC AGGTCGAGGC AAGCGACATC GAACCGGACG GCCATGAAAC CCAGCCGCCG GCCCGCTACA CCGAGGCCAC GCTGGTCAAG ACGCTGGAAG CCAAGGAGAT CGGTCGCCCG TCCACTTACG CGAGCATTAT CTCCACAATC ATGGATCGCG GTTACGTATA CGAGCGTGGC CGTGCGCTGA TCCCCAGCTG GCTCGCCTTC TCCGTGACCA AGTTGCTCGA GACGAACTTC CCCAAGCTGG TCGACTACCA GTTCACCGCC GAGATGGAAA ACGGCCTTGA CCGCATCGCC CACGGCGAGG AATCCGGCCG CGACTGGCTC ACCCACTTCT ACTTCGGTTC CGGCGAGGGC GCCGCCCGCA ACGCCGACGA GGCGCACGAA GGCCTGCAAC AGCAGGTCGC GCAGCTCGGC GAGATCGACG CGCGCGCCAT CAACACCATC GACATCGGCG ACGGCCTGCA CGTGCGCGTC GGCCGTTACG GCCCGTATCT GGAAGACATG GAGCATCTGG ACGCCGAAGG CAACCCGAAG CGCGCGTCTC TGCCGGACAC CATCGCGCCG GATGAGCTGA CGGTGGCCGT GGCCCGCGAT TTGATCGACA ACCACTCCGG TGGACCGCGC GAGCTGGGCG TTGATCCGGT ATCCGGCGGT ACCGTCGAGG TGCGCAATGG GCGTTTCGGC CCGTATGTGG CGTTGGTGCC GCCGGCTGAG GCTTCGGCTG GCGCTGCTGG CGATACTGCT GGTGCTTCTG CGGCCAAGAA GGGTTCGAAG AAGGCCGCGG CCGCCGCCGC GTCGCGCCCG AAGATGGCCT CGTTGTTCAA GACGATGAGC CCTGAATCGC TGTCGCTGGA AGACGCGCTA AAGCTGCTGA GCCTGCCGCG CGAAGTGGGC ACATACGAGG AAACCAACGC CGAAACCGGC GAGGTATCCG AATGCACGGT GGCCGCCAAC AACGGTCGCT ACGGCCCCTA CCTGACCAAG ACGGGTGCCG ACGGCAAGTC GGAGACCCGC TCGCTGGCCT CGGAAGACGA GATCTTCACG GTCGATATCG ACAAGGCCAA AGAACTGTTC TCGCAGCCCA AGTACGGCCG CGGGCGTGGC CGTGGTGCCG CCAAGCCGCC GCTGCGTGAT TTGGGCAAGG ACCCGAACAC CGGCAAGAAC GTGACCATCA AGGACGGCCG TTTCGGCGCG TACATCACCG ATGGCGAGAC CAACCGTACG GTGCCGCGTC AGTACACGCC TGAATCCATC ACCCCTGATG ACGCCTTCCG ACTGCTCGCC GAAAAGCGTG CGGCTGGTCC CTCCACCCGT GGCCGTCGCG GTGCTGGGCG TGCTGGTGGC GCCAAGGCCG TTGCCGGCAA GGGCAAGAAG GGCGGTACCT CGGCTGCGGT TTCGGCGCAG GAGGCCAAGC GCGCCGAACG CCGTGCCGAA GTCAAGAAGT TGGCGAACAA GGGCTGGTCC AACCAGCGCA TCGCCGAAAA ACTCGGTTCC ACCCCCGCCA CTGTGAAAAA GGACGTCGAC TGGCTGACCG CCAACGAGGG CTACGAGCGC CCCGCCGTAA TTCCCAAGCG TGGCTGA
|
Protein sequence | MATKLVIVES PTKAHKIGDY LGKGYTVMAS VGHIRDLAQP SQVPAADKAK FGKFGVDVND GFKPYYIVDG DKKRTVSELK SALKNADELY LATDEDREGE AIAWHLVQTL KPEVPVKRMV FHEITKNAIN ASLNNTRDVD GNMVDAQETR RILDRLYGYE LSPVLWRKVG PGLSAGRVQS VATRLIVERE RERMAFKRAP YWDIVATLSA PDALGERAEF SARMIALGGK RLAGSKDFGA DGQLTPDGFA ANVRQLDEAG ATAVAEALRT ADFTVMSMET KPYRRRPQPP FTTSTLQQTA GNRLGMGARV VMRAAQSLYE NGYITYMRTD SVTLSQEAIT AARNAVSYHF GDKFLSAEPK QYATKTAGAQ EAHECIRPAG SRFHDPDELA SKLPGDQLRL YTLIWQRTLA CQMADATGST ATVRLSAPAG PTEGEAVFQA SGTVIEFPGF MKATGEGRKP KAAAPGAGAG SDQAAAAGKT DAKAVRGDAS ESNTSLPPMS VGQQVEASDI EPDGHETQPP ARYTEATLVK TLEAKEIGRP STYASIISTI MDRGYVYERG RALIPSWLAF SVTKLLETNF PKLVDYQFTA EMENGLDRIA HGEESGRDWL THFYFGSGEG AARNADEAHE GLQQQVAQLG EIDARAINTI DIGDGLHVRV GRYGPYLEDM EHLDAEGNPK RASLPDTIAP DELTVAVARD LIDNHSGGPR ELGVDPVSGG TVEVRNGRFG PYVALVPPAE ASAGAAGDTA GASAAKKGSK KAAAAAASRP KMASLFKTMS PESLSLEDAL KLLSLPREVG TYEETNAETG EVSECTVAAN NGRYGPYLTK TGADGKSETR SLASEDEIFT VDIDKAKELF SQPKYGRGRG RGAAKPPLRD LGKDPNTGKN VTIKDGRFGA YITDGETNRT VPRQYTPESI TPDDAFRLLA EKRAAGPSTR GRRGAGRAGG AKAVAGKGKK GGTSAAVSAQ EAKRAERRAE VKKLANKGWS NQRIAEKLGS TPATVKKDVD WLTANEGYER PAVIPKRG
|
| |