Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_4733 |
Symbol | topA |
ID | 5153124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 4959842 |
End bp | 4962592 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640559534 |
Product | DNA topoisomerase I |
Protein accession | YP_001240665 |
Protein GI | 148256080 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.767724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCG TCATCGTGGA GTCGCCTGCA AAGGCCAAGA CGATCAATAA ATATCTGGGC TCCTCCTACG AGGTTCTGGC CTCTTTTGGT CATGTCCGCG ACCTTCCGGC CAAGAACGGT TCGGTCGATC CGGACGAGAA TTTCCGGATG ATCTGGGAGG TCGACCCCAA GGCTGCCGGC CGACTCAACG ACATCGCCAA ATCGCTCAAG AATGCAGACC GACTGATTCT CGCCACTGAC CCCGATCGTG AGGGAGAGGC GATCTCCTGG CACGTGCTCG AAGTGTTGAA GGAGAAGCGC GCGCTCAAGG ATCAGAAGAT CGAGCGCGTG GTCTTCAACG CCATCACCAA GCAGGCCGTG TCGGAGGCGA TGAAGCATCC GCGCCAGATC GACGGCGCGC TGGTCGACGC CTATATGGCG CGCCGCGCCC TCGATTACCT CGTCGGCTTC ACGCTCTCCC CCGTGCTGTG GCGCAAGCTG CCGGGCGCGC GCTCGGCCGG GCGCGTGCAG TCAGTGGCAC TGCGCCTTGT GTGCGATCGC GAGCTCGAGA TCGAGAAGTT CGTTCCGCGC GAATATTGGT CGCTGATCGC CACGCTGGCG ACGCCGCGCG GCGACACGTT CGAAGCGCGG CTGGTCGGCG CCGACGGCAA GAAGATCCAG CGGCTCGACA TCGGCACCGG CGCGGAAGCC GAAGACTTCA AGCAGGCGCT GAACGCGGCG AGCTACACGG TCGCCACCGT CGATGCCAAG CCGGCGCGGC GCAATCCGCA GGCCCCGTTC ACCACCTCGA CGCTGCAGCA GGAGGCGAGC CGCAAGCTCG GCTTCGCGCC GGCGCACACG ATGCGCATTG CGCAGCGTCT CTATGAAGGC ATCGACATCG GCGGCGAGAC CACCGGTCTC ATCACCTATA TGCGAACCGA CGGCGTGCAG ATCGACGGCT CGGCGATCAC CCAGGCGCGC AAGGTGATCG GCGAGATCTA CGGCAACAAA TATGTGCCGG ACAGCCCGCG CCAGTACCAG ACCAAGGCCA AGAACGCCCA GGAAGCGCAC GAGGCGATCC GCCCGACCGA TTTGTCGCGC AAGCCCGCGG ACCTGCGCAA GCGGCTCGAT GACGACCAGG CCAAGCTCTA TGAGCTGATC TGGACCCGCA CCATCGCCAG CCAGATGGAG TCGGCCGAGC TCGAGCGCAC CACCGTCGAC ATCATCGCCA AGGCAGGCGG TCGCACGCTG GAACTGCGCG CCACCGGCCA GGTCGTCAAG TTCGACGGCT TCCTCGCGCT CTATCAGGAA GGCCGCGACG ACGAGGAGGA CGAGGACAGC CGCCGACTGC CGCAGATGAG CCCGAACGAG CCACTGAAGC GTCAGAACCT CGCGGTCACT CAGCACTTCA CCGAGCCGCC ACCACGCTTC TCGGAAGCCT CTCTGGTCAA GCGCATGGAG GAACTCGGCA TCGGCCGGCC CTCGACCTAC GCCTCGATCC TCGATGTGCT GAAAGCCCGC GGCTATGTGA AGCTCGAGAA GAAGCGGCTG CATGGCGAGG ACAAGGGTCG CGTCGTGATC GCGTTCCTGG AGAACTTCTT CCGCCGCTAT GTCGAGTACG ATTTCACGGC GGACTTGGAA GAGCAGCTCG ACCGCATCTC CAACAACGAG ATCGCCTGGC AGCAGGTGCT GAAGGATTTC TGGACCGGCT TCATCGGCGC CGTCGACGAC ATCAAGGATC TGCGCGTCGC GCAGGTGCTC GATGCGCTCG ATGAGATGCT GGGGCCGCAC ATCTATCCGC CGCGCGCCGA TGGTGGCGAC GTCCGGCAAT GCCCGACCTG CGGCACCGGA CGGCTCAACC TCAAGGCCGG CAAGTTCGGC GCCTTCGTCG GCTGCTCGAA CTATCCGGAA TGCCGCTACA CCCGCCCGCT CGCCGCCGAC AGCGAGGCCG CGGCCGATCG CATTCTCGGC CAGGACCCGG ATTCGGGCCT CGACGTCGCC GTCAAGGCCG GCCGCTTCGG CCCCTACATC CAGCTCGGCG AGCAGAAGGA CTATGCCGAG GGCGAGAAGC CCAAGCGCGC CGGCATTCCC AAGGGCACGC AGCCCTCGGA CGTCGATCTC GATCTCGCGC TGAAGCTGCT GTCGCTGCCG CGCGAGATCG GCAAGCATCC GGAGACCGGC CTGCCGATCA CCGCGGGCCT CGGCCGCTTC GGGCCGTTCG TGAAGCACGA CAAGACCTAT GCGAGCCTGG AGGCCGGCGA CGAGGTGTTC GATATCGGCC TCAACCGTGC GGTCACCCTG ATCGCCGAAA AGGTCGCGAA GGGCCCGAGC AAGCGCTTCG GCGCCGATCC CGGCAAGGCG CTGGGTGATC ATCCCTCGCT CGGCCCGGTC GCCGTGAAGG CCGGCCGCTA CGGCGCCTAT GTCACCGCTG GCGGCGTCAA CGCCACCATT CCCGGCGACA AGGAGAAGGA TACGATCACG CTCGCCGAGG CGATCGCGCT GCTCGACGAG CGCGCGGCCA AGGGCGGCGG CAAGGCCAAG GGCGCCAAAA AGGCTGCGAA GCCGGCCAAG GCGGCCGCGA AGCCGAAGGC CGGCGGCGAC GACGACTCGC CGAAGCCGGC AAAGAAGGCC GTCGCGAAGA AGGCTGCGAC GAAGCCGAAA TCAGAGTCCA CCAGCAAGGC GCGCGCGCCG GTTGCCAAGG CGGCCAAGAC ATCGGCCAAG CCTGCGACCA AGGCGACGCC CAAGAAGAGC GCCGGCAAGG CCCGCGGATG A
|
Protein sequence | MNIVIVESPA KAKTINKYLG SSYEVLASFG HVRDLPAKNG SVDPDENFRM IWEVDPKAAG RLNDIAKSLK NADRLILATD PDREGEAISW HVLEVLKEKR ALKDQKIERV VFNAITKQAV SEAMKHPRQI DGALVDAYMA RRALDYLVGF TLSPVLWRKL PGARSAGRVQ SVALRLVCDR ELEIEKFVPR EYWSLIATLA TPRGDTFEAR LVGADGKKIQ RLDIGTGAEA EDFKQALNAA SYTVATVDAK PARRNPQAPF TTSTLQQEAS RKLGFAPAHT MRIAQRLYEG IDIGGETTGL ITYMRTDGVQ IDGSAITQAR KVIGEIYGNK YVPDSPRQYQ TKAKNAQEAH EAIRPTDLSR KPADLRKRLD DDQAKLYELI WTRTIASQME SAELERTTVD IIAKAGGRTL ELRATGQVVK FDGFLALYQE GRDDEEDEDS RRLPQMSPNE PLKRQNLAVT QHFTEPPPRF SEASLVKRME ELGIGRPSTY ASILDVLKAR GYVKLEKKRL HGEDKGRVVI AFLENFFRRY VEYDFTADLE EQLDRISNNE IAWQQVLKDF WTGFIGAVDD IKDLRVAQVL DALDEMLGPH IYPPRADGGD VRQCPTCGTG RLNLKAGKFG AFVGCSNYPE CRYTRPLAAD SEAAADRILG QDPDSGLDVA VKAGRFGPYI QLGEQKDYAE GEKPKRAGIP KGTQPSDVDL DLALKLLSLP REIGKHPETG LPITAGLGRF GPFVKHDKTY ASLEAGDEVF DIGLNRAVTL IAEKVAKGPS KRFGADPGKA LGDHPSLGPV AVKAGRYGAY VTAGGVNATI PGDKEKDTIT LAEAIALLDE RAAKGGGKAK GAKKAAKPAK AAAKPKAGGD DDSPKPAKKA VAKKAATKPK SESTSKARAP VAKAAKTSAK PATKATPKKS AGKARG
|
| |