Gene Information |
Locus tag | Bind_3417 |
Symbol | |
ID | 6199005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | - |
Start bp | 3877727 |
End bp | 3880612 |
Gene Length | 2886 bp |
Protein Length | 961 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641707364 |
Product | DNA topoisomerase I |
Protein accession | YP_001834463 |
Protein GI | 182680317 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
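The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
# Values copied from the record above.
START_BP = 3877727
END_BP = 3880612


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, excluding the terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 2886
print(protein_length(length))  # 961
```

Under these assumptions the results agree with the Gene Length (2886 bp) and Protein Length (961 aa) fields of the record.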
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0414891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.668216 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTCG TCATCGTCGA ATCGCCGGCG AAGGCCAAGA CCATCAATAA ATATCTCGGC AAGGATTACG AAGTTTTTGC CTCGTTCGGT CATGTCCGCG ATTTGCCGCC CAAGGATGGT TCGGTCGATC CCGACCACGA CTTCGCCATG CTTTGGGATG TCGACACCAA ATCCGCCAAA CGGCTCGCCG ACATCGCCAA GGCGGTGAAG GAGGCCGACC GGGTGATTCT CGCCACCGAC CCTGACCGTG AGGGCGAGGC GATTTCCTGG CATGTGCTGG AAGTGCTCAA GGCCAAAAAG GTCCTGAAGG ATAAGCCGGT CGAGCGTGTG GTGTTCAATG CGATCACCCA ATCGGCGATT CTCGACGCCA TGCGGCATCC GCGCGCTATC GACATTGATC TCGTCGATGC CTATCTCGCG CGCCGGGCGC TTGATTATCT CGTCGGCTTC AATCTTTCGC CGGTGCTTTG GCGCAAATTG CCGGGGGCGC GTTCGGCTGG CCGCGTGCAA TCGGTCGCCT TGCGTCTCGT CTGCGAGCGC GAATTGGAGA TCGAACGCTT CGTTCCGCGG GAATATTGGT CGCTCACGGC CTTTCTGCGC ACGCCCGCTG ATCAGCCCTT TTCCGCGAAA CTCGTCGGCG CGGACGGCAA AAAGATCAAC CGGCTCGATA TTGGCGCGGG CGCCGAAGCC GAAGCCTTCA AGGCGGCGCT CGAAACGGCC AAATTCACGG TGGCCAAGGT CGAGGCCAAG CCCGCCAGAC GCAATCCCGC GCCTCCTTTC ACTACATCGA CCTTGCAACA GGAGGCGGCG CGCAAATTGG GGCTTGCACC AGCGCGCACC ATGCAATTGG CCCAGCGCCT TTACGAAGGC ATCGATCTCG ATGGCGAGAC GGTGGGCCTC ATCACTTATA TGCGAACCGA TGGCGTTGAT CTGGCACCGG AAGCGATCAC TGGCGCGCGG AAAGTGATTG CCGCCGAATA TGGCGATAAA TATGTGCCGC AGGCACCGCG CCGCTATCAG GTCAAGGCCA AGAATGCCCA GGAGGCGCAT GAGGCGATCC GACCGACCGA TCTTGCCCGC CTGCCGAAAC ATGTCGCTCG TTTTCTCGAT GCGGAGCAGG CGCGGCTTTA TGATTTGATC TGGACCCGCA CCATTGCGAG CCAAATGGAA TCGGCCGAAC TGGAGCGGAC CACGGTGGAT ATTCTGGCCG AGGTTGGTGC GCGCCGGCTC GATCTTCGCG CCACTGGGCA GGTGGTGCGT TTCGACGGAT TCCTGAAACT CTATCAGGAA GGCCGCGACG ACGAAGAGGA TGAAGAGGGT GGCCGCCTAC CGGCCATGCA GGTCGGCGAC CCCTTGAAAA AGGACAGGAT CGAGGCGAGC CAGCATTTCA CCGAGCCGCC ACCGCGCTTT ACCGAGGCGA CGCTCGTCAA GCGCATGGAG GAACTTGGCA TAGGCCGGCC CTCGACCTAT GCCTCGACGC TCGCCGTCCT GAAAGATCGC GAATATGTGC GGATCGACAA GAAACGGCTG ATTCCCGAGG ACAAGGGACG GCTCGTTACG GCTTTTCTCG AAAGCTTCTT TGGCCGCTAT GTCGGCTATG ATTTCACCGC CGATCTGGAA TCAAGCCTCG ACAAGATTTC CAATCACGAG ATCGATTGGA AACAGGTCCT GCGCGATTTC TGGGCCGATT TTTCTGGCGC CATCGCCGAC ACCAAGGATT TGCGCACCAC ACAAGTCCTC GATAGCCTCA ATGAAGTGCT CGGTCCCTAT ATTTTCCCCG ATAAGGGGGA TGGCTCCAAT 
CCGCGCGCCT GTCCTTCTTG CGCAAATGGC CAATTGTCGC TGAAGCTCGG CAAATTCGGT TCTTTCATCG GCTGTTCCAA TTATCCGGAG TGCAAATTCA CCCGCACTCT TTCGGATACG GGGCCGGAGG GAGGCAACGG CGAAACGGAT CGTCCGGGTG TCAAGGTGCT GGGGGTCGAT GCCGAAACAG GCGAGGAGAT TTCGCTGCGC GATGGGCGCT TTGGTGCCTA TGTGCAGCGC GGCGAAGGCG AAAAGCCCAA ACGCGCCTCC TTGCCCAAGA CGATCGCGCC GGCCGATCTG ACACTCGACA TGGCGCTCGG GCTTCTTTCC CTGCCGCGCG AGGTTGCGCG CCATCCCGAA ACCCATGAGC CGATTCTGGC GGGCATCGGC CGGTTTGGTC CCTATGTCCA GCATGGCAAG ACCTATGCGA ATATCGGCAA GGACGAGGAT ATTCTGACCC TTGGCGCCAA TCGCGCCATC GACCTCATCA TTGCCAAAGA AAGCGGGCTC ACCGGTCGCC GTTTCGGCAA AGGCGAATCC GCGCCTGCCC GTGTTTTGGG TGATCATCCC GAAGGGGGGC AGGTCACGAT CAAGGCCGGG CGCTTTGGTC CCTATGTCAA TTACGGCAAG CTCAACGCGA CCTTGCCGAA AGACGCCGAC CCCACCACAT TGACGCTGGA GGAAGGCTTG GCCTTGCTCG CCGCCAAGGC GAGTGGTCAA GGGGGAGGAA AAGGCGCGGT GCAGGGCCAA CTCCTCGGCG AGCACCCTTC GGGCGGTCCC ATTACCGTGC GGGAAGGCCG TTTTGGGCCT TATGTCAATC ACGGCAAGGT CAATGCGACC TTGAAATCCG GTCTCTCGCC GGAAACTTTG ACGCTCGAAG AGGCCATCCG CCTGATTGAC GAGAAGGCCG GAGCCGCATC CAAAAAGGCC CCTGCGAAGA AGGCGCCCGC AAAGAAAGCC TCTGGCAAGA CGACGACCAC CGAGAAAGCG CCCGCGAAAA AGGCTGCAAG CAAGGCAGCA ACGGCCAAAA CCACCAAGGC CAAAGCGGCG AAATCATCGG AGCCGGATGA AGAACCCCCT TTTTAA
|
Protein sequence | MNVVIVESPA KAKTINKYLG KDYEVFASFG HVRDLPPKDG SVDPDHDFAM LWDVDTKSAK RLADIAKAVK EADRVILATD PDREGEAISW HVLEVLKAKK VLKDKPVERV VFNAITQSAI LDAMRHPRAI DIDLVDAYLA RRALDYLVGF NLSPVLWRKL PGARSAGRVQ SVALRLVCER ELEIERFVPR EYWSLTAFLR TPADQPFSAK LVGADGKKIN RLDIGAGAEA EAFKAALETA KFTVAKVEAK PARRNPAPPF TTSTLQQEAA RKLGLAPART MQLAQRLYEG IDLDGETVGL ITYMRTDGVD LAPEAITGAR KVIAAEYGDK YVPQAPRRYQ VKAKNAQEAH EAIRPTDLAR LPKHVARFLD AEQARLYDLI WTRTIASQME SAELERTTVD ILAEVGARRL DLRATGQVVR FDGFLKLYQE GRDDEEDEEG GRLPAMQVGD PLKKDRIEAS QHFTEPPPRF TEATLVKRME ELGIGRPSTY ASTLAVLKDR EYVRIDKKRL IPEDKGRLVT AFLESFFGRY VGYDFTADLE SSLDKISNHE IDWKQVLRDF WADFSGAIAD TKDLRTTQVL DSLNEVLGPY IFPDKGDGSN PRACPSCANG QLSLKLGKFG SFIGCSNYPE CKFTRTLSDT GPEGGNGETD RPGVKVLGVD AETGEEISLR DGRFGAYVQR GEGEKPKRAS LPKTIAPADL TLDMALGLLS LPREVARHPE THEPILAGIG RFGPYVQHGK TYANIGKDED ILTLGANRAI DLIIAKESGL TGRRFGKGES APARVLGDHP EGGQVTIKAG RFGPYVNYGK LNATLPKDAD PTTLTLEEGL ALLAAKASGQ GGGKGAVQGQ LLGEHPSGGP ITVREGRFGP YVNHGKVNAT LKSGLSPETL TLEEAIRLID EKAGAASKKA PAKKAPAKKA SGKTTTTEKA PAKKAASKAA TAKTTKAKAA KSSEPDEEPP F
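The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that, plus a GC-fraction helper and a stop-codon check. It assumes the three stop codons of translation table 11 (TAA, TAG, TGA). The helper names are hypothetical, and only the first and last blocks of the gene sequence are used as sample input.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def ungroup(seq: str) -> str:
    """Remove the whitespace used to print the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = ungroup(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the last three bases form a stop codon."""
    return ungroup(dna)[-3:] in STOP_CODONS


# First two and last two blocks of the gene sequence in this record.
first_blocks = "ATGAATGTCG TCATCGTCGA"
last_blocks = "AGAACCCCCT TTTTAA"

print(len(ungroup(first_blocks)))   # 20
print(gc_fraction(first_blocks))    # 0.45
print(ends_with_stop(last_blocks))  # True
```

Applied to the full gene sequence, `gc_fraction` would be expected to land near the 60% reported in the GC content field, though that value is taken from the record rather than recomputed here.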
|
| |