Gene Information |
Locus tag | Francci3_4305 |
Symbol | |
ID | 3907273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 5139309 |
End bp | 5142458 |
Gene Length | 3150 bp |
Protein Length | 1049 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637881632 |
Product | DNA topoisomerase I |
Protein accession | YP_483380 |
Protein GI | 86742980 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
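The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical, introduced only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 5139309
END_BP = 5142458
GENE_LENGTH_BP = 3150
PROTEIN_LENGTH_AA = 1049


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 3150
print(protein_length(GENE_LENGTH_BP))  # 1049
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.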
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCAGCGC GCACGAAGAC GACGACACGG GCGACCGCGC GGTCATCCGC ACCGGTCGCC GAGCCCACCG AGGCCGGGCT CCCGCCGGAG ACCGGCTCGA GCGAGGCATC GGCCGACGTA CCCGCCGGCG CCACCAGCCG GACGGCGAAC GGGCGAACGG AGGCTGGCCA TTCGGCGAAC GGCTCAGGCG GAGCCACCGG CGCCCACGGC AGCCGCGCCA CCGGCTCCGG CCCGGGCAAC CGCCTGGTGA TCGTGGAATC GCCGGCGAAG GCGAAGACGA TCGCCGGCTA TCTCGGTCCG GGGTGGCAGG TCGAATCCAG TATCGGGCAC ATCCGTGACC TCCCGCGCAG TGCCGCCGAT GTCCCGGCCG CGCACAAGGG CAAGCCGTGG GCTCGGCTCG GCGTCGACGT CGACAACGAC TTCGAACCGC TCTACATCGT GACGCCGGAC AAGAAACCGC AGGTCAGCAA GCTCAAGGCG CTCGTGAAAG ACGCCAGCGA GCTCTACCTC GCGACGGACG AGGACCGCGA GGGCGAGGCC ATCGCCTGGC ATCTGCTCCA GACACTGAAG CCGACGGTGC CGGTCAAGCG GATGGTGTTC CACGAGATCA CCCCCCAGGC GATCCAGCGG GCCGTCGACA ATCCGCGCGA TATCGACAAG AACCTGGTCA ACGCTCAGGA AACGCGTCGC ATCCTGGACC GTCTCTACGG CTACGAGGTC TCTCCCGTGC TGTGGAAGAA GGTCATGCCG AGGCTGTCGG CGGGCCGGGT CCAGAGCGTG GCGACCCGCA TCCTGGTGGA GCGGGAACGT GCCCGGATGC GGTTCCACTC GGCGGAGTAC TGGAACATCG AGGGCCTGTT CGGGGCGACC GTCGCCCGCC AGGCGTGGTC GGGCGCGGAC GGCACACCGG GATCCGGGCC GGATGGACCC GGCGCCGGGT CCATCCGCGG CGACGGGGTC GAGAAGACCC CGCTCCCGGC AACCCTGATC GCCCTGAACG GCAACCGGAT CGCGACGGGT CGTGACTTCT CCCCGACGGG GCAGCTCGTC TCCTCCGGGG TGACGCGGCT GGACGAGGCC ACCGCCCGGT CGCTGGCCGA GCGGCTCGCC GACGCCGCAT TCACGGTGCG CTCGGTGGAG ACCAAGCCCT ACCGCCGTTC GCCGTACCCA CCGTTCATGA CCTCGACGTT GCAGCAGGAG GCCGGCCGCA AACTGCGCTT CTCCAGCCAG CGCACGATGC AGGTGGCGCA GCGGCTGTAC GAGAACGGCT ACATCACCTA CATGCGGACG GACTCGACGA ACCTGTCCAA GACCGCCCTG ACCGCGGCCC GCGCGCAGGC GGCGAGCCTG TACGGGCCGG AGTACGTGCC GGCACGCCCC CGCACGTACG CCAAGAAGGT CAAGAACGCG CAGGAGGCCC ACGAGGCCAT CCGACCCGCC GGGGACCACT TCCGGACCCC GGGTGAGGTT CGTGGCGAGC TCGACGTCGA CTCCTACCGG CTCTACGAGC TGATCTGGCA GCGGACGGTG GCCAGCCAGA TGGCGGACGC CCGCGGCACG AGCGCCACGA TCCGGCTCGG CGCGACCTCG TCCGCTGGGG AGGACGCCGA GTTCTCGGCA TCGGGGAAGG TCATCACCTT TCCCGGTTTC CTGCGCGCCT ACGTCGAGGG CGCCGACGAT CCAGACGCGG AACTGGAGGA CCGGGAGCGG CGGCTGCCGG ACGTACGGCA GGGCGACCCA CTGACCACCC GCTCGCTGAC CCCCCGCGGG CACACCACCA GTCCGCCGGC GCGCTTCACC 
GAGGCCAGCC TGGTCAAGAC GCTCGAGGAG CTCGGGATCG GTCGGCCGTC CACCTACGCC TCGATCATCG GCACCATCCA GGACCGCGGC TACGTGTGGA AGAAGGGCTC GGCCCTGGTG CCGAGCTTCG TCGCGTTCGC CGTCGTCGGC CTGTTGGAGG ACCACTTCAC CCGGCTGGTC GACTACCGGT TCACCGCGAC GATGGAGGAC GACCTCGACG ACATCGCCGC CGGCACGGCC GCCTCGACCG ACTGGCTGAC CCGCTTCTAC TTCGGAACCG GCGACGGCAC CGACCCGGCC GCCGACGGAC TGAAGCACCT GGTCAATGAG CGGCTCGGAG AGATCGACGC CCGCGAGGTG AACTCGATCC CCCTCGGCGA GACCGACGAC GGCACCCTGC TGGTCGTCCG GGTGGGCCGC TACGGCCCCT ACGTCCAGCA CGGGGAGCGC CGTGCCAGCG TGCCGGACGA TCTCGCACCG GACGAACTCA CCGTCGACAA AGCGCTGGGG CTGCTGGCCG CGCCCAGTGG CGACCGGATG CTGGGAATCG ACCCGGCGTC GGGGGCGACC ATCACGGCGA AGGCGGGTCG CTTCGGCCCC TACGTCACCA CCGACACTGA CCCGCCGCGC ACGTCCAGTC TGCTGCGTGG CATGTCGTTG GAGACGCTGA CCCTCGACGA CGCCGTGCGG TTGCTCACCC TTCCGCGCAT TCTGGGTGCG GGGGATGACG GGGAGGAGGT CACCGCCCAG AATGGCCGCT ATGGCCCATA CGTGAAGAAG GGTGCCGAAA GTCGGTCGTT GGAATCTGAG GATCAGTTGT TCACCGTGAC CCTGGACGAG GCCTTGGCGC TGCTCTCCCA GCCCAAGGCC CGCGGTCGGC GCCAGGCCGC GCAGAGCCCG CCGCTGCGCG AGCTCGGGAC CGACCCGGCC AGCGGCAAAC CGATGGTGGT GCGGGAGGGC CGGTTCGGCC CCTACGTCAC CGACGGTGAG ACCAACGCCA GTCTGCGCAA GGGCGACACG GTCGAAACGA TCACCGATGA GCGTGCCGCG GAGCTGCTGG CAGACCGTCG GGCCCGCGGA CCGGCCACCG CGAAGCGTCC TGCCCGGGGC ACCGCGAAAG CCGGCACCGC CAAGACCGGC CCGAAAACCA CCAAGGCCAA GCCGGATACG GCGAAGTCCG GCACCGCGAA GTCCGGCACC GCGAAGACCG GCACCGCGAA GACCGGCACC GCGAAGACCG GCACCGCCAG GTCCAAGACC GCCCGGACGG TGACGGACGA CGGCGGCGGC TCCGATGGCT CCGATGACTC CGGTAGCTCG TCGTCCAGTG GCACCCGCCG GACGGACTGA
|
Protein sequence | MPARTKTTTR ATARSSAPVA EPTEAGLPPE TGSSEASADV PAGATSRTAN GRTEAGHSAN GSGGATGAHG SRATGSGPGN RLVIVESPAK AKTIAGYLGP GWQVESSIGH IRDLPRSAAD VPAAHKGKPW ARLGVDVDND FEPLYIVTPD KKPQVSKLKA LVKDASELYL ATDEDREGEA IAWHLLQTLK PTVPVKRMVF HEITPQAIQR AVDNPRDIDK NLVNAQETRR ILDRLYGYEV SPVLWKKVMP RLSAGRVQSV ATRILVERER ARMRFHSAEY WNIEGLFGAT VARQAWSGAD GTPGSGPDGP GAGSIRGDGV EKTPLPATLI ALNGNRIATG RDFSPTGQLV SSGVTRLDEA TARSLAERLA DAAFTVRSVE TKPYRRSPYP PFMTSTLQQE AGRKLRFSSQ RTMQVAQRLY ENGYITYMRT DSTNLSKTAL TAARAQAASL YGPEYVPARP RTYAKKVKNA QEAHEAIRPA GDHFRTPGEV RGELDVDSYR LYELIWQRTV ASQMADARGT SATIRLGATS SAGEDAEFSA SGKVITFPGF LRAYVEGADD PDAELEDRER RLPDVRQGDP LTTRSLTPRG HTTSPPARFT EASLVKTLEE LGIGRPSTYA SIIGTIQDRG YVWKKGSALV PSFVAFAVVG LLEDHFTRLV DYRFTATMED DLDDIAAGTA ASTDWLTRFY FGTGDGTDPA ADGLKHLVNE RLGEIDAREV NSIPLGETDD GTLLVVRVGR YGPYVQHGER RASVPDDLAP DELTVDKALG LLAAPSGDRM LGIDPASGAT ITAKAGRFGP YVTTDTDPPR TSSLLRGMSL ETLTLDDAVR LLTLPRILGA GDDGEEVTAQ NGRYGPYVKK GAESRSLESE DQLFTVTLDE ALALLSQPKA RGRRQAAQSP PLRELGTDPA SGKPMVVREG RFGPYVTDGE TNASLRKGDT VETITDERAA ELLADRRARG PATAKRPARG TAKAGTAKTG PKTTKAKPDT AKSGTAKSGT AKTGTAKTGT AKTGTARSKT ARTVTDDGGG SDGSDDSGSS SSSGTRRTD
|
| |
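The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be stripped of whitespace before any statistic such as the GC content field is recomputed. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_content` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full 3150 bp, so its result is not the record's whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three 10-base blocks of the gene sequence, for illustration only.
prefix = "GTGCCAGCGC GCACGAAGAC GACGACACGG"
print(len(clean_sequence(prefix)))
print(f"{gc_content(prefix):.0%}")
```

Passing the complete Gene sequence field to `gc_content` in place of `prefix` would give the whole-gene fraction to compare against the GC content field.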