Gene Information |
Locus tag | Franean1_0309 |
Symbol | |
ID | 5668733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 366844 |
End bp | 369915 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239240 |
Product | DNA topoisomerase I |
Protein accession | YP_001504681 |
Protein GI | 158312173 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
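The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions are inferred from the numbers in this record, not stated by it. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length(366844, 369915)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

Under these assumptions the computed values agree with the Gene Length (3072 bp) and Protein Length (1023 aa) fields of the record.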
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.130268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.23063 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCACCGC GCACGAATTC GACGAGCCGG ACGACCGCGG GGTCGCCCAC GCCGGTCGCC GAGCCCACCA AGCCCGCGGC AGCGAGCGCC ACGGCGGCCA GCGCCACGGC AGCCAAGGCC GAAGCGGCCG AGACTGCCGA GGCCGGCGCC ACCGCAGGGC GCGCTTCGGG CGCCCGCGCC ACGGGCGGGC GCCGCCCGGC GCGGGCCACC ACGAACGGTA ACGGGACCCG TCTGGTGATC GTGGAGTCGC CGGCCAAGGC GAAGACGATC GCGGGCTACC TGGGCCCGGG GTGGCAGGTG GAGTCGAGCA TCGGCCACAT CCGCGACCTG CCGCGCAGCG CCGCCGACGT GCCGACCGCG CACAAGGGCA AGCCGTGGGC CCGGCTCGGG GTCGACGTCG ACAACGGCTT CGAGCCCCTC TACGTCGTCA GCCCGGATAA GAAGGTCCAG GTCAGCAAGC TCAAGTCGCT CGTCAAGGAC GCCAGCGAGC TCTACCTGGC GACAGACGAG GACCGCGAGG GCGAGGCGAT CGCCTGGCAC CTCCTGCAGA CTCTGAAGCC GACCGTCCCG GTCAAGCGGA TGGTCTTCCA CGAGATCACC CCGCAGGCCA TCCAGCGTGC GGTCGACAGC CCGCGGGAGA TCAACGAGAA CCTGGTCAAC GCCCAGGAGA CCCGGCGGAT CCTGGACCGG CTCTACGGCT ACGAGGTCTC CCCCGTGCTG TGGAAGAAGG TCATGCCCAA GCTCTCGGCA GGGCGGGTCC AGAGCGTGGC GACCCGCATC CTCGTCGAGC GGGAGCGGGC GCGGATGCGG TTCCGCACCG CGGAGTACTG GAACATCGAG GGCGTCTTCC AGCAGAACGT CGCGCACGAC GGCGCCGCCC TCGACACGAC CCCGCTCCCG GCGACCCTCG TCGCGCTGGA CGGGCGGCGC CTGGCCAGCG GCCGCGACTT CGCCCCGACC GGCGAGCTGA CGTCCGACGG GGTCGCACTG CTCGACGAGG CCGGTGCCCG CGCGCTGGCC GGGCGGCTCA CCGGTGCCGC GTTCGCGGTG CGGTCGGTCG AGACCAAGCC GTACCGGCGC TCGCCCTACC CGCCGTTCAT GACCTCCACG CTCCAGCAGG AGGCCGGCCG CAAACTGCGG TTCTCCAGCC AGCGCACGAT GCAGGTCGCG CAGCGCCTCT ACGAGAACGG CTACATCACC TACATGCGGA CGGACTCGAC GAACCTGTCC GAGACCGCCC TGGTCGCCGC CCGGGACCAG GCCCGCACCC TCTACGGCGC CGAGTACGTC CCGGACCGGC CCCGCGTCTA CGCCAAGAAG GTCAAGAACG CCCAGGAGGC CCACGAGGCG ATCCGGCCCG CCGGGGATCA CTTCCGGACC CCGGGTGAGG TGCGCTCGGA GCTCGACGGT GACTCGTTCC GCCTGTACGA GCTGATCTGG CAGCGCACGG TGGCCAGCCA GATGGCCGAC GCGCGCGGCA CCAGCGCCAC CATCCGCCTG GGCGCGACCT CCAGCTCCGG GGAGGACGCA GAGTTCTCCG CCTCCGGCAA GGTGATCACC TTCCCCGGGT TCCTGCGCGC GTACGTCGAG GGCGCCGACG ACCCGGACGC CGAGCTCGAG GACCGCGAGC GGCGGCTGCC CGACGTCCGG CGGGGGGACC CGCTGGCCAC TCGCACGCTC ACCCCGCGTG GCCACACGAC CAGCCCGCCG CCGCGGTTCA CCGAGGCCAG CCTGGTCAAG ACGCTGGAGG AGCTGGGGAT CGGCCGGCCG TCCACCTACG CGTCGATCAT CGGCACGATC 
CAGGACCGCG GCTACGTGTG GAAGAAGGGG TCCGCGCTGG TCCCGAGCTT CGTCGCGTTC GCGGTGGTCG GGCTGTTGGA GGACCACTTC ACCCGGCTCG TGGACTACCA GTTCACGGCC TCGATGGAGG ACGACCTCGA CGCGATCGCC GCTGGTACGG CCGCGTCCAC GGACTGGCTC ACCGGGTTCT ACTTCGGTCT GCCCGACACG ACCGACACCG GCGGTTCCGG CGCGGTCGAG GGCCTCAAGC ACCTGGTCGG CGAGCGGCTC GGGGAGATCG ACGCCCGCGA GGTCAACTCC ATCCCGCTGG GCAAGGCGGA CGACGGCGAG CCTGTCGTCG TGCGGGTCGG CCGTTACGGG CCCTATGTCC AGCACGCCGA CGGGCGTGCC AGCGTCCCGG ACGAGGTCGC TCCCGACGAG CTGACCGTGG AGCGCGCGCT CGAACTGCTG GCCGCGCCCA GCGGCGACCG TCTGCTCGGC ACGGACCCGA AGACGGGTGC GTCGATCACC GCGAAGGCCG GCCGCTACGG CCCGTACGTG ACGACGGACA GCGAGCCGCC GCAGACCGCG AGCCTGCTGC GCACCATGTC GTTGGAAACC GTGACCCTCG AGGACGCGCT GCGGCTGCTG ACGCTCCCCC GCGTCCTCGG CACCGACGCG GAAGGCGCGG AGGTCACCGC CCAGAACGGG CGGTACGGCC CCTATGTGAA GAGGGGCGCC GACAGCCGTT CGCTGGAGTC CGAGGACCAG TTGTTCACGG TGACGCTGGA CGAGGCGCTC GCGCTGCTCG CGCAGCCGAA GGCCCGCGGT CGGCGCCAAG CGGCGCAGAC GCCGCCGCTG CGTGAGCTCG GGCCCGACCC CGCCACCGAG CGCCCGATGG TCCTGCGCGA GGGCCGGTTC GGCCCGTACG TGACCGACGG CGAGACCAAC GCCAGCCTGC GCAAGGGCGA CGCGGTCGAG ACCATCACGG TCGAGCGTGC CGCCGAGCTC CTCGCGGACC GCCGAGCCCG CGGCACGACC ACGCCGCGCC GGACCACGAA GACCACGGCC AAGGCGCCCG CGAAGGCCAC AGCCAAGCCC CGGACGGCCG CGAAGACCAC CACGAAGGCC AAGACCGCGG GCAAGACGTC GGGCGGCACG GCGAAGTCCG GCTCCCGCGC GTCGAAGTCC GCCGCCAGCG ACGCCGGGGC GACCGGCACC GCCGCGGGCG ACGCGTCCGG CACCGACAGC GCCACCGGAG CAACGTCCGG TGGCTCGCAG CGGTCGAGCT GA
|
Protein sequence | MPPRTNSTSR TTAGSPTPVA EPTKPAAASA TAASATAAKA EAAETAEAGA TAGRASGARA TGGRRPARAT TNGNGTRLVI VESPAKAKTI AGYLGPGWQV ESSIGHIRDL PRSAADVPTA HKGKPWARLG VDVDNGFEPL YVVSPDKKVQ VSKLKSLVKD ASELYLATDE DREGEAIAWH LLQTLKPTVP VKRMVFHEIT PQAIQRAVDS PREINENLVN AQETRRILDR LYGYEVSPVL WKKVMPKLSA GRVQSVATRI LVERERARMR FRTAEYWNIE GVFQQNVAHD GAALDTTPLP ATLVALDGRR LASGRDFAPT GELTSDGVAL LDEAGARALA GRLTGAAFAV RSVETKPYRR SPYPPFMTST LQQEAGRKLR FSSQRTMQVA QRLYENGYIT YMRTDSTNLS ETALVAARDQ ARTLYGAEYV PDRPRVYAKK VKNAQEAHEA IRPAGDHFRT PGEVRSELDG DSFRLYELIW QRTVASQMAD ARGTSATIRL GATSSSGEDA EFSASGKVIT FPGFLRAYVE GADDPDAELE DRERRLPDVR RGDPLATRTL TPRGHTTSPP PRFTEASLVK TLEELGIGRP STYASIIGTI QDRGYVWKKG SALVPSFVAF AVVGLLEDHF TRLVDYQFTA SMEDDLDAIA AGTAASTDWL TGFYFGLPDT TDTGGSGAVE GLKHLVGERL GEIDAREVNS IPLGKADDGE PVVVRVGRYG PYVQHADGRA SVPDEVAPDE LTVERALELL AAPSGDRLLG TDPKTGASIT AKAGRYGPYV TTDSEPPQTA SLLRTMSLET VTLEDALRLL TLPRVLGTDA EGAEVTAQNG RYGPYVKRGA DSRSLESEDQ LFTVTLDEAL ALLAQPKARG RRQAAQTPPL RELGPDPATE RPMVLREGRF GPYVTDGETN ASLRKGDAVE TITVERAAEL LADRRARGTT TPRRTTKTTA KAPAKATAKP RTAAKTTTKA KTAGKTSGGT AKSGSRASKS AASDAGATGT AAGDASGTDS ATGATSGGSQ RSS
|
| |
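The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with GTG and the protein begins with M, even though the gene lies on the minus strand of the replicon). Translation table 11 uses the same amino acid assignments as the standard code but allows alternative initiation codons; here only ATG, GTG and TTG are treated as initiators, which is a simplification. The names `translate_cds` and `gc_fraction` are hypothetical helpers, not part of any database API.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; it differs only in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplified initiator set (assumption): an initiator in first position reads as M.
START_CODONS = ("ATG", "GTG", "TTG")


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    first_30 = "GTGCCACCGC GCACGAATTC GACGAGCCGG"  # first three blocks of the gene
    print(translate_cds(first_30))  # MPPRTNSTSR
    print(gc_fraction(first_30))    # 0.7
```

The first thirty bases translate to MPPRTNSTSR, matching the start of the protein sequence in the record. Running the same functions on the full gene sequence (with both lines joined) would be the way to check the complete protein and the reported GC content.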