Gene Franean1_0309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0309 
Symbol 
ID5668733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp366844 
End bp369915 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content73% 
IMG OID641239240 
ProductDNA topoisomerase I 
Protein accessionYP_001504681 
Protein GI158312173 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.130268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.23063 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCACCGC GCACGAATTC GACGAGCCGG ACGACCGCGG GGTCGCCCAC GCCGGTCGCC 
GAGCCCACCA AGCCCGCGGC AGCGAGCGCC ACGGCGGCCA GCGCCACGGC AGCCAAGGCC
GAAGCGGCCG AGACTGCCGA GGCCGGCGCC ACCGCAGGGC GCGCTTCGGG CGCCCGCGCC
ACGGGCGGGC GCCGCCCGGC GCGGGCCACC ACGAACGGTA ACGGGACCCG TCTGGTGATC
GTGGAGTCGC CGGCCAAGGC GAAGACGATC GCGGGCTACC TGGGCCCGGG GTGGCAGGTG
GAGTCGAGCA TCGGCCACAT CCGCGACCTG CCGCGCAGCG CCGCCGACGT GCCGACCGCG
CACAAGGGCA AGCCGTGGGC CCGGCTCGGG GTCGACGTCG ACAACGGCTT CGAGCCCCTC
TACGTCGTCA GCCCGGATAA GAAGGTCCAG GTCAGCAAGC TCAAGTCGCT CGTCAAGGAC
GCCAGCGAGC TCTACCTGGC GACAGACGAG GACCGCGAGG GCGAGGCGAT CGCCTGGCAC
CTCCTGCAGA CTCTGAAGCC GACCGTCCCG GTCAAGCGGA TGGTCTTCCA CGAGATCACC
CCGCAGGCCA TCCAGCGTGC GGTCGACAGC CCGCGGGAGA TCAACGAGAA CCTGGTCAAC
GCCCAGGAGA CCCGGCGGAT CCTGGACCGG CTCTACGGCT ACGAGGTCTC CCCCGTGCTG
TGGAAGAAGG TCATGCCCAA GCTCTCGGCA GGGCGGGTCC AGAGCGTGGC GACCCGCATC
CTCGTCGAGC GGGAGCGGGC GCGGATGCGG TTCCGCACCG CGGAGTACTG GAACATCGAG
GGCGTCTTCC AGCAGAACGT CGCGCACGAC GGCGCCGCCC TCGACACGAC CCCGCTCCCG
GCGACCCTCG TCGCGCTGGA CGGGCGGCGC CTGGCCAGCG GCCGCGACTT CGCCCCGACC
GGCGAGCTGA CGTCCGACGG GGTCGCACTG CTCGACGAGG CCGGTGCCCG CGCGCTGGCC
GGGCGGCTCA CCGGTGCCGC GTTCGCGGTG CGGTCGGTCG AGACCAAGCC GTACCGGCGC
TCGCCCTACC CGCCGTTCAT GACCTCCACG CTCCAGCAGG AGGCCGGCCG CAAACTGCGG
TTCTCCAGCC AGCGCACGAT GCAGGTCGCG CAGCGCCTCT ACGAGAACGG CTACATCACC
TACATGCGGA CGGACTCGAC GAACCTGTCC GAGACCGCCC TGGTCGCCGC CCGGGACCAG
GCCCGCACCC TCTACGGCGC CGAGTACGTC CCGGACCGGC CCCGCGTCTA CGCCAAGAAG
GTCAAGAACG CCCAGGAGGC CCACGAGGCG ATCCGGCCCG CCGGGGATCA CTTCCGGACC
CCGGGTGAGG TGCGCTCGGA GCTCGACGGT GACTCGTTCC GCCTGTACGA GCTGATCTGG
CAGCGCACGG TGGCCAGCCA GATGGCCGAC GCGCGCGGCA CCAGCGCCAC CATCCGCCTG
GGCGCGACCT CCAGCTCCGG GGAGGACGCA GAGTTCTCCG CCTCCGGCAA GGTGATCACC
TTCCCCGGGT TCCTGCGCGC GTACGTCGAG GGCGCCGACG ACCCGGACGC CGAGCTCGAG
GACCGCGAGC GGCGGCTGCC CGACGTCCGG CGGGGGGACC CGCTGGCCAC TCGCACGCTC
ACCCCGCGTG GCCACACGAC CAGCCCGCCG CCGCGGTTCA CCGAGGCCAG CCTGGTCAAG
ACGCTGGAGG AGCTGGGGAT CGGCCGGCCG TCCACCTACG CGTCGATCAT CGGCACGATC
CAGGACCGCG GCTACGTGTG GAAGAAGGGG TCCGCGCTGG TCCCGAGCTT CGTCGCGTTC
GCGGTGGTCG GGCTGTTGGA GGACCACTTC ACCCGGCTCG TGGACTACCA GTTCACGGCC
TCGATGGAGG ACGACCTCGA CGCGATCGCC GCTGGTACGG CCGCGTCCAC GGACTGGCTC
ACCGGGTTCT ACTTCGGTCT GCCCGACACG ACCGACACCG GCGGTTCCGG CGCGGTCGAG
GGCCTCAAGC ACCTGGTCGG CGAGCGGCTC GGGGAGATCG ACGCCCGCGA GGTCAACTCC
ATCCCGCTGG GCAAGGCGGA CGACGGCGAG CCTGTCGTCG TGCGGGTCGG CCGTTACGGG
CCCTATGTCC AGCACGCCGA CGGGCGTGCC AGCGTCCCGG ACGAGGTCGC TCCCGACGAG
CTGACCGTGG AGCGCGCGCT CGAACTGCTG GCCGCGCCCA GCGGCGACCG TCTGCTCGGC
ACGGACCCGA AGACGGGTGC GTCGATCACC GCGAAGGCCG GCCGCTACGG CCCGTACGTG
ACGACGGACA GCGAGCCGCC GCAGACCGCG AGCCTGCTGC GCACCATGTC GTTGGAAACC
GTGACCCTCG AGGACGCGCT GCGGCTGCTG ACGCTCCCCC GCGTCCTCGG CACCGACGCG
GAAGGCGCGG AGGTCACCGC CCAGAACGGG CGGTACGGCC CCTATGTGAA GAGGGGCGCC
GACAGCCGTT CGCTGGAGTC CGAGGACCAG TTGTTCACGG TGACGCTGGA CGAGGCGCTC
GCGCTGCTCG CGCAGCCGAA GGCCCGCGGT CGGCGCCAAG CGGCGCAGAC GCCGCCGCTG
CGTGAGCTCG GGCCCGACCC CGCCACCGAG CGCCCGATGG TCCTGCGCGA GGGCCGGTTC
GGCCCGTACG TGACCGACGG CGAGACCAAC GCCAGCCTGC GCAAGGGCGA CGCGGTCGAG
ACCATCACGG TCGAGCGTGC CGCCGAGCTC CTCGCGGACC GCCGAGCCCG CGGCACGACC
ACGCCGCGCC GGACCACGAA GACCACGGCC AAGGCGCCCG CGAAGGCCAC AGCCAAGCCC
CGGACGGCCG CGAAGACCAC CACGAAGGCC AAGACCGCGG GCAAGACGTC GGGCGGCACG
GCGAAGTCCG GCTCCCGCGC GTCGAAGTCC GCCGCCAGCG ACGCCGGGGC GACCGGCACC
GCCGCGGGCG ACGCGTCCGG CACCGACAGC GCCACCGGAG CAACGTCCGG TGGCTCGCAG
CGGTCGAGCT GA
 
Protein sequence
MPPRTNSTSR TTAGSPTPVA EPTKPAAASA TAASATAAKA EAAETAEAGA TAGRASGARA 
TGGRRPARAT TNGNGTRLVI VESPAKAKTI AGYLGPGWQV ESSIGHIRDL PRSAADVPTA
HKGKPWARLG VDVDNGFEPL YVVSPDKKVQ VSKLKSLVKD ASELYLATDE DREGEAIAWH
LLQTLKPTVP VKRMVFHEIT PQAIQRAVDS PREINENLVN AQETRRILDR LYGYEVSPVL
WKKVMPKLSA GRVQSVATRI LVERERARMR FRTAEYWNIE GVFQQNVAHD GAALDTTPLP
ATLVALDGRR LASGRDFAPT GELTSDGVAL LDEAGARALA GRLTGAAFAV RSVETKPYRR
SPYPPFMTST LQQEAGRKLR FSSQRTMQVA QRLYENGYIT YMRTDSTNLS ETALVAARDQ
ARTLYGAEYV PDRPRVYAKK VKNAQEAHEA IRPAGDHFRT PGEVRSELDG DSFRLYELIW
QRTVASQMAD ARGTSATIRL GATSSSGEDA EFSASGKVIT FPGFLRAYVE GADDPDAELE
DRERRLPDVR RGDPLATRTL TPRGHTTSPP PRFTEASLVK TLEELGIGRP STYASIIGTI
QDRGYVWKKG SALVPSFVAF AVVGLLEDHF TRLVDYQFTA SMEDDLDAIA AGTAASTDWL
TGFYFGLPDT TDTGGSGAVE GLKHLVGERL GEIDAREVNS IPLGKADDGE PVVVRVGRYG
PYVQHADGRA SVPDEVAPDE LTVERALELL AAPSGDRLLG TDPKTGASIT AKAGRYGPYV
TTDSEPPQTA SLLRTMSLET VTLEDALRLL TLPRVLGTDA EGAEVTAQNG RYGPYVKRGA
DSRSLESEDQ LFTVTLDEAL ALLAQPKARG RRQAAQTPPL RELGPDPATE RPMVLREGRF
GPYVTDGETN ASLRKGDAVE TITVERAAEL LADRRARGTT TPRRTTKTTA KAPAKATAKP
RTAAKTTTKA KTAGKTSGGT AKSGSRASKS AASDAGATGT AAGDASGTDS ATGATSGGSQ
RSS