Gene Francci3_4305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4305 
Symbol 
ID3907273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5139309 
End bp5142458 
Gene Length3150 bp 
Protein Length1049 aa 
Translation table11 
GC content70% 
IMG OID637881632 
ProductDNA topoisomerase I 
Protein accessionYP_483380 
Protein GI86742980 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCAGCGC GCACGAAGAC GACGACACGG GCGACCGCGC GGTCATCCGC ACCGGTCGCC 
GAGCCCACCG AGGCCGGGCT CCCGCCGGAG ACCGGCTCGA GCGAGGCATC GGCCGACGTA
CCCGCCGGCG CCACCAGCCG GACGGCGAAC GGGCGAACGG AGGCTGGCCA TTCGGCGAAC
GGCTCAGGCG GAGCCACCGG CGCCCACGGC AGCCGCGCCA CCGGCTCCGG CCCGGGCAAC
CGCCTGGTGA TCGTGGAATC GCCGGCGAAG GCGAAGACGA TCGCCGGCTA TCTCGGTCCG
GGGTGGCAGG TCGAATCCAG TATCGGGCAC ATCCGTGACC TCCCGCGCAG TGCCGCCGAT
GTCCCGGCCG CGCACAAGGG CAAGCCGTGG GCTCGGCTCG GCGTCGACGT CGACAACGAC
TTCGAACCGC TCTACATCGT GACGCCGGAC AAGAAACCGC AGGTCAGCAA GCTCAAGGCG
CTCGTGAAAG ACGCCAGCGA GCTCTACCTC GCGACGGACG AGGACCGCGA GGGCGAGGCC
ATCGCCTGGC ATCTGCTCCA GACACTGAAG CCGACGGTGC CGGTCAAGCG GATGGTGTTC
CACGAGATCA CCCCCCAGGC GATCCAGCGG GCCGTCGACA ATCCGCGCGA TATCGACAAG
AACCTGGTCA ACGCTCAGGA AACGCGTCGC ATCCTGGACC GTCTCTACGG CTACGAGGTC
TCTCCCGTGC TGTGGAAGAA GGTCATGCCG AGGCTGTCGG CGGGCCGGGT CCAGAGCGTG
GCGACCCGCA TCCTGGTGGA GCGGGAACGT GCCCGGATGC GGTTCCACTC GGCGGAGTAC
TGGAACATCG AGGGCCTGTT CGGGGCGACC GTCGCCCGCC AGGCGTGGTC GGGCGCGGAC
GGCACACCGG GATCCGGGCC GGATGGACCC GGCGCCGGGT CCATCCGCGG CGACGGGGTC
GAGAAGACCC CGCTCCCGGC AACCCTGATC GCCCTGAACG GCAACCGGAT CGCGACGGGT
CGTGACTTCT CCCCGACGGG GCAGCTCGTC TCCTCCGGGG TGACGCGGCT GGACGAGGCC
ACCGCCCGGT CGCTGGCCGA GCGGCTCGCC GACGCCGCAT TCACGGTGCG CTCGGTGGAG
ACCAAGCCCT ACCGCCGTTC GCCGTACCCA CCGTTCATGA CCTCGACGTT GCAGCAGGAG
GCCGGCCGCA AACTGCGCTT CTCCAGCCAG CGCACGATGC AGGTGGCGCA GCGGCTGTAC
GAGAACGGCT ACATCACCTA CATGCGGACG GACTCGACGA ACCTGTCCAA GACCGCCCTG
ACCGCGGCCC GCGCGCAGGC GGCGAGCCTG TACGGGCCGG AGTACGTGCC GGCACGCCCC
CGCACGTACG CCAAGAAGGT CAAGAACGCG CAGGAGGCCC ACGAGGCCAT CCGACCCGCC
GGGGACCACT TCCGGACCCC GGGTGAGGTT CGTGGCGAGC TCGACGTCGA CTCCTACCGG
CTCTACGAGC TGATCTGGCA GCGGACGGTG GCCAGCCAGA TGGCGGACGC CCGCGGCACG
AGCGCCACGA TCCGGCTCGG CGCGACCTCG TCCGCTGGGG AGGACGCCGA GTTCTCGGCA
TCGGGGAAGG TCATCACCTT TCCCGGTTTC CTGCGCGCCT ACGTCGAGGG CGCCGACGAT
CCAGACGCGG AACTGGAGGA CCGGGAGCGG CGGCTGCCGG ACGTACGGCA GGGCGACCCA
CTGACCACCC GCTCGCTGAC CCCCCGCGGG CACACCACCA GTCCGCCGGC GCGCTTCACC
GAGGCCAGCC TGGTCAAGAC GCTCGAGGAG CTCGGGATCG GTCGGCCGTC CACCTACGCC
TCGATCATCG GCACCATCCA GGACCGCGGC TACGTGTGGA AGAAGGGCTC GGCCCTGGTG
CCGAGCTTCG TCGCGTTCGC CGTCGTCGGC CTGTTGGAGG ACCACTTCAC CCGGCTGGTC
GACTACCGGT TCACCGCGAC GATGGAGGAC GACCTCGACG ACATCGCCGC CGGCACGGCC
GCCTCGACCG ACTGGCTGAC CCGCTTCTAC TTCGGAACCG GCGACGGCAC CGACCCGGCC
GCCGACGGAC TGAAGCACCT GGTCAATGAG CGGCTCGGAG AGATCGACGC CCGCGAGGTG
AACTCGATCC CCCTCGGCGA GACCGACGAC GGCACCCTGC TGGTCGTCCG GGTGGGCCGC
TACGGCCCCT ACGTCCAGCA CGGGGAGCGC CGTGCCAGCG TGCCGGACGA TCTCGCACCG
GACGAACTCA CCGTCGACAA AGCGCTGGGG CTGCTGGCCG CGCCCAGTGG CGACCGGATG
CTGGGAATCG ACCCGGCGTC GGGGGCGACC ATCACGGCGA AGGCGGGTCG CTTCGGCCCC
TACGTCACCA CCGACACTGA CCCGCCGCGC ACGTCCAGTC TGCTGCGTGG CATGTCGTTG
GAGACGCTGA CCCTCGACGA CGCCGTGCGG TTGCTCACCC TTCCGCGCAT TCTGGGTGCG
GGGGATGACG GGGAGGAGGT CACCGCCCAG AATGGCCGCT ATGGCCCATA CGTGAAGAAG
GGTGCCGAAA GTCGGTCGTT GGAATCTGAG GATCAGTTGT TCACCGTGAC CCTGGACGAG
GCCTTGGCGC TGCTCTCCCA GCCCAAGGCC CGCGGTCGGC GCCAGGCCGC GCAGAGCCCG
CCGCTGCGCG AGCTCGGGAC CGACCCGGCC AGCGGCAAAC CGATGGTGGT GCGGGAGGGC
CGGTTCGGCC CCTACGTCAC CGACGGTGAG ACCAACGCCA GTCTGCGCAA GGGCGACACG
GTCGAAACGA TCACCGATGA GCGTGCCGCG GAGCTGCTGG CAGACCGTCG GGCCCGCGGA
CCGGCCACCG CGAAGCGTCC TGCCCGGGGC ACCGCGAAAG CCGGCACCGC CAAGACCGGC
CCGAAAACCA CCAAGGCCAA GCCGGATACG GCGAAGTCCG GCACCGCGAA GTCCGGCACC
GCGAAGACCG GCACCGCGAA GACCGGCACC GCGAAGACCG GCACCGCCAG GTCCAAGACC
GCCCGGACGG TGACGGACGA CGGCGGCGGC TCCGATGGCT CCGATGACTC CGGTAGCTCG
TCGTCCAGTG GCACCCGCCG GACGGACTGA
 
Protein sequence
MPARTKTTTR ATARSSAPVA EPTEAGLPPE TGSSEASADV PAGATSRTAN GRTEAGHSAN 
GSGGATGAHG SRATGSGPGN RLVIVESPAK AKTIAGYLGP GWQVESSIGH IRDLPRSAAD
VPAAHKGKPW ARLGVDVDND FEPLYIVTPD KKPQVSKLKA LVKDASELYL ATDEDREGEA
IAWHLLQTLK PTVPVKRMVF HEITPQAIQR AVDNPRDIDK NLVNAQETRR ILDRLYGYEV
SPVLWKKVMP RLSAGRVQSV ATRILVERER ARMRFHSAEY WNIEGLFGAT VARQAWSGAD
GTPGSGPDGP GAGSIRGDGV EKTPLPATLI ALNGNRIATG RDFSPTGQLV SSGVTRLDEA
TARSLAERLA DAAFTVRSVE TKPYRRSPYP PFMTSTLQQE AGRKLRFSSQ RTMQVAQRLY
ENGYITYMRT DSTNLSKTAL TAARAQAASL YGPEYVPARP RTYAKKVKNA QEAHEAIRPA
GDHFRTPGEV RGELDVDSYR LYELIWQRTV ASQMADARGT SATIRLGATS SAGEDAEFSA
SGKVITFPGF LRAYVEGADD PDAELEDRER RLPDVRQGDP LTTRSLTPRG HTTSPPARFT
EASLVKTLEE LGIGRPSTYA SIIGTIQDRG YVWKKGSALV PSFVAFAVVG LLEDHFTRLV
DYRFTATMED DLDDIAAGTA ASTDWLTRFY FGTGDGTDPA ADGLKHLVNE RLGEIDAREV
NSIPLGETDD GTLLVVRVGR YGPYVQHGER RASVPDDLAP DELTVDKALG LLAAPSGDRM
LGIDPASGAT ITAKAGRFGP YVTTDTDPPR TSSLLRGMSL ETLTLDDAVR LLTLPRILGA
GDDGEEVTAQ NGRYGPYVKK GAESRSLESE DQLFTVTLDE ALALLSQPKA RGRRQAAQSP
PLRELGTDPA SGKPMVVREG RFGPYVTDGE TNASLRKGDT VETITDERAA ELLADRRARG
PATAKRPARG TAKAGTAKTG PKTTKAKPDT AKSGTAKSGT AKTGTAKTGT AKTGTARSKT
ARTVTDDGGG SDGSDDSGSS SSSGTRRTD