Gene Francci3_3448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3448 
Symbol 
ID3905688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4102317 
End bp4106384 
Gene Length4068 bp 
Protein Length1355 aa 
Translation table11 
GC content73% 
IMG OID637880771 
ProductATP-dependent helicase HrpA 
Protein accessionYP_482531 
Protein GI86742131 
COG category[L] Replication, recombination and repair 
COG ID[COG1643] HrpA-like helicases 
TIGRFAM ID[TIGR01967] ATP-dependent helicase HrpA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0350076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCTCCTC GCGACGCCCA TCGGCTGGGC CGACGGCTCG CCGAGAGCCG GCGCACCCGC 
GAGCCCGCCG CGCGCCAGCG GGCCCTACAG GCCATCGCCG CCGAGGTGGA CCGGGCATCG
CTACGGCTGG AGCGGCGCCG GGCCAGCGTT CCGACGCTCG ACTACCCCGA CATCCTTCCG
GTCACCCAGC GCAAGGACGA GATTCTCGCC GCGATCCGGG ATCATCAGGT GGTCGTCGTC
GCCGGGGAGA CCGGGTCCGG CAAGACCACC CAGCTACCGA AGATCTGCCT GGAGCTCGGC
CGCGGGGTCC GGGCGATGAT CGGGCACACC CAGCCGCGCC GAATCGCCGC GCGTACCGTC
GCCGACCGCA TCGCGGAGGA ACTGCGCACC CCGGCTCCCC AGATGGGCGG CGTCGTCGGC
TACCAGACCC GGTTCACCGA CCAGGTGCAC GAGAACACGC TGGTCAAGCT CATGACGGAC
GGCATCCTGC TGGCCGAGAT CTCCTCCGAT CGCCAGCTGC GCCGCTACGA CACGCTCATC
ATCGACGAGG CGCACGAGCG CAGCCTCAAC ATCGACTTTA TCCTCGGCTA CCTGCGGTCG
CTGCTCCCCC GCCGGCCGGA TCTCAAGATT GTCATCACCT CCGCGACGAT CGAGACGGCC
CGGTTCTCCG CCCACTTCGC GGGTGCCCCG GTCATCGAGG TGTCCGGGCG CACCTATCCG
GTCGAGGTCC GCTACCGCCC GCTCGTCCCG GCGGCCAGCG GCCCAGCCGG CGGACCAGCC
AGCGGACCAG CCAGCGGCCC GGCCGACGCG CGGAGGACCG GCAGGGAGAA CGGCGAGGCC
GAGCGCGACC AGACCCAGGC CATCAGCGAG GCGGTCGACG AGCTGTGCGC GGAGGGCCCG
GGCGACATCC TGGTGTTCCT CAGCGGCGAA CGGGAGATCC GCGACACCGC CGAGGCACTC
ACCCGCGAGC AGCGGCCGAA CACCGAGATC GTGCCGCTCT ACGCCCGCCT GTCCGCCGGC
GAGCAGCACC GGGTGTTCCA GCCGCACACC GGACGGCGGG TGGTCCTGGC CACCAACGTC
GCCGAGACCT CGCTGACCGT GCCGGGGATC CATTATGTGA TCGACCCGGG CACCGCCCGG
ATCTCCCGCT ACAGCCACCG CACCAAGGTG CAGCGGCTGC CCATCGAACC GATCTCGCAG
GCCTCCGCCA ACCAGCGCAA GGGGCGCTGC GGGCGGACCG CCGACGGCAT CTGCATCCGC
CTCTACAGCG AGGAGGACTT CGCCGGCCGG CCGGAGTTCA CCGATCCGGA GATCCTGCGG
ACGAACCTGG CCTCGGTCAT CCTGCGGATG GCCGACCTTG GCCTCGGCGA GATGGCCACG
TTCGGGTTCC TCGATCCACC CGACCCCCGC CAGATCAGCG ACGGCGAACT GCTCCTCGCC
GAGTTGGGGG CGTTCGACGC CACCGCGTCC GATCCCCGTC ACCGGATCAC TCCCCTCGGC
CGGCGGCTGG CGCAGATCCC GGTGGATCCC CGGCTGGCCC GCATGGTGCT CGCCGCCGAC
GAGCAGGGCT GCCTGCGCGA GGTGCTCGTC ATCGCCGCCG CGCTCGCGAT CCAGGATCCC
CGGGAGCGCC CGGTCGAGCA CCAGCAGGCG GCCGACGCCC GCCACGCCCG GTTCGCGGAC
CCGACCTCGG ACTTCCTGGC GTACCTGAAC CTGTGGAACT ACCTGCGCGA CGCCCGCGGC
GAGCTGTCGG CCAACCAGTT CCGGCGGATG TGCCGCACGG AGTTCCTCAA CTACCTGCGC
ATCCGGGAAT GGCAGGACGT GCACGGCCAG CTCGCCGCGG TGGTCCGCGG CCTGGGGCTC
ACCCCCCGGG ACGACAGCTC CGGCGCGGCC GATCCACGAA CCGTGCACCG GGCCCTGCTG
ACCGGGCTGC TGTCGCACAT CGGTCGTTAC GACCCGGAGC GCCGGGAGTA CGCCGGCGCC
CGCGGCGGCC GCTTCGCGCT CTGGCCGGGC TCCGTCCTGG CCCGGCGGTC GAACCGTGCC
GAGCGCACCG GGCCGACAGC CACGTCGGCG AACCCGGCAG CCGCGCTGGC GGACCCGGGG
GACCCGGCGG GCGAGGACGC GCCGAAGCGG CCGTCCGGAC CACCCGCCTG GGTGATGGCC
GCCGAGCTGG TGGAGACCTC CCGGCTGTGG GGCCGCACCG CCGCGCGGAT CGACCCGGAC
TGGATCGAGC CGCTCGCCGC CCACCTGGTG CATCGCACGT ACAGCGAACC GCGCTGGTCG
CGCCGGCAGG GCGCGGTGCT CGCCGACGAG AAGGTGACTC TGTACGGGGT GACGATCGTC
GCCTCCCGGC CGGTCCAGTA CAGCCGGATC GACCCGGTCC TGTGCCGGGA GCTGTTTCTC
CGGCACGCAC TGGTCGAGGG TGACTGGCAG ACCCGCCACA CTTTCTTCCA CGCCAACCGG
GAGCTGCTCG CCGACGTCGA GGAGCTGGAG CACCGGGCCC GGCGGCGCGA CATCGTCGTC
GACGACGAGA CGCTGTTCCA TTTCTACGAC GAGCGCGTCC CCGCGGACAT CGTGTCCGCC
CGGCACTTCG ACGCCTGGTG GCGCAAGGCG CGCCGGACGA CCCCGGACCT GCTCGACTTC
CCGCGCTCGA TGCTCGTCAC CGCGGACGCC ACGGGAATCA CCGAGGCCGA CTACCCCGAC
GTCTGGCAGG CGGGCGACCT CGCGCTGCCG CTGAGCTACC AGTTCGAACC GGGCTCGGCG
GCCGACGGGG TCACCGTGCA CATCCCGCTG GCCGTGCTGA ACCAGGTCGG CGCCGAGGGC
TTTGAATGGC AGGTGCCGGG GCTGCGCGAG GAGCTCGTCA CCGCCCTCAT CCGGGCGCTG
CCGAAGGCGG TGCGGCGCAG CTTCGTCCCG GCGCCGAACT ACGCCAAGGC GGTCCTGGCG
AACATCACGC CACGCCAGGC GCCCCTGCTC ACCGCGGTGG AACACGAGCT GCGCCGGATG
GGCGGGCCGG AGATCCCGCG CGATTCCTGG TCGCTGGCGG GTGTGCCGGA TCACCTGCGG
TTCACCTTCC GGGTGGAGGA CGCCGGCGGG CGGGTACTGG CCGAGGGCAA GGACCTCGAC
GCCATCAAGG AGCGGCTGCG CCCGAGGACC CGGGAGGCCG TCGCTGCCGC GGCGGACGGC
CTCGAGCGGG CCGGCCTGCG GGCGTGGGGG GACCTCGGCA CCCTGCCGAA GGTCGTCGAG
CTGCGCCGCG GCGGCAACGT GGCGGGCGGC CACGTGGTGA AGGCGTTCCC GGCGCTGGTC
GACGAGGGCG GTTCGGTCGC GGTGCGAGCC GCCGACACCG AGGCCGAACA GCGCCAGCTG
ATGTGGGCCG GCACCCGGCG CCTGGTGCTG CTGGGCGTCC CCTCCCCGGT GCGCGGCCTC
AACGCCCGGC TGTCGAACGC CGCGAAGCTG GCCCTGAGCC ACAACCCGCA CCGCGACGCC
GCCGACCTGC TGGACGACTG TGTGCGGGCC GCCGCCGACC GGTTGATCGC CGCGGCCGGC
GGGCCCGCCT GGGACGAAGC GGGCTTCACC GCGCTGCTCG CCGCGGTACG CGCGGGACTG
CCCGAGGCCG CCTTCGAGGT GGTCCGCGAG GTTCAGCAGG TGCTCGGCCT CGCCCACGCG
GTCGATCTGG CGCTGCGCGA GCTGCGTGCC CCGGCGGTGG CGGCGTCGGT GGCCGACGCG
CGTGACCAGC TCATCTCGTT GATCTACCGG GGATTCGTCA CCGACACCGG GGCCGACCGG
CTGGCGGACC TCGTCCGCTA CCTGACCGCG CTGGAACGGC GACTGGAGCG GCTGCCCCGC
GATCCCGGGC GGGACAGGCT CAACACCGCG ACCGTCGGGC GCGTCCAGGA CGCCTATCGG
GAGCTGCTCG CCACCGTCCC CGCCGGCCGG GAGCCGGCGC CCGAGATCCG GCGCCTCCGC
TGGATGATCG AGGAGCTGCG GGTGAGCCTG TTCGCGCAGA GCCTGCGCAC CCCGTACCCG
GTTTCCGAGG AACGCGTCTA CCGGGCGATC GACGCCATCC TCGGTTAA
 
Protein sequence
MSPRDAHRLG RRLAESRRTR EPAARQRALQ AIAAEVDRAS LRLERRRASV PTLDYPDILP 
VTQRKDEILA AIRDHQVVVV AGETGSGKTT QLPKICLELG RGVRAMIGHT QPRRIAARTV
ADRIAEELRT PAPQMGGVVG YQTRFTDQVH ENTLVKLMTD GILLAEISSD RQLRRYDTLI
IDEAHERSLN IDFILGYLRS LLPRRPDLKI VITSATIETA RFSAHFAGAP VIEVSGRTYP
VEVRYRPLVP AASGPAGGPA SGPASGPADA RRTGRENGEA ERDQTQAISE AVDELCAEGP
GDILVFLSGE REIRDTAEAL TREQRPNTEI VPLYARLSAG EQHRVFQPHT GRRVVLATNV
AETSLTVPGI HYVIDPGTAR ISRYSHRTKV QRLPIEPISQ ASANQRKGRC GRTADGICIR
LYSEEDFAGR PEFTDPEILR TNLASVILRM ADLGLGEMAT FGFLDPPDPR QISDGELLLA
ELGAFDATAS DPRHRITPLG RRLAQIPVDP RLARMVLAAD EQGCLREVLV IAAALAIQDP
RERPVEHQQA ADARHARFAD PTSDFLAYLN LWNYLRDARG ELSANQFRRM CRTEFLNYLR
IREWQDVHGQ LAAVVRGLGL TPRDDSSGAA DPRTVHRALL TGLLSHIGRY DPERREYAGA
RGGRFALWPG SVLARRSNRA ERTGPTATSA NPAAALADPG DPAGEDAPKR PSGPPAWVMA
AELVETSRLW GRTAARIDPD WIEPLAAHLV HRTYSEPRWS RRQGAVLADE KVTLYGVTIV
ASRPVQYSRI DPVLCRELFL RHALVEGDWQ TRHTFFHANR ELLADVEELE HRARRRDIVV
DDETLFHFYD ERVPADIVSA RHFDAWWRKA RRTTPDLLDF PRSMLVTADA TGITEADYPD
VWQAGDLALP LSYQFEPGSA ADGVTVHIPL AVLNQVGAEG FEWQVPGLRE ELVTALIRAL
PKAVRRSFVP APNYAKAVLA NITPRQAPLL TAVEHELRRM GGPEIPRDSW SLAGVPDHLR
FTFRVEDAGG RVLAEGKDLD AIKERLRPRT REAVAAAADG LERAGLRAWG DLGTLPKVVE
LRRGGNVAGG HVVKAFPALV DEGGSVAVRA ADTEAEQRQL MWAGTRRLVL LGVPSPVRGL
NARLSNAAKL ALSHNPHRDA ADLLDDCVRA AADRLIAAAG GPAWDEAGFT ALLAAVRAGL
PEAAFEVVRE VQQVLGLAHA VDLALRELRA PAVAASVADA RDQLISLIYR GFVTDTGADR
LADLVRYLTA LERRLERLPR DPGRDRLNTA TVGRVQDAYR ELLATVPAGR EPAPEIRRLR
WMIEELRVSL FAQSLRTPYP VSEERVYRAI DAILG