Gene Franean1_5422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5422 
Symbol 
ID5673753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6553194 
End bp6559868 
Gene Length6675 bp 
Protein Length2224 aa 
Translation table11 
GC content75% 
IMG OID641244277 
Producthelicase superfamily protein 
Protein accessionYP_001509683 
Protein GI158317175 
COG category[R] General function prediction only 
COG ID[COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.383128 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTC CGCTGCTGCC GTCGTTGCAG GCGGCTGACC TTCGCCGTGC CCTGACGGAC 
TACCTGGCGA CCACCTTCGC GTTGACCGAC GACGACGTCC GCGCCGCGCT GACGACGTTC
CTCGGTGACC CGGAGGAGGG CATTTTCCGC GGCCCGTATG CCCGGCTGCG GCTGCCGTTC
CGGCCGGCGG ACGGGACCTG GATGGTTCCG CTGGACTGGT GGCCGAAGGA CTTCGTCCCG
TACGTGCATC AGGCGCAGGC GTTCGAGCGG TTGTCGTCGA AGCACGGCCG GCCGCGGCCG
ACGATCGTCA CGACGGGTAC CGGGTCGGGC AAGACGGAGG CGTTCCTCGT CCCGATCATC
GATCATGCGC TGCGGATGCG GCAGCAGGGC CAGCGCGGGA TGAAAGCGCT GATCCTCTAT
CCGATGAACG CGCTGGCGAA CGACCAGGCC GGCCGGCTCG CCGAGATGCT GACCGGTGAC
CCGCGTCTGG GCGGCCTGAC CGCCGGCATC TACACCGGTG AGCAGCAGGG CCAGCGCACG
CAGGTCAGCC AGGCGGGTCT GATCACCTCC CGGGAGGTGA TGCGCTCCGA TCCGCCGGAC
ATCCTGCTGA CGAACTACAA GATGCTCGAC CACCTGCTGC TGCGCGCGAA CGACCGCGGC
CTGTGGGAGG GCGCGCAGGC GTCTCTCACC TATGTGTGCC TGGACGAGTT CCACACCTAC
GACGGCGCGC AGGGCACCGA TGTGGCGATG CTGCTGCGCC GGCTCGGGTC GATGCTGGGC
GTGGCACGCC CCGGCGCTCC GCTGGGTGAC GTGGTGCCGG TGGCCACCTC CGCGACCCTC
GGCGCTGGCA GCGGCGCAAG CAGTGGCACC GAGGACACGA CCAACGCAGC CGAGGCCACA
GCGGAGGACG AGGACGAGAG CGAGGAAGGC GGCGCGGGCG AGCCGTTCGC CGCGATGCGG
GCGTTCGCCG AGACCGTTTT CGGGCGCCCG TTCGAGGCGG ACGCGGTGGT CGTCGAGCAG
CGGACCGCGC CGGACGACTG GCGCCGCGAG CACAGCCGTC CGACACCTGG TGTGGTGATC
CGGCGGCTGC CGGCCACCCG GTTGCTCGCC GAGACCGTCC GCGAGGCCGT CCCGGAGGGC
CCAGAGCAGG TGGCGGACGT GCTGCTCGGC GCGCTGGTAG CCCGGGACGT CGACGGGTCG
GAGTTCTCCG CCGCCGAGAT CCGCCGGTTG CGCCGCGGTG ACCGGCTTCC CGCCGTCCTG
GCCGCGCATC CGCTCACCGA GGGCCTGCTC ACCGCGGCGC TGCACCCGAA ATCGGTACGG
GATCTCGCGA AGAAGGTGCT GCCCGACGAC ACCAGCCCGG GGCGCGCGGA CGGGATGGCT
GTCGTCGAGG CCTACCTCGG CCTGCTCTCC CAGGCGCGGG CGCGGGCGAA CGCCGAGCCG
GCGGCGGGAC GCCGGCTGCT GGGGGTGGAC ACGCAGCTGT GGGTCCGGGA GGTGACTCGC
ATCGACCGTC GGGTGTCGCC CACACCGGCG TTCCGCTGGT CGGATGATGG GGCGCACGCC
GACCTCGACC TGTACCTGCC GGCGCTGTAC TGCCGCCACT GCGGCCGGGC CGGGTGGGGG
GCGCTGCGCA CCCTGGTGGG GCGGGCGATC GACGCCACGG AGACGACCGT CCGGCGGGCG
TCTCGCGCCG GTGACCAGCG CTTCCGTGCA CTGCTGCACG CCCCGGGCGA GGCCGCCGCC
GTGATCGCCG ACGCCACCGA CGCCCCTGAG ACAACCGAGA CAACCGAGAC GACTGAAGCG
ACGTCCGTCG AGGGTCTGCT GTGGCTGAAC ACGTCGACGC TGATCCTCAC GACGCACCCG
GCGACCGACG GCGAGCCGGA TGAGAGCGCC CACATCCCGG TGCTGACGGC CTGGGAGGAC
GACACCGCGG CGCGGGCGCA GACCTGCCCG TCCTGCCGCC AGCCGGACGG CATCCGCTTC
CTCGGCAGCG GGGTGTCGAC GCTGACGTCG GTGTCGCTGT CGGCGCTGTT CGGCTCGGCG
TATCTGGACA CCGCGGAGAA GAAGACGCTG CTGTTCACCG ACAGTGTGCA GGACGCCGCG
CACACCGCCG GGTTCGTCCA GGCGCGGTCG CATGCGCTGA GCCTGCGGGC GGCGCTGTTC
GACACGATCG ACAGCCGCGG GCCGCTCGCC GTCGCCGACC TGGCCCGGGA GGCGATGGAC
CGGGCCGGTG ACGACCCGGT GCGCCGGTAC CGGCTGGTGC CGCCGGCACT CGCCCAGTGG
GCGGGGTTCC AGCGCTACTG GCAGCCGGCG GGCGGGGCGC ATGAGCGCGC GCAGGCGCAG
CGGCTCGTCG AGCGGCGGCT GGCGTTCGAC GCGGCGCTGG AGTTCGGGCT CAACGCCCGC
ACCGGGCGCA CCCTGGAGCT GACCGGCGCG GCCGTCGCCG AGGTCGACAC CGGCGGTGCG
GACGCGCTGG CGCGGGTGGC GGCCAGGGCG GTGGCGCGGG CCCGCCAGCA GCATTCGCTG
TCGGCGGAGA ACGACGACGC GCTCGGCGCG GCGCTCGACG ACGCCCGGCT GGCGGCGTGG
GCACGCGGGG TGCTCGAACG GGTCCGCGTT CAGGGCGGGA TCTGGCATCG CTGGCTGGAG
TCGTTCGTCG AGCATGACGG CAACCGCTGG TTCCTCACCG GGGGCCGGCG GGCGGCACGC
GGCGAGGGGC TGCCTGCGTT CCCGCCGGGC CGGCCCGCGC CGACGTTCCC GACGACGGGA
CGGACGTCGA CACCGGCGGC GACATTCGAC AGCATCACCG GGGCGTCGTC CTGGTATGCC
CGCTGGGCGC AGCGGCAGTT GCGGCTCACG GCACGCGACG GCGGGCTGGT GACCCGGGCG
CTGCTCGACG AGATGGCGGA CGCCGGCTGG CTGTCGCGGC GGCGCACCGA GACGAACGCG
ACGGTGTTCC TGCTCGACCC GGCGCAGGTC GTGGTGAGCG TGGCCGACCC CGAGCTGCTC
GCGGGCGGGG TGCTCGCGCT GCGCTGCGAC GTGTGCCAGA CGCTCACCCC GGGCACCCGG
CGCACCGTCG ACGACCTGGC GGACGCCGCC TGCCTGCGCC AGGCGTGCCC GGGGCGGCTG
CGCCGGGCCC GGCGCGGCGA CGACTACTAC CGGCATCTGT ACCGCAGTGT CGACATGCGC
CGGGTGGTGG CGCACGAGCA CACCTCGCTG CTGCCGGACG ACGTCCGCCT CGACGTGGAG
CGGCAGTTCC GCGAGGGGGG CGGGCCGGCC GTCCCGAACG TGCTGGCGGC GACGCCGACG
CTGGAGCTGG GCATCGACAT CGGTGACCTG TCGACGGTGA TGCTCGGGTC GCTGCCGCGG
ACCGTCGCCT CCTATGTGCA GCGGGTCGGG CGGGCCGGGC GGCTGACGGG GAACGCGCTG
GTCCTGGCCT ATGTGAAGGG CCGGGGGCAC GACACGCAGC GCCTCGCCGA CCCGCTGGCG
CTCATCGACG GGGACGTCCG GCCGCCGGCG ACCTACCTGG ACGCGGTGGA GATCCTGGAA
CGCCAGTACG TGGCCTGGCT GGTGGACCGG CGTTCCCGCG GCGGCGCGGC GGACCCGGGG
ACCGCCGCCA CGATGTTCCC CGAGGCCGGC TGGGCGCCGG GCACCTGGAT GGGCGACCTG
CTCGCCGACG CCCACGCGAA CGCGGCGGCC TACGCCGAGG AGTTCCTGGC CCTGTTCGGC
GCCGCGGTGC GCCCCGACAC CGCCGACCAT CTGCGTACCT GGGCCGGCGT AGGTCTGGCG
CCGGAGACGA GCGGGGCGCA GGAGGCGAGC GGGGAGCCGG AGGCGGTCTC CGGGCTGGCG
AGGCTGCTGC GGCGGGCGGC CGTCGAGTAC CGGCGGGAGG CCGACGAGCT GGGGCACCGG
CGCAACGCCC TGCAGGACAG CATGCCCGAG CTGGCCGTCG CCGCGAACCG CCCGACCGCC
ACCGCCGAGG ACGTCACCGC CCACCGGCTG GCCCGTGCCG AGCTGCGGTT CCTGCGCGAC
CGGCGCAGCG CGCTGGCCAC CCAGTTCTGG GTCAGCGCGC TGGAGGAACG CGGCGTGCTG
CCGAACTACA CCCTGCTCGA CGACACCGTC ACCCTCGACG TCGCCCTGCA GTGGATGGAC
CAGGACACGC AGGAGTACCA CGAGGAGCCG CGCACCTACC GGCGGGGCAG CGCGCTCGCG
CTCACCGAGT GGGCGCCGGG CGCCACCTTC TACGCCCAGG CGACGGCGGT GCGCATCGAC
GCCGTCAGCA TCGGCACCGA CCTCGACGGC CTGCTCCAGA CCTGGCGGGT CTGCCCGTCC
TGCGGCTGGT CGCGGGTGCA TCGGCGCTCC CTGGGCGACG AGCCGGCCGT CCCGGCGACC
TGCCCGCGCT GCGGGGACGC CGGGATCACC GACACCGGGC AGGCGCTGCC GGTGCTGCCG
CTGGAGAAGG TCTCCGCGCA GGTGTCCCGG GACGCGGCGA TCATCTCCGA CTCGCACGAC
GACCGCGAGC GGCTGCCGTT CACCGTCGTC GCCGCCGCGG ACGTCGATCC GGAGCAGATC
AGCGACGCCT GGCACCTCGG CGGCTACCCG TTCGGCGCCG AGTACGTCCG CCAGCTCGAG
GTGCGCTGGG TCAACGTGGG GCGCGCCGTC GAGCAGGCGC CGGAACGGAT GCTGGCCGGT
ACCGCGGTGC GGGCGCCGCT GTTCCTGATC TGCCCCGGCT GCGGGATCGT GCCGGCGGCG
CAGCTCGGGG TGCGCGACCC GCAGCAGGCC CGGCACCGCG CCTGGTGCCC GCACCGCACC
CGCATCGACG TCCCGTGGAC GGAGCTGGCG CTCGGGCGGT CGCTGCGCAC GCAGGGGGTG
CGGATCCTGC TGCCGCCGCA GTTCACGCTG GACCAGTTCG CCGGGCCGTC GTTCCGGGCG
GCGCTGCTAC TCGGCCTGCG TGAGCTGCTC GGCGGCGCGC CGGATCATCT CGACGTCCTG
GAGGTGCGGA TGCCGGTCGG CGGCAGTGAG CGGACGGCGC TGCTGCTGCA CGACCGTGTC
CCCGGCGGGA CGGGCTACCT CGCCGATCTG GCCCGGCCGA GCCGGGTCCG GGAGCTGCTC
ACCGCGGCCC GGGGCGTGGT CGCCCGGTGC GGGTGCGCCG AGGACGGGCT GTTGGCCTGC
TCCCGGTGCC TGCTGCCGTT CAGTCCGCCG GCGCTGGTCG AGCGGACGTC CCGGGCGCGG
GCCGAGCAGA TGCTCACCGA GGTGCTCGGC GACGGATGGG AGCCGGTCGG GCTGCGGTCG
GTCGCCGACA TCCCGCTGCC GTCCCTCGAC AGCGCGCTGG AGCTGCGGTT CCGGCGGACG
TTCACCGACG CGCTGCGCGC CCGCGGGGCG AGCGTGCGGG AGATCCCGCA GCTCACCGGG
ACGGAGCTGC GGTTCGGGGT GCCGGGCACC GTCCACCGGC GCTGGACGCT GCGGCCGCAG
GTCCGCGCGG CCGGCTGCCG GCCCGACTTC GAGCTGCTCT GCGAGGACCC GACGGTGCCC
CGCGTGTACG TCTTCACCGA CGGCCGGGCC TGGCACGCCA CCCCCGCCCA CAACCGGATC
GCGGACGACG CGGCGAAACG CGCCGCGCTG CGCGAGGCGG GGCATGTGGT GTGGGCGGTC
ACCGACGACG ACGTCGCGGT CTTCGAGGCG CTCGCCCGCG GTGAGCAGGT GACGGCTGCC
GGCGCCGCCT GGTACGACGA CGCCGTCCGG CGCCGGCTCG TCCAGCATCG CGCCGGGCGG
ATCCCCGCCG GTTCGGTGTC GGACGGGCTG GCCACCGCCG ACGCGGTCAC CCAGCTGCTC
GACTGGGTGA TGGAGCCGCG GCGGGACGCG TGGCGGGCGC TGGCGGACGG GCTGCCGTTC
GCGCTGTTCG ACGCCCGGCC GGCGCCGGTG GCGAGCCGGT CGGCTCTCGC CGGGCTCGCC
GCGGCGGCAC TGGACCCGCA CGCGGTTCCG GACGCCGTCG CGGGCCGGGG CGCGGCCGGT
GACCTGCAGG GCTGGGTCTG GCGGCGCGAC GGGCTGGCGG CGGCCTCGGC CGGCTGGCCG
CGGCCGCCGC ACGACCTCGG CTGCGTCCTC GTTCTCGACG ACCGGGACGA CGGCCTGGCC
ACCGAGGCCG GCGTGCGGGC GTGGCGAAGC TGGCTGGCGC TGTCGAACAT CCTCGGCCAC
GCCACCCGCA CCGCCCTTCC GATCGCCCTG TCGCAGTGCA CGGACGACCG GTCGGCGGGT
GCCGACCGGC CGGGCGCCGA CGCGGCCGGG GCTCGGGCCG ACACGGGTAC CGGGGCCGGG
GCGGACAGGG AGACCACGGC GGTGACGTCG CGGGAGGACG CGCTCTCACC GCCGTGGGCG
GACCTCGTCC TCAACGCGCT CGACGACGCC GAGAAGACGC TGCTGCGCGG GCTGGCGCGG
GCCGGCGTGC CGCTGCCCGA GCAGGGCCAC GAGACAGCCG GCGGCTTCCC GCTCGACCTC
GCCTGGCCGG ACGCGCGCGT CGCCGTCCTC GTCCATCCCG AGGACGTCGA CGGGGCGCTC
CGGGCCGAGC TCACCGGCGG CGGATGGCGG ATCGCCGACC CCGACCCGGC GCGGGTCGCC
GGCCTGCTGG CGACGGCCGG CGGGGGCGGC ACAGACGGCG GCGGCACGGA CATCCCGGTG
GAGGGAACCC GATGA
 
Protein sequence
MTAPLLPSLQ AADLRRALTD YLATTFALTD DDVRAALTTF LGDPEEGIFR GPYARLRLPF 
RPADGTWMVP LDWWPKDFVP YVHQAQAFER LSSKHGRPRP TIVTTGTGSG KTEAFLVPII
DHALRMRQQG QRGMKALILY PMNALANDQA GRLAEMLTGD PRLGGLTAGI YTGEQQGQRT
QVSQAGLITS REVMRSDPPD ILLTNYKMLD HLLLRANDRG LWEGAQASLT YVCLDEFHTY
DGAQGTDVAM LLRRLGSMLG VARPGAPLGD VVPVATSATL GAGSGASSGT EDTTNAAEAT
AEDEDESEEG GAGEPFAAMR AFAETVFGRP FEADAVVVEQ RTAPDDWRRE HSRPTPGVVI
RRLPATRLLA ETVREAVPEG PEQVADVLLG ALVARDVDGS EFSAAEIRRL RRGDRLPAVL
AAHPLTEGLL TAALHPKSVR DLAKKVLPDD TSPGRADGMA VVEAYLGLLS QARARANAEP
AAGRRLLGVD TQLWVREVTR IDRRVSPTPA FRWSDDGAHA DLDLYLPALY CRHCGRAGWG
ALRTLVGRAI DATETTVRRA SRAGDQRFRA LLHAPGEAAA VIADATDAPE TTETTETTEA
TSVEGLLWLN TSTLILTTHP ATDGEPDESA HIPVLTAWED DTAARAQTCP SCRQPDGIRF
LGSGVSTLTS VSLSALFGSA YLDTAEKKTL LFTDSVQDAA HTAGFVQARS HALSLRAALF
DTIDSRGPLA VADLAREAMD RAGDDPVRRY RLVPPALAQW AGFQRYWQPA GGAHERAQAQ
RLVERRLAFD AALEFGLNAR TGRTLELTGA AVAEVDTGGA DALARVAARA VARARQQHSL
SAENDDALGA ALDDARLAAW ARGVLERVRV QGGIWHRWLE SFVEHDGNRW FLTGGRRAAR
GEGLPAFPPG RPAPTFPTTG RTSTPAATFD SITGASSWYA RWAQRQLRLT ARDGGLVTRA
LLDEMADAGW LSRRRTETNA TVFLLDPAQV VVSVADPELL AGGVLALRCD VCQTLTPGTR
RTVDDLADAA CLRQACPGRL RRARRGDDYY RHLYRSVDMR RVVAHEHTSL LPDDVRLDVE
RQFREGGGPA VPNVLAATPT LELGIDIGDL STVMLGSLPR TVASYVQRVG RAGRLTGNAL
VLAYVKGRGH DTQRLADPLA LIDGDVRPPA TYLDAVEILE RQYVAWLVDR RSRGGAADPG
TAATMFPEAG WAPGTWMGDL LADAHANAAA YAEEFLALFG AAVRPDTADH LRTWAGVGLA
PETSGAQEAS GEPEAVSGLA RLLRRAAVEY RREADELGHR RNALQDSMPE LAVAANRPTA
TAEDVTAHRL ARAELRFLRD RRSALATQFW VSALEERGVL PNYTLLDDTV TLDVALQWMD
QDTQEYHEEP RTYRRGSALA LTEWAPGATF YAQATAVRID AVSIGTDLDG LLQTWRVCPS
CGWSRVHRRS LGDEPAVPAT CPRCGDAGIT DTGQALPVLP LEKVSAQVSR DAAIISDSHD
DRERLPFTVV AAADVDPEQI SDAWHLGGYP FGAEYVRQLE VRWVNVGRAV EQAPERMLAG
TAVRAPLFLI CPGCGIVPAA QLGVRDPQQA RHRAWCPHRT RIDVPWTELA LGRSLRTQGV
RILLPPQFTL DQFAGPSFRA ALLLGLRELL GGAPDHLDVL EVRMPVGGSE RTALLLHDRV
PGGTGYLADL ARPSRVRELL TAARGVVARC GCAEDGLLAC SRCLLPFSPP ALVERTSRAR
AEQMLTEVLG DGWEPVGLRS VADIPLPSLD SALELRFRRT FTDALRARGA SVREIPQLTG
TELRFGVPGT VHRRWTLRPQ VRAAGCRPDF ELLCEDPTVP RVYVFTDGRA WHATPAHNRI
ADDAAKRAAL REAGHVVWAV TDDDVAVFEA LARGEQVTAA GAAWYDDAVR RRLVQHRAGR
IPAGSVSDGL ATADAVTQLL DWVMEPRRDA WRALADGLPF ALFDARPAPV ASRSALAGLA
AAALDPHAVP DAVAGRGAAG DLQGWVWRRD GLAAASAGWP RPPHDLGCVL VLDDRDDGLA
TEAGVRAWRS WLALSNILGH ATRTALPIAL SQCTDDRSAG ADRPGADAAG ARADTGTGAG
ADRETTAVTS REDALSPPWA DLVLNALDDA EKTLLRGLAR AGVPLPEQGH ETAGGFPLDL
AWPDARVAVL VHPEDVDGAL RAELTGGGWR IADPDPARVA GLLATAGGGG TDGGGTDIPV
EGTR