Gene Franean1_1291 details

Gene Information

Locus tag: Franean1_1291
Symbol:
ID: 5669704
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1557868
End bp: 1562193
Gene Length: 4326 bp
Protein Length: 1441 aa
Translation table: 11
GC content: 74%
IMG OID: 641240223
Product: ATP-dependent helicase HrpA
Protein accession: YP_001505651
Protein GI: 158313143
COG category: [L] Replication, recombination and repair
COG ID: [COG1643] HrpA-like helicases
TIGRFAM ID: [TIGR01967] ATP-dependent helicase HrpA
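
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is complete and ends in a stop codon; neither assumption is stated on this page. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1557868
END_BP = 1562193
GENE_LENGTH_BP = 4326
PROTEIN_LENGTH_AA = 1441


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


# Both comparisons hold for the values listed in the table.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 1562193 - 1557868 + 1 = 4326 bp, and 4326 / 3 - 1 = 1441 aa, which agrees with the listed gene and protein lengths.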


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.723467
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCTCTC CTGCTGAAAC GCCGCCGCCG GCACCGCCGC CGCTGGCCGA GCTGCGTGCC 
TGCCTGCCCC AGCTCATGGC GCGCGACGCC CACCGCCTGA GCCGGCACCT CGACCGGGCC
CGCCGGATGC GCGACCCGGC GGCGCGCGAC TCCTCGATCG CGCGGCTGGC GGCGGACGTC
GAGACCGCCC GGCTGCGGGT GGAGACGCGC CGGGCCAGCG TCCCTGTGAT CGGCTACCCG
GAGGAGCTAC CGGTCAGCCA GCGCCGGGAC GAGATCCTGG CCGCCCTCCG GGACAACCAG
GTCGTGGTGA TCGCCGGCGA GACCGGTTCG GGCAAGACCA CGCAGCTCCC CAAGCTGTGC
CTGGAGCTGG GCCGGGGCGT GCGCGGCATG ATCGGGCACA CCCAGCCGCG GCGGATCGCC
GCGCGCACCG TGGCGGAACG GATCGCCGAG GAGCTCGGCA GCGTGATCCC CGAGACCGGC
CGTGACCGGG CCCAGGCCGG CCCGAACGGC AACGGAAGCG GAGGCGGGAA CGGCAGCGGG
GGCGGGGGCG GGCTGGTCGG CTACCAGATG CGCTTCACCG ACCGGGTCGG CCCTTCCACG
CTGATCAAGC TGATGACGGA TGGCATCCTG CTCAACGAGC TGACCGCCGA CCGGCTGCTC
CGCCAGTACG ACACTCTGAT CATCGACGAG GCGCACGAGC GCAGCCTCAA CATCGACTTC
ATTCTCGGCT ACCTGCACGG GCTGCTCCCC CGCCGCCCGG ACCTCAAGGT GGTCATCACC
TCGGCGACGA TCGACCCGCA CCGGTTCGCC CGCCACTTCC ACGACGCGCC GGTGATCGAG
GTTTCCGGCC GCACCTACCC GGTCGAGACC CGCTACCGCC CCCTGATCGA CACCGGGCCG
GGCGTCGATG CCGACTCCGA GGCTGACGCA TACGCCGACG CCGACGCGGA TGCCGGCACC
GCGGCACGGA GCGAACCACG AGCCGGTGGT CGCAAGGGAG CCGGCGGCGA CGGCCGAAGA
GGCAACAGCG AGGCCCGGCG AGGAGCCGGT GGCGACGCCC GAACCGGGGC GGGCGCCGAC
ACCGAGCGCG ACCAGGTCTC GGCGATCTGC GACGCGGTGG ACGAGCTGTG CGCCGAGGGG
CCGGGCGACA TCCTGGTGTT CCTCAGCGGG GAGCGCGAGA TCCGCGACAC GGCCGACGCG
CTGGCCCGCC GCGACCTCCC GATGACCGAG ATCGTCCCGC TCTACGCCCG GCTGTCCTCG
GCCGAGCAGC ACCGGGTCTT CACCCCGCAC ACCGGACGCC GGATCGTGCT GGCCACCAAC
GTCGCCGAGA CGTCGCTGAC CGTCCCGGGC ATCCGCTACG TGATCGACCC GGGGCTGGCG
CGCATCTCCC GGTACAGCCA TCGGACGAAG GTGCAGCGCC TGCCGATCGA GCCGGTCTCG
CAGGCCTCGG CCAACCAGCG GGCCGGCCGG TGCGGCCGGA CGTCCGACGG GATCTGCATC
CGGCTGTACT CGGAGGAGGA CTTCGCCGGC CGGCCGGCCT TCACCGATCC CGAGATCCTG
CGCACCAACC TGGCCTCGGT GATCCTGCAG ATGGAGGCGC TCGGCCTCGG TGAGATGGCG
GACTTCCCGT TCCTCGACCC GCCGGAGTCG CGCCAGGTCA CCGACGGCAT GCGGCTGCTG
ACCGAGCTGG GCGCCTTCAT CGAGGACGCC GAGCCCGGCA AACGGCTGAC CCCGATCGGG
CGCAGCCTCG CCCAGCTGCC GGTGGATCCC CGGCTGGCCC GGATGGTGCT CGCGGCCGGC
GAGCTGGGCT GCCTGTCCGA GGTGCTCGTG ATCGCCTCCG CGCTGGCCAT CCAGGACCCG
CGGGAGCGGC CCGTCGAGCA CCGGGCCGCC GCCGACGAGC GGCACGCCCG GTTCACCGAT
CCGACGTCGG ACTTCCTGTC GATCCTGAAC CTGTGGCGGC ACCTGCGCAC CGCGCGCGAG
GAGCGGTCGT CCAACCAGTT CCGGCGGATG TGCCGCACCG AGTTCCTGAA CTACCTGCGG
ATCCGGGAGT GGCAGGACGT CCACGGCCAG CTGCGCGCGA TCGCCCGGAA CCTGGGGCTG
GAGCCTAACG ACAGCCCGGC CGACGCCCGG TCGGTGCACC AGGCGCTGCT CACCGGGCTG
CTCTCCCACC TCGGCCGCTA CGAGCCGGAG AAGAAGGACT ATCTCGGGGC CCGGGGCGCC
CGCTTCGCGA TCTTCCCCGG TTCCGGGCTG GCGCGACGGA GGCCGGCGCG GCCCGAAGCC
GGCGACGCGG GCCGGGGCGC ACCGGCGGAA GCGGACGCGG AGGCCGGAGG GCGGCGCGGG
CCGGGCATCC CGACCTGGGT CGTCGCAGCC GAGCTGGTCG AGACGTCCCG GCTGTGGGCG
CGGACCGTGG CGCGGGTGGA GCCCGAGTGG GTCGAGCCGC TCGCCGAGCA CCTGCTGCGG
CATACCTACA GCGAGCCGCA CTGGTCCCGC AAGCAGGCCG CGGCGCTGGC CTACGAGAAG
GTCACGCTGT ACGGGATCCC GCTGGTGACG TCGCGGCTGG TCCAGTACGG CCGGATCGAC
CCGGTCGTCA GCCGGGACCT GTTCATCCGG CACGCGCTCG TCGAGGGCGA CTGGAACACC
CGGCACGCCT TCTTCCACGC GAACCGGGAG CTGCTCGCCG GGGTCGAGGA GTTGGAGCAC
CGGGCCCGCC GCCGCGACAT CCTCGTCGAC GACGAGACGC TGTTCGAGTT CTACGACCAG
CGCATACCAG CCGACGTCGT CTCCGGCCGC CATTTCGACA CCTGGTGGAA GGCGACCCGC
CGCGCCGAGC CGGACCTGCT CGACTTCTCC GCGTCGATGC TGGTCAACGA CCGCGCGGGC
GCGATCCGCC AGCAGGACTA CCCCGACACC TGGGTCGCCG ACGGCGTCGA GCTGGCGCTC
ACCTACCAGT TCACGCCCGG GGAGGCCGCC GACGGGGTGA CCGTGCGGGT GCCGCTGCCG
GTCCTCGGCA CGCTGCGCGC GGAGCCGTTC ACCTGGCAGG TGCCCGGGCT GCGGGAGGAG
CTCGTCACCG CGCTGATCCG TGCGCTGCCC AAGGTGGTGC GGCGCGGCTT CGTCCCGGCG
CCGAACTATG CCCGCGCTGT CCTGGACCGC CTGACCCCCG GCGACCGCCC GGCCCCCGGC
GGTCCCCCGT CCGAGGGCGG CGGACCGCCG GCCGACGACC GCCCGCTGGC CGACGCCGTC
GCCGGTGAGC TGCACCGGAT GAGCGGCTCG GCGCTCCCCT CGGGGGTCTG GGCGCCGCCG
CGGCTGGCCG AGCTGCTGCC GCCGCACCTG CGGATGACCT TCCGGGTCGT CGACGACGGC
GGGGCGACGC TCGCGGAGGG CAAGGACCTG GCCGAGCTGC GGGCCCGGCT GCGGCCCCGG
ACGAAGGCGG TCGTCACGGC CGCGGCCGCC GGCGTGGAAC GCTCGGGCCT GCGCGCCTGG
GACGTCGGCA CCTTGCCCGC GGTGATCGAG CGCAGCCGCG GCGGGCATGT CGTGCGCGCC
CACCCGGCTC TGGTCGACGA GGGCGGCTCG GTCGCGGTCC GGGTCTTCGA CGACCGGGAG
GAGGCCGACC GCGCGATGTG GGCCGGCACC CGCCGGCTGC TGCTGCTCAA CGTCGCCTCC
CCCGTCCGGG GGATCATCGG CCGGCTGCCG AACGCGGCGA AGCTGGCCCT GAGCAACAAC
CCGCACCGCG ACGTCACCGA CCTGCTGGAC GACTGCGTCG CCGCCGCCGT GGACACGCTG
CTGCGCCGCG CCGGCGGGGC CGCCCGGGAC GAGGCGGGTT TCGCCGCGCT ACTCGAGACG
GTGCGCGCGG AGCTGGCGGA CACCGCGTGG TCGGTGGTCC GCGGCACCGA GCGCGTGCTG
GCCGCCGCCC ACGAGATCAG CCTCGGGGTG CGGGCGCTGG GCAGCCCGGC GCTGCTGGTG
ACCGCGACCG ACCTGCGTGC CCAGCTCGAC ACACTGATCC ACCGGGGCTT CGTGACCGAG
ACGGGCGCCG AGCGCCTGCC CGATCTGGAG CGCTACCTCA CCGCCGCCCG GCGGCGCCTG
GACCGGCTGC CCGCCGACGC CGCCCGGGAC CGCCAGCTCA CCCTCCGGGT GCGTAACGTC
GTCGAGGCCT TCAACGAGCT GATCGACGAG GTGGGCCCGG CTCGCGCCGG GTCCGAACCG
GTCCAGGCGA TCCGCTGGAT GATCGAGGAG CTGCGGGTCA GCCTGTTCGC CCAGGCGCTG
CGCACGCCGT ACCCGGTCTC CGAGGAGCGC GTCTACCGGG CGATCGACCG GCTCCGCCCC
GGCTGA
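
The gene sequence is listed in blocks of ten bases separated by spaces, so it has to be joined before any analysis. As a minimal sketch (the helper names `clean_sequence` and `gc_fraction` are hypothetical, not part of any database tool), the following strips the layout whitespace and computes the fraction of G and C bases, which could be compared against the GC content reported in the Gene Information table.

```python
def clean_sequence(listing: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(listing.split()).upper()


def gc_fraction(listing: str) -> float:
    """Fraction of G and C bases in a nucleotide listing."""
    seq = clean_sequence(listing)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used as a small demonstration input.
first_line = "GTGATCTCTC CTGCTGAAAC GCCGCCGCCG GCACCGCCGC CGCTGGCCGA GCTGCGTGCC"
print(len(clean_sequence(first_line)))  # 60 bases: six blocks of ten
print(round(gc_fraction(first_line), 2))
```

To evaluate the whole gene, pass the full listing (all lines) to `gc_fraction`; `str.split()` with no arguments removes spaces and line breaks alike.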
 
Protein sequence
MISPAETPPP APPPLAELRA CLPQLMARDA HRLSRHLDRA RRMRDPAARD SSIARLAADV 
ETARLRVETR RASVPVIGYP EELPVSQRRD EILAALRDNQ VVVIAGETGS GKTTQLPKLC
LELGRGVRGM IGHTQPRRIA ARTVAERIAE ELGSVIPETG RDRAQAGPNG NGSGGGNGSG
GGGGLVGYQM RFTDRVGPST LIKLMTDGIL LNELTADRLL RQYDTLIIDE AHERSLNIDF
ILGYLHGLLP RRPDLKVVIT SATIDPHRFA RHFHDAPVIE VSGRTYPVET RYRPLIDTGP
GVDADSEADA YADADADAGT AARSEPRAGG RKGAGGDGRR GNSEARRGAG GDARTGAGAD
TERDQVSAIC DAVDELCAEG PGDILVFLSG EREIRDTADA LARRDLPMTE IVPLYARLSS
AEQHRVFTPH TGRRIVLATN VAETSLTVPG IRYVIDPGLA RISRYSHRTK VQRLPIEPVS
QASANQRAGR CGRTSDGICI RLYSEEDFAG RPAFTDPEIL RTNLASVILQ MEALGLGEMA
DFPFLDPPES RQVTDGMRLL TELGAFIEDA EPGKRLTPIG RSLAQLPVDP RLARMVLAAG
ELGCLSEVLV IASALAIQDP RERPVEHRAA ADERHARFTD PTSDFLSILN LWRHLRTARE
ERSSNQFRRM CRTEFLNYLR IREWQDVHGQ LRAIARNLGL EPNDSPADAR SVHQALLTGL
LSHLGRYEPE KKDYLGARGA RFAIFPGSGL ARRRPARPEA GDAGRGAPAE ADAEAGGRRG
PGIPTWVVAA ELVETSRLWA RTVARVEPEW VEPLAEHLLR HTYSEPHWSR KQAAALAYEK
VTLYGIPLVT SRLVQYGRID PVVSRDLFIR HALVEGDWNT RHAFFHANRE LLAGVEELEH
RARRRDILVD DETLFEFYDQ RIPADVVSGR HFDTWWKATR RAEPDLLDFS ASMLVNDRAG
AIRQQDYPDT WVADGVELAL TYQFTPGEAA DGVTVRVPLP VLGTLRAEPF TWQVPGLREE
LVTALIRALP KVVRRGFVPA PNYARAVLDR LTPGDRPAPG GPPSEGGGPP ADDRPLADAV
AGELHRMSGS ALPSGVWAPP RLAELLPPHL RMTFRVVDDG GATLAEGKDL AELRARLRPR
TKAVVTAAAA GVERSGLRAW DVGTLPAVIE RSRGGHVVRA HPALVDEGGS VAVRVFDDRE
EADRAMWAGT RRLLLLNVAS PVRGIIGRLP NAAKLALSNN PHRDVTDLLD DCVAAAVDTL
LRRAGGAARD EAGFAALLET VRAELADTAW SVVRGTERVL AAAHEISLGV RALGSPALLV
TATDLRAQLD TLIHRGFVTE TGAERLPDLE RYLTAARRRL DRLPADAARD RQLTLRVRNV
VEAFNELIDE VGPARAGSEP VQAIRWMIEE LRVSLFAQAL RTPYPVSEER VYRAIDRLRP
G