Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1291 |
Symbol | |
ID | 5669704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1557868 |
End bp | 1562193 |
Gene Length | 4326 bp |
Protein Length | 1441 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240223 |
Product | ATP-dependent helicase HrpA |
Protein accession | YP_001505651 |
Protein GI | 158313143 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1643] HrpA-like helicases |
TIGRFAM ID | [TIGR01967] ATP-dependent helicase HrpA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.723467 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCTCTC CTGCTGAAAC GCCGCCGCCG GCACCGCCGC CGCTGGCCGA GCTGCGTGCC TGCCTGCCCC AGCTCATGGC GCGCGACGCC CACCGCCTGA GCCGGCACCT CGACCGGGCC CGCCGGATGC GCGACCCGGC GGCGCGCGAC TCCTCGATCG CGCGGCTGGC GGCGGACGTC GAGACCGCCC GGCTGCGGGT GGAGACGCGC CGGGCCAGCG TCCCTGTGAT CGGCTACCCG GAGGAGCTAC CGGTCAGCCA GCGCCGGGAC GAGATCCTGG CCGCCCTCCG GGACAACCAG GTCGTGGTGA TCGCCGGCGA GACCGGTTCG GGCAAGACCA CGCAGCTCCC CAAGCTGTGC CTGGAGCTGG GCCGGGGCGT GCGCGGCATG ATCGGGCACA CCCAGCCGCG GCGGATCGCC GCGCGCACCG TGGCGGAACG GATCGCCGAG GAGCTCGGCA GCGTGATCCC CGAGACCGGC CGTGACCGGG CCCAGGCCGG CCCGAACGGC AACGGAAGCG GAGGCGGGAA CGGCAGCGGG GGCGGGGGCG GGCTGGTCGG CTACCAGATG CGCTTCACCG ACCGGGTCGG CCCTTCCACG CTGATCAAGC TGATGACGGA TGGCATCCTG CTCAACGAGC TGACCGCCGA CCGGCTGCTC CGCCAGTACG ACACTCTGAT CATCGACGAG GCGCACGAGC GCAGCCTCAA CATCGACTTC ATTCTCGGCT ACCTGCACGG GCTGCTCCCC CGCCGCCCGG ACCTCAAGGT GGTCATCACC TCGGCGACGA TCGACCCGCA CCGGTTCGCC CGCCACTTCC ACGACGCGCC GGTGATCGAG GTTTCCGGCC GCACCTACCC GGTCGAGACC CGCTACCGCC CCCTGATCGA CACCGGGCCG GGCGTCGATG CCGACTCCGA GGCTGACGCA TACGCCGACG CCGACGCGGA TGCCGGCACC GCGGCACGGA GCGAACCACG AGCCGGTGGT CGCAAGGGAG CCGGCGGCGA CGGCCGAAGA GGCAACAGCG AGGCCCGGCG AGGAGCCGGT GGCGACGCCC GAACCGGGGC GGGCGCCGAC ACCGAGCGCG ACCAGGTCTC GGCGATCTGC GACGCGGTGG ACGAGCTGTG CGCCGAGGGG CCGGGCGACA TCCTGGTGTT CCTCAGCGGG GAGCGCGAGA TCCGCGACAC GGCCGACGCG CTGGCCCGCC GCGACCTCCC GATGACCGAG ATCGTCCCGC TCTACGCCCG GCTGTCCTCG GCCGAGCAGC ACCGGGTCTT CACCCCGCAC ACCGGACGCC GGATCGTGCT GGCCACCAAC GTCGCCGAGA CGTCGCTGAC CGTCCCGGGC ATCCGCTACG TGATCGACCC GGGGCTGGCG CGCATCTCCC GGTACAGCCA TCGGACGAAG GTGCAGCGCC TGCCGATCGA GCCGGTCTCG CAGGCCTCGG CCAACCAGCG GGCCGGCCGG TGCGGCCGGA CGTCCGACGG GATCTGCATC CGGCTGTACT CGGAGGAGGA CTTCGCCGGC CGGCCGGCCT TCACCGATCC CGAGATCCTG CGCACCAACC TGGCCTCGGT GATCCTGCAG ATGGAGGCGC TCGGCCTCGG TGAGATGGCG GACTTCCCGT TCCTCGACCC GCCGGAGTCG CGCCAGGTCA CCGACGGCAT GCGGCTGCTG ACCGAGCTGG GCGCCTTCAT CGAGGACGCC GAGCCCGGCA AACGGCTGAC CCCGATCGGG CGCAGCCTCG CCCAGCTGCC GGTGGATCCC CGGCTGGCCC GGATGGTGCT CGCGGCCGGC GAGCTGGGCT GCCTGTCCGA GGTGCTCGTG ATCGCCTCCG CGCTGGCCAT CCAGGACCCG CGGGAGCGGC CCGTCGAGCA CCGGGCCGCC GCCGACGAGC GGCACGCCCG GTTCACCGAT CCGACGTCGG ACTTCCTGTC GATCCTGAAC CTGTGGCGGC ACCTGCGCAC CGCGCGCGAG GAGCGGTCGT CCAACCAGTT CCGGCGGATG TGCCGCACCG AGTTCCTGAA CTACCTGCGG ATCCGGGAGT GGCAGGACGT CCACGGCCAG CTGCGCGCGA TCGCCCGGAA CCTGGGGCTG GAGCCTAACG ACAGCCCGGC CGACGCCCGG TCGGTGCACC AGGCGCTGCT CACCGGGCTG CTCTCCCACC TCGGCCGCTA CGAGCCGGAG AAGAAGGACT ATCTCGGGGC CCGGGGCGCC CGCTTCGCGA TCTTCCCCGG TTCCGGGCTG GCGCGACGGA GGCCGGCGCG GCCCGAAGCC GGCGACGCGG GCCGGGGCGC ACCGGCGGAA GCGGACGCGG AGGCCGGAGG GCGGCGCGGG CCGGGCATCC CGACCTGGGT CGTCGCAGCC GAGCTGGTCG AGACGTCCCG GCTGTGGGCG CGGACCGTGG CGCGGGTGGA GCCCGAGTGG GTCGAGCCGC TCGCCGAGCA CCTGCTGCGG CATACCTACA GCGAGCCGCA CTGGTCCCGC AAGCAGGCCG CGGCGCTGGC CTACGAGAAG GTCACGCTGT ACGGGATCCC GCTGGTGACG TCGCGGCTGG TCCAGTACGG CCGGATCGAC CCGGTCGTCA GCCGGGACCT GTTCATCCGG CACGCGCTCG TCGAGGGCGA CTGGAACACC CGGCACGCCT TCTTCCACGC GAACCGGGAG CTGCTCGCCG GGGTCGAGGA GTTGGAGCAC CGGGCCCGCC GCCGCGACAT CCTCGTCGAC GACGAGACGC TGTTCGAGTT CTACGACCAG CGCATACCAG CCGACGTCGT CTCCGGCCGC CATTTCGACA CCTGGTGGAA GGCGACCCGC CGCGCCGAGC CGGACCTGCT CGACTTCTCC GCGTCGATGC TGGTCAACGA CCGCGCGGGC GCGATCCGCC AGCAGGACTA CCCCGACACC TGGGTCGCCG ACGGCGTCGA GCTGGCGCTC ACCTACCAGT TCACGCCCGG GGAGGCCGCC GACGGGGTGA CCGTGCGGGT GCCGCTGCCG GTCCTCGGCA CGCTGCGCGC GGAGCCGTTC ACCTGGCAGG TGCCCGGGCT GCGGGAGGAG CTCGTCACCG CGCTGATCCG TGCGCTGCCC AAGGTGGTGC GGCGCGGCTT CGTCCCGGCG CCGAACTATG CCCGCGCTGT CCTGGACCGC CTGACCCCCG GCGACCGCCC GGCCCCCGGC GGTCCCCCGT CCGAGGGCGG CGGACCGCCG GCCGACGACC GCCCGCTGGC CGACGCCGTC GCCGGTGAGC TGCACCGGAT GAGCGGCTCG GCGCTCCCCT CGGGGGTCTG GGCGCCGCCG CGGCTGGCCG AGCTGCTGCC GCCGCACCTG CGGATGACCT TCCGGGTCGT CGACGACGGC GGGGCGACGC TCGCGGAGGG CAAGGACCTG GCCGAGCTGC GGGCCCGGCT GCGGCCCCGG ACGAAGGCGG TCGTCACGGC CGCGGCCGCC GGCGTGGAAC GCTCGGGCCT GCGCGCCTGG GACGTCGGCA CCTTGCCCGC GGTGATCGAG CGCAGCCGCG GCGGGCATGT CGTGCGCGCC CACCCGGCTC TGGTCGACGA GGGCGGCTCG GTCGCGGTCC GGGTCTTCGA CGACCGGGAG GAGGCCGACC GCGCGATGTG GGCCGGCACC CGCCGGCTGC TGCTGCTCAA CGTCGCCTCC CCCGTCCGGG GGATCATCGG CCGGCTGCCG AACGCGGCGA AGCTGGCCCT GAGCAACAAC CCGCACCGCG ACGTCACCGA CCTGCTGGAC GACTGCGTCG CCGCCGCCGT GGACACGCTG CTGCGCCGCG CCGGCGGGGC CGCCCGGGAC GAGGCGGGTT TCGCCGCGCT ACTCGAGACG GTGCGCGCGG AGCTGGCGGA CACCGCGTGG TCGGTGGTCC GCGGCACCGA GCGCGTGCTG GCCGCCGCCC ACGAGATCAG CCTCGGGGTG CGGGCGCTGG GCAGCCCGGC GCTGCTGGTG ACCGCGACCG ACCTGCGTGC CCAGCTCGAC ACACTGATCC ACCGGGGCTT CGTGACCGAG ACGGGCGCCG AGCGCCTGCC CGATCTGGAG CGCTACCTCA CCGCCGCCCG GCGGCGCCTG GACCGGCTGC CCGCCGACGC CGCCCGGGAC CGCCAGCTCA CCCTCCGGGT GCGTAACGTC GTCGAGGCCT TCAACGAGCT GATCGACGAG GTGGGCCCGG CTCGCGCCGG GTCCGAACCG GTCCAGGCGA TCCGCTGGAT GATCGAGGAG CTGCGGGTCA GCCTGTTCGC CCAGGCGCTG CGCACGCCGT ACCCGGTCTC CGAGGAGCGC GTCTACCGGG CGATCGACCG GCTCCGCCCC GGCTGA
|
Protein sequence | MISPAETPPP APPPLAELRA CLPQLMARDA HRLSRHLDRA RRMRDPAARD SSIARLAADV ETARLRVETR RASVPVIGYP EELPVSQRRD EILAALRDNQ VVVIAGETGS GKTTQLPKLC LELGRGVRGM IGHTQPRRIA ARTVAERIAE ELGSVIPETG RDRAQAGPNG NGSGGGNGSG GGGGLVGYQM RFTDRVGPST LIKLMTDGIL LNELTADRLL RQYDTLIIDE AHERSLNIDF ILGYLHGLLP RRPDLKVVIT SATIDPHRFA RHFHDAPVIE VSGRTYPVET RYRPLIDTGP GVDADSEADA YADADADAGT AARSEPRAGG RKGAGGDGRR GNSEARRGAG GDARTGAGAD TERDQVSAIC DAVDELCAEG PGDILVFLSG EREIRDTADA LARRDLPMTE IVPLYARLSS AEQHRVFTPH TGRRIVLATN VAETSLTVPG IRYVIDPGLA RISRYSHRTK VQRLPIEPVS QASANQRAGR CGRTSDGICI RLYSEEDFAG RPAFTDPEIL RTNLASVILQ MEALGLGEMA DFPFLDPPES RQVTDGMRLL TELGAFIEDA EPGKRLTPIG RSLAQLPVDP RLARMVLAAG ELGCLSEVLV IASALAIQDP RERPVEHRAA ADERHARFTD PTSDFLSILN LWRHLRTARE ERSSNQFRRM CRTEFLNYLR IREWQDVHGQ LRAIARNLGL EPNDSPADAR SVHQALLTGL LSHLGRYEPE KKDYLGARGA RFAIFPGSGL ARRRPARPEA GDAGRGAPAE ADAEAGGRRG PGIPTWVVAA ELVETSRLWA RTVARVEPEW VEPLAEHLLR HTYSEPHWSR KQAAALAYEK VTLYGIPLVT SRLVQYGRID PVVSRDLFIR HALVEGDWNT RHAFFHANRE LLAGVEELEH RARRRDILVD DETLFEFYDQ RIPADVVSGR HFDTWWKATR RAEPDLLDFS ASMLVNDRAG AIRQQDYPDT WVADGVELAL TYQFTPGEAA DGVTVRVPLP VLGTLRAEPF TWQVPGLREE LVTALIRALP KVVRRGFVPA PNYARAVLDR LTPGDRPAPG GPPSEGGGPP ADDRPLADAV AGELHRMSGS ALPSGVWAPP RLAELLPPHL RMTFRVVDDG GATLAEGKDL AELRARLRPR TKAVVTAAAA GVERSGLRAW DVGTLPAVIE RSRGGHVVRA HPALVDEGGS VAVRVFDDRE EADRAMWAGT RRLLLLNVAS PVRGIIGRLP NAAKLALSNN PHRDVTDLLD DCVAAAVDTL LRRAGGAARD EAGFAALLET VRAELADTAW SVVRGTERVL AAAHEISLGV RALGSPALLV TATDLRAQLD TLIHRGFVTE TGAERLPDLE RYLTAARRRL DRLPADAARD RQLTLRVRNV VEAFNELIDE VGPARAGSEP VQAIRWMIEE LRVSLFAQAL RTPYPVSEER VYRAIDRLRP G
|
| |