Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7212 |
Symbol | |
ID | 5675513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8806544 |
End bp | 8810491 |
Gene Length | 3948 bp |
Protein Length | 1315 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641246049 |
Product | HSP90 family heat shock protein |
Protein accession | YP_001511437 |
Protein GI | 158318929 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.735343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATGCCA CCCCGTTCGA CACCGCGGCG CTTCGGGCAC GTGTCATCGA GGCGTGGGCC GCCTCCGCGA GCCGCTTCCG GGAGGACGCG AACGCCGAGG AGGACGCCGC TCTCGGCGCC TACCGAGACC GCCTGCTCGT CGAGCTCCTC CAGAACGCGG TCGACGCGGC GGCGGCCGCT GGTGTGCCGG CAAAGGTACA CATCAGGCTC ACGTCGGGGC AGCACCCGCA CGGCGGGCCG GGTGGGCTGC TCGAGGTGGC GAACACCGGC GCCCCGCTGA CCGCCGCCGG CGTCGAGGCC CTGTCGACGC TGCGCGCGTC CGCGAAGCGC GACGTGCAGG CAGTCGGCCG CTTCGGCGCC GGCTTCGCCG CGGTCCTCGC GGTGACGGAC ACGCCGTCGA TTGTCTCCCG GGCGCCCTCG GGCCCCGGCG ACGCGGCGGA CTGGCGTGAC GCCGACCAGC ACAGCGCCGA CCAGCGCAGT ACGGACCTGC GCGGGGTGGA GCGGCGTGGC GTGGCGTGGC GCGGGGTGGA GTGGTCGCGC CGGCGCACCG CCGACCTGGT TGTCGCGACG GGCGCGGACG CACTGCGCCG CGAGCTCGAC CGCCGCGCCG GCGCCGTGCC CGTCCTGCGC CTGCCCTTCG ACCTCGGCGA CCCGGCGTCC GGGCCCGGCG ACGGGTTCGA CACCGTCGTC CGGCTGCCGC TGCGGGACAC GGCGGCGGCG GCCACGGCAC GCGAGCTCAT CGCCGCGTTC GACCCCACGC TGCCCCTCGT CCTGCCCGGC CTTGAGGAGA TCGTCGTCGA GGTGGACGGC GACGTCCGAA CCCACCGGTG TGTCTGGGAG CCGGTGCTCC CCGGGCCGGA CGGAACCGAC CTCGAAATCG CCACCGTCGA CGGCACGCGG TGGCGCGGCT GTGTCCACCG CGGCTCGATC CCGTCCGAGC TGCTCGCCGA CCGCCCGGTC GAGGAACGTG ACAGGACCAC CTACAACGCC CGGGTGATGA TCCCGGACGG CGGCTGGCCC GCCGAGGTGG CGCAGGTGGT GCGCGCGCCG CAGCCGACCG ACGAGGCCGT CGGCCTGCCC GCGCTGATCA GCGTCGACCT GCCGCTCGAT CCGTCCCGGC GGCACACCGT GCCCGGGCCC CTGCGGGACT GGCTCACCGA CCGGCTCGCC GACACGGTCG TCGCGCTCGC CGCTCACCTG GCCACCGCCC GACCGGACGG CGCCGAGAGC GGTGACCATG GCGCGGACGG GGGGCGCCAC GCGGCTCCCG ATCCCCTCAC CGTCCTGGAC CTCGTGCCGA CCGGGCTGCC CCGCAGCGAG GTGGACGGCC GGCTGCGCGA CCGCCTGCTC GCGCTGCTGC CGGACGCCCC CGTCCTGCCG GGCGGCCGCC GCGGCCGGGA CTGCCTCGTC CTGGACCTCG GCCCCGCCAC GGACGCCGTC ACCGACCTGC TGCGCGCGGG CGCCGAGACG GGCTCCGGCA CCGGCCTCAC CACGGACCCG GACCTGGCGG ACTTTGCCGC GGCGGGGAAG GCTGACGAGG CCGCGGCCGA CGATGCCGTG GTCGACGGGC TGCTGCCGGC CGAGTTCGCC ACCCGGCGCC GCCAGCCCGC ACTCGACCTC CTGGGCGTCC GCCGCCTGGA CACCGGGGCC GTCGTCGAGG TCCTGCGCGG CATGCGGCGG ACGCCGTCCT GGTGGGCCGG GCTGTACCCG CTGTTGCTGA CCGCACCCGA CCGGGACGCC CTGGGCGCGC TGCCGGTTCC GGTCGTGGTC GTGCCCGACG GCGACGACCT CGGCTCCACG CCGGCCTTCG CCCATGCTGA TCCGTTCACC GGGACCGGAC CGGACGGGGG TGGAGCCGCC CCGGCGACAC CACCCAGCCG GATGGTCACC GGACCGCGCG GGGCCCTGCT ACCCACCCCC GATCTGGACG TGGTGGCCCT GGCCCGATCG GGGCTGCCGC TGCGCGCCGT CCATCCCGAC GCGTGCGCGG GTGCGGCCCG CGACGCGCTG CGGACCCTGG GCGCCTCGGA GGGCACGCCG GCCGGGGTGC TGCGGGACCC GGCGGTACGC GAGGCCGTCA CCCAGGCCGA CCCCGACGAC GACCCCGGCG AGCTGGACGC CCTCGCCGCC GCCGTACTCG CGCTGGTGCG CGCGATTGTC CCGGATGGAC ACACCGCACA TTCGGACGGG CACGACCGCA ATGACGGGCT GGACGGCGGC GGCTTCCCGT CCGACACCGT GGGTGATCCG CACCCGGAGC AGGGGCCGTC CGCGATGCCG TCGTGGCTGG GTGGGCTGCT GCTCCCGGAA CAGGGCGGCG GCTTCGCGCC CGCGTCCGAG CTGGTGATCG CCGACGGGCC GCTCGACCGC CTGCTGGCCG AGGACGCGCC GTTCGGCGCC CTGGACCCAC GGCTGAGCAC CGCCTGGCCG GACCATGTCC TCGAAGCCAT CGGAGTCCTG CGGACCTTCG GCGTCCTCTA TGCGTCGGAC GTGACCCTCG ACCCGGACGA GCCCGTCCTG CTCGAGCTTG ACGACAGCGA CACCTGGGCG GACGACGTCC ACGATGCCGC CGACCGGGCG GGGGAACGGG GCACCGCGGC TTCCGGTGAC GGGCGAGGCT CCGGCCGCGG GCCCGGCGCG GCTGGCATGC CGGGTGGACA CGGTGTGCCG GGAGCGCACG GTGTGCCAGG GGCGCCGCGG GTTGTCCGGC ATTTCGCGGC CGTCCGCGAC CTTGAGCTCG TCGACCCCGA CGCATGGCCG GAGGCGCTTG CGGAGCTCGC CCGTCCGCCC CTGCGCCGGG TGGTTCTCGG CGCCGAACCC TCGTACACCC GCTGGTGGCT GGCCCGGCAC GCGCTGCTGC CGGTGGATGG CGACCCGGAC ACCCGGCTCC CCCCGGGCGA GCTCGTCCTG CCCGGCGCCG ATCCGTTGCT GACCGGGCTG TTCGCGCCGG GCGCGCCGCT GCCGGGCGTC GACCCGGAGC TGCTGCGCCG GCTCGGCTGC CGGCTCACCC TCGACGACGT GCTCACCGAC CCGGACGCGG TGATCGACCT GCTGGACCGG CTCGGCGACG CCGACCGCGA GATCGGCTGG CCGGCGGCCC GGACGCTCTA CATGGCCGCG GTGAGCGCGG CCGGCCTGCT CGGGGCCGAC CCCTCCGGCA CGACCGGCCC AAGGCTGGAC CCGCCGCTGA CCGTACGAAC CCCGACCGGG GTGTTCCGCT CCGCCGACGT GGTGGTCGTG GACGCACCGG ACCTGCTCGA AGTGATCGGC GCCGACCATC CCGCGCTGCG CCTGCCACTG GATCGCGCCG CCGAGGCCGC GCACATCCTC GGGCTGCGTC TCGCCTCGGA GCTGGCCGAC TTCGCCGTAC TGGACGAATC CGCTGGACTG AGCGAATCCA CTGGATTCGG TGGATCCGCT GCTCTGATCG ACAGCACCGT TCAGGCCGGC GGTTCCATCG CTGGCAGCGC TGTCTCGGCC GGAGCCGGCA CCGGTGGCGG TGGCGGTGGT ACCGATGGCG GTGGCGTCAG TGGCGTCCGC GGTGCGGTTC CTGGCAGGGC CGTGACGAGG ACCGTCATGG TGGCCGGCAA CGCGGTCGTC GTCGGGAGCG CGACCCCGGT CGGGGCCGCG GACACCCCGG ATTCGCTCGA CGGGGTCGAC CTGGGCGCGG TGCCGGGCGC CGCGCGTCCC CTCGTCGCGC GGTACCAGGT CCATCCCGAG CTGTGGATCT CAAGCCTGGG CGGGGGACGG GTCCGGGTGC CGTGGCGGGT CGTCGGCGGC ATCGGCGGTG AGATCCACGT CGACGCCGAG GCCGGCACGG ATGCGCTCGC CCGGGCTCTG GCCTGGCGGG CCGGCCAGTG GGAGCGCCGG CACGGACTGG CCGCCGCGCT GCGGGACCCC GACGGCGCCG GCCGCCGTCA GGCCGAGGAC GACCTCGACG ATCTTTGA
|
Protein sequence | MDATPFDTAA LRARVIEAWA ASASRFREDA NAEEDAALGA YRDRLLVELL QNAVDAAAAA GVPAKVHIRL TSGQHPHGGP GGLLEVANTG APLTAAGVEA LSTLRASAKR DVQAVGRFGA GFAAVLAVTD TPSIVSRAPS GPGDAADWRD ADQHSADQRS TDLRGVERRG VAWRGVEWSR RRTADLVVAT GADALRRELD RRAGAVPVLR LPFDLGDPAS GPGDGFDTVV RLPLRDTAAA ATARELIAAF DPTLPLVLPG LEEIVVEVDG DVRTHRCVWE PVLPGPDGTD LEIATVDGTR WRGCVHRGSI PSELLADRPV EERDRTTYNA RVMIPDGGWP AEVAQVVRAP QPTDEAVGLP ALISVDLPLD PSRRHTVPGP LRDWLTDRLA DTVVALAAHL ATARPDGAES GDHGADGGRH AAPDPLTVLD LVPTGLPRSE VDGRLRDRLL ALLPDAPVLP GGRRGRDCLV LDLGPATDAV TDLLRAGAET GSGTGLTTDP DLADFAAAGK ADEAAADDAV VDGLLPAEFA TRRRQPALDL LGVRRLDTGA VVEVLRGMRR TPSWWAGLYP LLLTAPDRDA LGALPVPVVV VPDGDDLGST PAFAHADPFT GTGPDGGGAA PATPPSRMVT GPRGALLPTP DLDVVALARS GLPLRAVHPD ACAGAARDAL RTLGASEGTP AGVLRDPAVR EAVTQADPDD DPGELDALAA AVLALVRAIV PDGHTAHSDG HDRNDGLDGG GFPSDTVGDP HPEQGPSAMP SWLGGLLLPE QGGGFAPASE LVIADGPLDR LLAEDAPFGA LDPRLSTAWP DHVLEAIGVL RTFGVLYASD VTLDPDEPVL LELDDSDTWA DDVHDAADRA GERGTAASGD GRGSGRGPGA AGMPGGHGVP GAHGVPGAPR VVRHFAAVRD LELVDPDAWP EALAELARPP LRRVVLGAEP SYTRWWLARH ALLPVDGDPD TRLPPGELVL PGADPLLTGL FAPGAPLPGV DPELLRRLGC RLTLDDVLTD PDAVIDLLDR LGDADREIGW PAARTLYMAA VSAAGLLGAD PSGTTGPRLD PPLTVRTPTG VFRSADVVVV DAPDLLEVIG ADHPALRLPL DRAAEAAHIL GLRLASELAD FAVLDESAGL SESTGFGGSA ALIDSTVQAG GSIAGSAVSA GAGTGGGGGG TDGGGVSGVR GAVPGRAVTR TVMVAGNAVV VGSATPVGAA DTPDSLDGVD LGAVPGAARP LVARYQVHPE LWISSLGGGR VRVPWRVVGG IGGEIHVDAE AGTDALARAL AWRAGQWERR HGLAAALRDP DGAGRRQAED DLDDL
|
| |