Gene Franean1_7212 details


Gene Information

Locus tag: Franean1_7212
Symbol:
ID: 5675513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8806544
End bp: 8810491
Gene Length: 3948 bp
Protein Length: 1315 aa
Translation table: 11
GC content: 76%
IMG OID: 641246049
Product: HSP90 family heat shock protein
Protein accession: YP_001511437
Protein GI: 158318929
COG category:
COG ID:
TIGRFAM ID:
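
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields.
start_bp = 8806544
end_bp = 8810491

# Inclusive coordinates: both the start and the end base are counted.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is assumed
# to be a stop codon and does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)           # 3948
print(gene_length % 3 == 0)  # True: the CDS is a whole number of codons
print(protein_length)        # 1315
```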


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.735343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATGCCA CCCCGTTCGA CACCGCGGCG CTTCGGGCAC GTGTCATCGA GGCGTGGGCC 
GCCTCCGCGA GCCGCTTCCG GGAGGACGCG AACGCCGAGG AGGACGCCGC TCTCGGCGCC
TACCGAGACC GCCTGCTCGT CGAGCTCCTC CAGAACGCGG TCGACGCGGC GGCGGCCGCT
GGTGTGCCGG CAAAGGTACA CATCAGGCTC ACGTCGGGGC AGCACCCGCA CGGCGGGCCG
GGTGGGCTGC TCGAGGTGGC GAACACCGGC GCCCCGCTGA CCGCCGCCGG CGTCGAGGCC
CTGTCGACGC TGCGCGCGTC CGCGAAGCGC GACGTGCAGG CAGTCGGCCG CTTCGGCGCC
GGCTTCGCCG CGGTCCTCGC GGTGACGGAC ACGCCGTCGA TTGTCTCCCG GGCGCCCTCG
GGCCCCGGCG ACGCGGCGGA CTGGCGTGAC GCCGACCAGC ACAGCGCCGA CCAGCGCAGT
ACGGACCTGC GCGGGGTGGA GCGGCGTGGC GTGGCGTGGC GCGGGGTGGA GTGGTCGCGC
CGGCGCACCG CCGACCTGGT TGTCGCGACG GGCGCGGACG CACTGCGCCG CGAGCTCGAC
CGCCGCGCCG GCGCCGTGCC CGTCCTGCGC CTGCCCTTCG ACCTCGGCGA CCCGGCGTCC
GGGCCCGGCG ACGGGTTCGA CACCGTCGTC CGGCTGCCGC TGCGGGACAC GGCGGCGGCG
GCCACGGCAC GCGAGCTCAT CGCCGCGTTC GACCCCACGC TGCCCCTCGT CCTGCCCGGC
CTTGAGGAGA TCGTCGTCGA GGTGGACGGC GACGTCCGAA CCCACCGGTG TGTCTGGGAG
CCGGTGCTCC CCGGGCCGGA CGGAACCGAC CTCGAAATCG CCACCGTCGA CGGCACGCGG
TGGCGCGGCT GTGTCCACCG CGGCTCGATC CCGTCCGAGC TGCTCGCCGA CCGCCCGGTC
GAGGAACGTG ACAGGACCAC CTACAACGCC CGGGTGATGA TCCCGGACGG CGGCTGGCCC
GCCGAGGTGG CGCAGGTGGT GCGCGCGCCG CAGCCGACCG ACGAGGCCGT CGGCCTGCCC
GCGCTGATCA GCGTCGACCT GCCGCTCGAT CCGTCCCGGC GGCACACCGT GCCCGGGCCC
CTGCGGGACT GGCTCACCGA CCGGCTCGCC GACACGGTCG TCGCGCTCGC CGCTCACCTG
GCCACCGCCC GACCGGACGG CGCCGAGAGC GGTGACCATG GCGCGGACGG GGGGCGCCAC
GCGGCTCCCG ATCCCCTCAC CGTCCTGGAC CTCGTGCCGA CCGGGCTGCC CCGCAGCGAG
GTGGACGGCC GGCTGCGCGA CCGCCTGCTC GCGCTGCTGC CGGACGCCCC CGTCCTGCCG
GGCGGCCGCC GCGGCCGGGA CTGCCTCGTC CTGGACCTCG GCCCCGCCAC GGACGCCGTC
ACCGACCTGC TGCGCGCGGG CGCCGAGACG GGCTCCGGCA CCGGCCTCAC CACGGACCCG
GACCTGGCGG ACTTTGCCGC GGCGGGGAAG GCTGACGAGG CCGCGGCCGA CGATGCCGTG
GTCGACGGGC TGCTGCCGGC CGAGTTCGCC ACCCGGCGCC GCCAGCCCGC ACTCGACCTC
CTGGGCGTCC GCCGCCTGGA CACCGGGGCC GTCGTCGAGG TCCTGCGCGG CATGCGGCGG
ACGCCGTCCT GGTGGGCCGG GCTGTACCCG CTGTTGCTGA CCGCACCCGA CCGGGACGCC
CTGGGCGCGC TGCCGGTTCC GGTCGTGGTC GTGCCCGACG GCGACGACCT CGGCTCCACG
CCGGCCTTCG CCCATGCTGA TCCGTTCACC GGGACCGGAC CGGACGGGGG TGGAGCCGCC
CCGGCGACAC CACCCAGCCG GATGGTCACC GGACCGCGCG GGGCCCTGCT ACCCACCCCC
GATCTGGACG TGGTGGCCCT GGCCCGATCG GGGCTGCCGC TGCGCGCCGT CCATCCCGAC
GCGTGCGCGG GTGCGGCCCG CGACGCGCTG CGGACCCTGG GCGCCTCGGA GGGCACGCCG
GCCGGGGTGC TGCGGGACCC GGCGGTACGC GAGGCCGTCA CCCAGGCCGA CCCCGACGAC
GACCCCGGCG AGCTGGACGC CCTCGCCGCC GCCGTACTCG CGCTGGTGCG CGCGATTGTC
CCGGATGGAC ACACCGCACA TTCGGACGGG CACGACCGCA ATGACGGGCT GGACGGCGGC
GGCTTCCCGT CCGACACCGT GGGTGATCCG CACCCGGAGC AGGGGCCGTC CGCGATGCCG
TCGTGGCTGG GTGGGCTGCT GCTCCCGGAA CAGGGCGGCG GCTTCGCGCC CGCGTCCGAG
CTGGTGATCG CCGACGGGCC GCTCGACCGC CTGCTGGCCG AGGACGCGCC GTTCGGCGCC
CTGGACCCAC GGCTGAGCAC CGCCTGGCCG GACCATGTCC TCGAAGCCAT CGGAGTCCTG
CGGACCTTCG GCGTCCTCTA TGCGTCGGAC GTGACCCTCG ACCCGGACGA GCCCGTCCTG
CTCGAGCTTG ACGACAGCGA CACCTGGGCG GACGACGTCC ACGATGCCGC CGACCGGGCG
GGGGAACGGG GCACCGCGGC TTCCGGTGAC GGGCGAGGCT CCGGCCGCGG GCCCGGCGCG
GCTGGCATGC CGGGTGGACA CGGTGTGCCG GGAGCGCACG GTGTGCCAGG GGCGCCGCGG
GTTGTCCGGC ATTTCGCGGC CGTCCGCGAC CTTGAGCTCG TCGACCCCGA CGCATGGCCG
GAGGCGCTTG CGGAGCTCGC CCGTCCGCCC CTGCGCCGGG TGGTTCTCGG CGCCGAACCC
TCGTACACCC GCTGGTGGCT GGCCCGGCAC GCGCTGCTGC CGGTGGATGG CGACCCGGAC
ACCCGGCTCC CCCCGGGCGA GCTCGTCCTG CCCGGCGCCG ATCCGTTGCT GACCGGGCTG
TTCGCGCCGG GCGCGCCGCT GCCGGGCGTC GACCCGGAGC TGCTGCGCCG GCTCGGCTGC
CGGCTCACCC TCGACGACGT GCTCACCGAC CCGGACGCGG TGATCGACCT GCTGGACCGG
CTCGGCGACG CCGACCGCGA GATCGGCTGG CCGGCGGCCC GGACGCTCTA CATGGCCGCG
GTGAGCGCGG CCGGCCTGCT CGGGGCCGAC CCCTCCGGCA CGACCGGCCC AAGGCTGGAC
CCGCCGCTGA CCGTACGAAC CCCGACCGGG GTGTTCCGCT CCGCCGACGT GGTGGTCGTG
GACGCACCGG ACCTGCTCGA AGTGATCGGC GCCGACCATC CCGCGCTGCG CCTGCCACTG
GATCGCGCCG CCGAGGCCGC GCACATCCTC GGGCTGCGTC TCGCCTCGGA GCTGGCCGAC
TTCGCCGTAC TGGACGAATC CGCTGGACTG AGCGAATCCA CTGGATTCGG TGGATCCGCT
GCTCTGATCG ACAGCACCGT TCAGGCCGGC GGTTCCATCG CTGGCAGCGC TGTCTCGGCC
GGAGCCGGCA CCGGTGGCGG TGGCGGTGGT ACCGATGGCG GTGGCGTCAG TGGCGTCCGC
GGTGCGGTTC CTGGCAGGGC CGTGACGAGG ACCGTCATGG TGGCCGGCAA CGCGGTCGTC
GTCGGGAGCG CGACCCCGGT CGGGGCCGCG GACACCCCGG ATTCGCTCGA CGGGGTCGAC
CTGGGCGCGG TGCCGGGCGC CGCGCGTCCC CTCGTCGCGC GGTACCAGGT CCATCCCGAG
CTGTGGATCT CAAGCCTGGG CGGGGGACGG GTCCGGGTGC CGTGGCGGGT CGTCGGCGGC
ATCGGCGGTG AGATCCACGT CGACGCCGAG GCCGGCACGG ATGCGCTCGC CCGGGCTCTG
GCCTGGCGGG CCGGCCAGTG GGAGCGCCGG CACGGACTGG CCGCCGCGCT GCGGGACCCC
GACGGCGCCG GCCGCCGTCA GGCCGAGGAC GACCTCGACG ATCTTTGA
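
GC content is the share of G and C bases in a sequence. The sketch below shows one way to compute it; the helper name `gc_count` is hypothetical. It is applied only to the first 60 bases of the gene sequence above, so it is not expected to reproduce the record's 76% figure, which refers to the whole gene.

```python
def gc_count(dna: str) -> tuple[int, int]:
    """Return (number of G or C bases, total number of bases).

    Whitespace is ignored, so the 10-base display groups can be pasted as-is.
    """
    seq = "".join(dna.split()).upper()
    gc = sum(1 for base in seq if base in "GC")
    return gc, len(seq)


# First line of the gene sequence, copied from the record.
first_line = "GTGGATGCCA CCCCGTTCGA CACCGCGGCG CTTCGGGCAC GTGTCATCGA GGCGTGGGCC"

gc, total = gc_count(first_line)
print(gc, total)                    # 43 60
print(round(100 * gc / total, 1))   # 71.7
```

Passing the full 3948-base sequence to the same helper would give the whole-gene value.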
 
Protein sequence
MDATPFDTAA LRARVIEAWA ASASRFREDA NAEEDAALGA YRDRLLVELL QNAVDAAAAA 
GVPAKVHIRL TSGQHPHGGP GGLLEVANTG APLTAAGVEA LSTLRASAKR DVQAVGRFGA
GFAAVLAVTD TPSIVSRAPS GPGDAADWRD ADQHSADQRS TDLRGVERRG VAWRGVEWSR
RRTADLVVAT GADALRRELD RRAGAVPVLR LPFDLGDPAS GPGDGFDTVV RLPLRDTAAA
ATARELIAAF DPTLPLVLPG LEEIVVEVDG DVRTHRCVWE PVLPGPDGTD LEIATVDGTR
WRGCVHRGSI PSELLADRPV EERDRTTYNA RVMIPDGGWP AEVAQVVRAP QPTDEAVGLP
ALISVDLPLD PSRRHTVPGP LRDWLTDRLA DTVVALAAHL ATARPDGAES GDHGADGGRH
AAPDPLTVLD LVPTGLPRSE VDGRLRDRLL ALLPDAPVLP GGRRGRDCLV LDLGPATDAV
TDLLRAGAET GSGTGLTTDP DLADFAAAGK ADEAAADDAV VDGLLPAEFA TRRRQPALDL
LGVRRLDTGA VVEVLRGMRR TPSWWAGLYP LLLTAPDRDA LGALPVPVVV VPDGDDLGST
PAFAHADPFT GTGPDGGGAA PATPPSRMVT GPRGALLPTP DLDVVALARS GLPLRAVHPD
ACAGAARDAL RTLGASEGTP AGVLRDPAVR EAVTQADPDD DPGELDALAA AVLALVRAIV
PDGHTAHSDG HDRNDGLDGG GFPSDTVGDP HPEQGPSAMP SWLGGLLLPE QGGGFAPASE
LVIADGPLDR LLAEDAPFGA LDPRLSTAWP DHVLEAIGVL RTFGVLYASD VTLDPDEPVL
LELDDSDTWA DDVHDAADRA GERGTAASGD GRGSGRGPGA AGMPGGHGVP GAHGVPGAPR
VVRHFAAVRD LELVDPDAWP EALAELARPP LRRVVLGAEP SYTRWWLARH ALLPVDGDPD
TRLPPGELVL PGADPLLTGL FAPGAPLPGV DPELLRRLGC RLTLDDVLTD PDAVIDLLDR
LGDADREIGW PAARTLYMAA VSAAGLLGAD PSGTTGPRLD PPLTVRTPTG VFRSADVVVV
DAPDLLEVIG ADHPALRLPL DRAAEAAHIL GLRLASELAD FAVLDESAGL SESTGFGGSA
ALIDSTVQAG GSIAGSAVSA GAGTGGGGGG TDGGGVSGVR GAVPGRAVTR TVMVAGNAVV
VGSATPVGAA DTPDSLDGVD LGAVPGAARP LVARYQVHPE LWISSLGGGR VRVPWRVVGG
IGGEIHVDAE AGTDALARAL AWRAGQWERR HGLAAALRDP DGAGRRQAED DLDDL
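
The gene sequence begins with GTG rather than ATG, yet the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is a permitted start codon and is read as methionine when it initiates translation, although it codes for valine elsewhere. The sketch below illustrates this on the first and last lines of the gene sequence; the helper name `translate` is hypothetical, and the codon table is built from the conventional TCAG base ordering rather than taken from this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a DNA fragment in frame, stopping at the first stop codon.

    Assumption: a permitted start codon in the first position is treated
    as the initiator and translated as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "GTGGATGCCA CCCCGTTCGA CACCGCGGCG CTTCGGGCAC GTGTCATCGA GGCGTGGGCC"
last_line = "GACGGCGCCG GCCGCCGTCA GGCCGAGGAC GACCTCGACG ATCTTTGA"

print(translate(first_line))  # MDATPFDTAALRARVIEAWA
print(translate(last_line))   # DGAGRRQAEDDLDDL
```

Both outputs match the corresponding ends of the protein sequence, and the final TGA codon is the stop that accounts for the protein being one residue shorter than the codon count.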