Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4662 |
Symbol | |
ID | 5673005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5563689 |
End bp | 5566070 |
Gene Length | 2382 bp |
Protein Length | 793 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243520 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001508936 |
Protein GI | 158316428 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.576806 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCGCA GCCCCCTGAG CCCCGGCAAG GTCCCGGACC TGGTGTCCCG GCTGACGCTG GAGCAGAAGA TCGCCCAGCT GACGGGGTTC GCGGTCACCG ATCTCATCGT CCGCGGCGAG GGACCGGGCC CGGCCAGTCC GGACATCGAC GTGAGCCGGG TGCCGTCACT GCGGCCGCAC GGGGCGGGCC ACCTGTCCCT GTCCTGGTTC CTCGGCCACG ACGCGGCGAG CCTGCGGTCC GCGCTCGACA GGATCCAGTC GGCGGTGCGT GAGGTCGCCC CCTTCGGCAT CGGGGCCCTC GCGCACAACG AGGCCGTCAA CGGCTTCCTG CACGTCTCGG GTTCGCAGTT CCCGACCGCC TGGGCGCAGG CGGCGACCTG GGACCCCGCG CTGGTGACCC GCGCGGCGGC CGTGAGCGCC GCCCACATGC GCCAGACCGG CATCCACCTC GCGTTCTCCC CGGTCATGGA CCTGGCGCGG GACCCACGGT GGGGCCGGGT GCACGAGACG TACGGTGAGG ACCCCGAGCT CGCCGCCCAG TTCTCGGTCG CGTTCGTCCG GGGCATCCAG GGTTCCGAGG ACGACTCCGG CCTGCTCGCG ACCGGGAAGC ACTTCGCCGG GTACGGCGCC TCCGAAGGCG GCCTCAACCA GGCGGTCACC CAGCTGGGAC GCCGCGCCCT GGTCGACGAG TACGCCGAGC CCTTCCGCCG GGCCATCGCC GAGGCCGGCC TGGTCGCCGT CATGAACTCG TACAACGAGC TCGACGGCGT GCCCTGCGCC GCCGACCGCT GGCTGCTCAC CGACCTGCTC CGCGGCCAGC TCGGCTTCGA CGGCATCGTG GTCAGCGACT ACAGCGCCGT GGACATGCTG CGGACGATCT ACCACACCGC ATCCTCGGCA GGGCAGGCCG CCGTCCAGGC GATCAGCGCC GGCCTGGACG TCGAGCTCCC CAGCGACGTG AACTTCTCCC ACCTGGCCGA CGAGGTCACC GGTGGCCGTC TCGACGAACA CGTCCTCGAC ACGGCGGTCG CCCGGGTCCT CACCGTCAAG GCCCGCGTCG GGCTCATCCC GGGGATGGCG GCGCCACGCG CCGCGTCCGC CGCGGCGCCC CCGGACCGGG CCGAGGCGGC CGAGGTGCGC CGCGCCGTCG CCGCCCGCGG CCTGGTCCTG CTCGCCAACG ACGGCACCCT GCCCCTGGCG CCCCGGAGTG GACGCATCGT CGTCGTGGGC CCGGCCGCCG ACGAGCTGCG GATCCACTTC GGCGCCTACA CCTCCGTCTC CAACGCCGAG ATGCCGCTGG GGATGATGGC GGTGATGACG GGGCAGGTGC CCGGAGTCGA TCCGGCCACC TTCGTGTTCA CCGACATCTT CCAGCCCCGG ATGCCCGGGT TCGACGAGCG GTTCGAGGCA GAGGCCCGGC GGATCCACCC CGACGCGCCC ACCGTGCTCG ACGCGCTGCG GGGCTTCGAT CCGACCGTCG GGTTCGTTCC GCTCGGCCGG TTCGAGGCCG GCCCCGGGCC CGCGCTGGAC CAGGCGACGG TGACGGCGGC CGTCACCGAC GCCGACCTCG TCATCGCCGT CGTCGGCGAG CGGACCGGAT GGGTCGGGAA CAACACCGCC GGCGAAGGCC AGTCCACCGC GTCCCCCGAC CTGCCCGGCG ACCAGGACGA GCTCGTCGCT CTGCTCGCCG CGACCGGCAG GCCCCTGGTG ACCGTGATGG TCTCCGGACG TCCCCTCCTC CTCGACAGCG TGGCTCGGGC CTCGCGCGCG GTCCTTCTCG CGCCACTGCT GGGAGAAGAA GCCGGCCCGG CGATCGCCGG GGCCGTCTTC GGGACGATCA ACCCGAGCGG GAAGCTCCCA AGCACCTTCC CCCGGCACCT GGGCCAGGTC CCGATCTATC ACGGGCACCA CTTCGGCAGC GGCTACGACC ACCCCACCGG CACCCGGCAC GGCTACAACG ACCTCGACGA CGACGCCCCG CTCTACGCCT TCGGGCACGG CCTGTCCTAC AGCACCTTCG ACATCGCCCT CGACGAGTCG GCCGAACCGG CGGTCGAGGA GATCGACGGG CTGCTGCGGG CGCGGCTCAT CGTGTCGAAC ACCGGCACCG TCGACGGCGA GACCGTCGTA CAGCTCTACG CACGGGACGA AGCCGCGACG ATCGTCCGCC CCGTCCGCCA GCTCCTCGGA TTCACCCGCC TGGCCCTGGC CGCGGGAGAG ACCCGGCGCG TTCTCCTCGA AGCCCCGACC GAACGGCTGT TCTACACCAT GGCCGACGGC ACCCGTGGCC TCGAGGCCGG CGACGTCACC GTGCTGGCCG CGCTCAGCAG TGACGACGTC CGCTGCTCGC GGACCGTCCC GCTGTCCGCC CGCCGCGCCT GA
|
Protein sequence | MNRSPLSPGK VPDLVSRLTL EQKIAQLTGF AVTDLIVRGE GPGPASPDID VSRVPSLRPH GAGHLSLSWF LGHDAASLRS ALDRIQSAVR EVAPFGIGAL AHNEAVNGFL HVSGSQFPTA WAQAATWDPA LVTRAAAVSA AHMRQTGIHL AFSPVMDLAR DPRWGRVHET YGEDPELAAQ FSVAFVRGIQ GSEDDSGLLA TGKHFAGYGA SEGGLNQAVT QLGRRALVDE YAEPFRRAIA EAGLVAVMNS YNELDGVPCA ADRWLLTDLL RGQLGFDGIV VSDYSAVDML RTIYHTASSA GQAAVQAISA GLDVELPSDV NFSHLADEVT GGRLDEHVLD TAVARVLTVK ARVGLIPGMA APRAASAAAP PDRAEAAEVR RAVAARGLVL LANDGTLPLA PRSGRIVVVG PAADELRIHF GAYTSVSNAE MPLGMMAVMT GQVPGVDPAT FVFTDIFQPR MPGFDERFEA EARRIHPDAP TVLDALRGFD PTVGFVPLGR FEAGPGPALD QATVTAAVTD ADLVIAVVGE RTGWVGNNTA GEGQSTASPD LPGDQDELVA LLAATGRPLV TVMVSGRPLL LDSVARASRA VLLAPLLGEE AGPAIAGAVF GTINPSGKLP STFPRHLGQV PIYHGHHFGS GYDHPTGTRH GYNDLDDDAP LYAFGHGLSY STFDIALDES AEPAVEEIDG LLRARLIVSN TGTVDGETVV QLYARDEAAT IVRPVRQLLG FTRLALAAGE TRRVLLEAPT ERLFYTMADG TRGLEAGDVT VLAALSSDDV RCSRTVPLSA RRA
|
| |