Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3292 |
Symbol | |
ID | 5671664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3900100 |
End bp | 3903213 |
Gene Length | 3114 bp |
Protein Length | 1037 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242181 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001507601 |
Protein GI | 158315093 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGAGA CTTCGCCCGG ATCTCTGCCG GTCTATCGTG ACCCGGCGCG GTCGACAGCG GCTCGGGTCG CCGATCTGAT CGAGCGGATG TCGCTCGAGG AGAAGGTCGC CCAGCTGCGG TCGATCTGGA TCTCGAAGTA CAAGGTTGTG CTGCCCGACG GCACCTTCGA CCCGGCGCGG GCCCACGTGC TCATACCTGA TGGCATCGGA TTCGTGGGAC GGCCGGTGGA CGCGATGGGG ATGGCGGGCT TTCCGGCGAA CTGGCATCGG AGCCGGGAGG AGACCATCGC GTTCGTCGAC GCGGTCCAGC GGTACCTGGT CGAGGAGACC CGCCTCGGCA TCCCGGCCCT GTTCCACGAC GAGACGGCGC ACGGCTTCGT CGCCCGCGGT GCGACGATCT TCCCGATTCC CCCGGCTCTG GCGAGCACCT GGGACGAGGA TCTGGTGGAG GAGGTCTTCA CCGTGGTGGC CCGGGAGGCC CGGTCCGTCG GATCCACCGT GTCCCTCGGT CCTGTGCTCG ATCTGGCGCG GGATCCGCGC TACGGCCGTG TGGAGGAGTT CTTCGGGGAG GACCCGTATC TGGTAGGCCG GATGGGAGTC GCCGCCGTGC GGGGGTTGCA GGGCCGGTCC CGACCGCTCG CGGCCGACCG GATGTTCGCC ACACTCAAGC ACTTCCTCCA CGCGTCGCCT GAGGGTGGGA TCAACGCGGC GCCCGCGCCG GCTCATGAGC GGTCCCTGCG GGAGACCTAC CTGGCTCCGT TCGTCGACGT CGTCCGCGAG GCCAATCCGG CCTTCATCAT GCCCTCGTAT AACGAGGTCG GCGGGCTGCC GTCGCACGCC AGCCGCGACC TCCTGCAGCG GCTGGGGCGC GCTCTGCTCG GGTTCGAGGG TGTGTACCTG AGTGACTACG ACGCGCTTGC GCGGCTGATC TCGGATCATC GGGTGGCCGC CGGTCTCGGG GAGGCGGCGG CGATCGGGCT GACAGCAGGC GTCGACGTGG ACCTCCCGGA CGGCGAGGCA TTCTCGATGC TCGCCCCGCT GGTCCGGGAG GGCCTCGTCG ACGAGACCCT CGTCGACGAG GCCCTCGCCC GGGTGCTCGC GCTCAAGTTC GAGGCCGGTC TGTTCGAGCA GCCGTACGGG CGCCTGGAAC AGGCGGAGTA CAACAGCGCC GAGGCGGTCC GGTTGGCACG CAACTCGGCG ACGCGTGCGC TCACCCTCCT GACGAACGAC GGCATCCTGC CGCTCGATCC GAACGCCGAG ATCCGGCTCG CCGTCGTGGG CCCGAATGCC GGCGAGCTCT ACTACGGCGG GTACTCGGGC GAGAACGACG CTGGGGTGAG CGTGCTCGAC GGCCTGCGGG CGGCCATCGT CGGAAGCGCG ATCACGGTCG AGCACGCGGA GGGCGTCCGG CTGGTGGGCG CCGAGGAGGA GGCGGCCATG CCCGGCCCGG GCCGCGCGCC CGTGCTGCCG GTCGACGACG CGGAGAACCG GCGGCGGATC AAGGACGCCG TCGCTGTCGT CGAGCGGGCG GACGTCGTGC TCCTCGTGGT CGGCGACCAC CCGGCCATCG CCCGTGAGAC GACCCGGCCC CTCTTCCCGG GTGACCGCAA CGAGCTCGGT CTCTACGGCC TGCAGGAGGA ACTCGTCGAG GCCGTTGTGC AGGTCGGGAA GCCGGTCATC GCCCTGCTCG TCAACGGCCG GCCGATCGCC GCCACCCGGC TCGCGGCGGG CGCGAACGCT CTGCTCGAGG GGTGGTACCT GGGCCAGGAG ACGGGGAACG CCGTCGCGGA CGTGCTCTTC GGGCGGGCCG AACCAGGTGG TCGACTCCCG GTCTCCGTGC CCCGTGCCTC GGGTGCTGTG CCCGTCTACT ACGACCGCCA CACCTCGGCC AACCTGTACC CGTACGTCGA GGTGGACCGG ACGCCGCTGT TCCCGTTCGG TCACGGCCTG GGCTACACCA CGTTCGACAT CTCCGAGCCG GTGCTCGACC GGTCCAGCAT CCACGTCGGG GAATCCGTCG GCATCAGCGT CGAGGTGAGC AACACCGGGA ACCGCGCCGG GGACGAGGTG GTCCAGCTCT ATGTCCGGGA CGACGTCTCG TCGGTACCGA GACCGGAACT GCAGTTGCGC GGCTTCCGCC GCATCACGCT GGAACCGGGG CGCTCGACGA CGGTGCGGTT CGTCCTCGAG CCACACCAGC TCGCCTTCTG GAACATCGAC CTCACCGAAC GGATCGTGGA GCCGGGAACA TTCACCATCT CGGTGGGCCG GAGCTCGACG CAGCTCCGTT CCGTGACGCT GTCCGTGGCG GGGCCCGGTA GCGAGCGAGC CCGGCGGCGT CGTGGGCGGC CCGGTGGAAT GTGCCGGTAC TTCCTCGGCG GCAGGGCTGT CCGCGTGATC CGGCCCATGT CGTGGCAACG AGAAGAAGCC CGGTGTCAGC TGTCGTTACT GAAACGTTCC TGTTCCCCTG GGGTGAGGCT GGTGTCGTTG AGTGCGCCGA CGCGTCTGCG CGTCGAACAT CTGGACGAGG CGTTCGGGAC AGAGGTCCGT CGTCCGCGGC TGTCGTGGTG GCTGCCGGCG GGCTCCGCCC GCCAGACGGC CCACCGCATC AGCACGGGGG AGTGGGACTC GGGGCGGATC GAGAGCGACC GGGCCAGGTG GTGGTGTGGC GGGTCAAGGT GTGGACGGAT CTCGGCGAGG GCAGTTGGTC CCAGACCTGT TCCTGGGAGG TGGGGATCGG GCCGGACGAG TGGGTGGCGC GGTGGATCGA ACCGGTCGAG TACGAGACAC CCGTGCCGGG GCACCGCCCC GCCTATCTGC TCCGGCACGG GTTCGACCTC GACGGGCCGC TGGCCCGCGC CCGGTTGTAC GCGACGGCGC ACGGGTTCTA CGAGTTCTTC GTCAACGGCA TGCCCTGAGC GGCGCCGCCA GCAGCCAGTG ATGATGTGCC GTACCCGATG GCGAATCCGG ACGACCTCGA GCAGACCGTG CAGGCGGTCG CGTCGATCGG CGGGCGGATC GTCGCCCGGC AGGCTGACGT CCGTGACGTG TCCGCGCTGC GGCGGGCCTT CGAGGAGAGC GTCGGCGGGC TGGGCCCGGT CGACATCGTG CTGA
|
Protein sequence | MQETSPGSLP VYRDPARSTA ARVADLIERM SLEEKVAQLR SIWISKYKVV LPDGTFDPAR AHVLIPDGIG FVGRPVDAMG MAGFPANWHR SREETIAFVD AVQRYLVEET RLGIPALFHD ETAHGFVARG ATIFPIPPAL ASTWDEDLVE EVFTVVAREA RSVGSTVSLG PVLDLARDPR YGRVEEFFGE DPYLVGRMGV AAVRGLQGRS RPLAADRMFA TLKHFLHASP EGGINAAPAP AHERSLRETY LAPFVDVVRE ANPAFIMPSY NEVGGLPSHA SRDLLQRLGR ALLGFEGVYL SDYDALARLI SDHRVAAGLG EAAAIGLTAG VDVDLPDGEA FSMLAPLVRE GLVDETLVDE ALARVLALKF EAGLFEQPYG RLEQAEYNSA EAVRLARNSA TRALTLLTND GILPLDPNAE IRLAVVGPNA GELYYGGYSG ENDAGVSVLD GLRAAIVGSA ITVEHAEGVR LVGAEEEAAM PGPGRAPVLP VDDAENRRRI KDAVAVVERA DVVLLVVGDH PAIARETTRP LFPGDRNELG LYGLQEELVE AVVQVGKPVI ALLVNGRPIA ATRLAAGANA LLEGWYLGQE TGNAVADVLF GRAEPGGRLP VSVPRASGAV PVYYDRHTSA NLYPYVEVDR TPLFPFGHGL GYTTFDISEP VLDRSSIHVG ESVGISVEVS NTGNRAGDEV VQLYVRDDVS SVPRPELQLR GFRRITLEPG RSTTVRFVLE PHQLAFWNID LTERIVEPGT FTISVGRSST QLRSVTLSVA GPGSERARRR RGRPGGMCRY FLGGRAVRVI RPMSWQREEA RCQLSLLKRS CSPGVRLVSL SAPTRLRVEH LDEAFGTEVR RPRLSWWLPA GSARQTAHRI STGEWDSGRI ESDRARWWCG GSRCGRISAR AVGPRPVPGR WGSGRTSGWR GGSNRSSTRH PCRGTAPPIC SGTGSTSTGR WPAPGCTRRR TGSTSSSSTA CPERRRQQPV MMCRTRWRIR TTSSRPCRRS RRSAGGSSPG RLTSVTCPRC GGPSRRASAG WARSTSC
|
| |