Gene Information |
Locus tag | Franean1_3257 |
Symbol | |
ID | 5671631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3854969 |
End bp | 3858022 |
Gene Length | 3054 bp |
Protein Length | 1017 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242149 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001507569 |
Protein GI | 158315061 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTGGGA CCGACTTCAC CGCAGTGCCG TGGGAGGACA GCGGCTTCCA GGCCCTGGGT CGGCTGCCGA TGCATTCGAT CAGGCGACCC GCCGAGGTCG AGCTGGACGG GGTGTGGGAC TTCCAGTTGC TGGACAGCCC GACCGCGCCC CCCGGTGCGG AGTGGAAGCG CGCCGACGTC CCCGGCCTCT GGACGATGAG CGAGGCCGGC GACCCGGCGC ACTACACGAA TGTCCGGATG CCGTTCGACG AGACGCCCCC GCGCATCCCG GCGCGCAACC CCACCGGGGT CTACCGGCGT TCCTTCGAGC TGCGCCCCGC CGCGGGCCGC CGGGCGATCC TGCATGTCGG GGCGGCGGAG GGCCTGCTGC GGGTGTTCGT GAACGGGCGC GCGATCGGCG TCTCCACTGA CTCGCATCTC GCCGCCGAGT TCGACATCAC CGAGGCCTGT GTCGCCGGCG GACGGGACCG GCACACGGTG GAGCTGGTGA TCTCCAAGTG GTCGTCGGTC TCCTATCTCG AGGACCAGGA TCACTGGTGG CAGTCGGGTA TCACCCGGTC GGTGTACGTC TACACGCTCC CGGAGATCCG GCTGGCCGAC CTGGCTGTCG TCGCCGACTT CGATCCCGAG GCGCGTCGCG GGACGCTGCG CCTGGAGGTG TCGACCGCCG GTCTCGACCA CCTGCCGGAA CTCGCGTGGA CCGTGCGGGT CGACGTCCTG GGGCGCCGAT CGACGCTGCC GGTCACCCCC TGCTCCCCGG CTCCGGGACT CCCACCGCCC TGGGACGACC GGTCGGTCCG GCCCGAGCCC CGCGTCCCGG CCGACTTCAT GAGCCTGGTC TCCCAGATAG CCGCGGACGC CCCGATCCCG CCGAGGTGGT CGGCGGCGGT TCCGTCCCTG AGACGCAGCC TCGCGCCCCG GCGGCCTGCC GGCACCGCCA CGCTCCACCT CGACGGCCTC GAGGTGGAGC CGTGGTCGGC CGAGTCTCCC CATCTGGAGG ATCTCGTCGT CGCGCTCGTC TCCCCCGGCG GCGAGGTCGT GGACGAGACC CGCACCAGGA TCGGCTTCCG GCGGGTCCGG ATCGAGGGCC GGGATCTGCT GGTCAACGGC GGACGGATCC TCATCCAGGG GGTGAACCGC CACGACACGG ACGCCCGCAC CGGCCGGGTT CTCTCGGCGC GGACGATGCT CGCGGAGCTC TCCCTGCTCA AACGGTTCAA CGTGAACGCG ATCCGCACCT CGCACTATCC CAACGACCCG CGCCTGCTGG AGCTGTGTGA CGAGCTCGGC TTCTACGTCG TCGACGAGGC GGACATCGAG GCGCACGCCT TCGCCAACGC GATCTGCGAC GATCCGCGCT ACCTGCCGGC TTTCCTCGAC CGGGTGTCGC GGATGACGCT GCGCGACCGC AACCATCCGA GCGTGATCGT CTGGTCGCTC GGCAACGAGA CGGGCTACGG AGCGAACCAC GACGCCGCCG CCGGATGGCT GCGCCGCTTC GACCCGACCC GCCCCCTGCA CTACGAGGGG GCGATCGCCC TCGACTGGCA CGGCGGCCGG GCCGCGACCG ACATCGTCTG CCCGATGTAC CCCTCGTTCG AGGCGCTGGC CGCCTTCTCC GCCGATCCGC GGGCCGACCG TCCGGTGATC CTGTGCGAGT ACGCCTACTC GCAGGGGAAC TCGACCGGAG GGCTGGCCGA GTACTGGGAG ATGTTCGAGA CGCTGCCCGG CCTGCAGGGC GGGTTCATCT GGGAGTTCAA GGACCATTCC CTCGATCCCG CGGGCGACGG CGGGTACCGC 
TACGGCGGCG ACTTCGGGGA CGTCCCCCAC GACGGCGCGA CCCTGGTGAA CGGCATCGTC TTTCCCGACC TCACCCCCCA GCCCGCGCTG TACGAGGCAC GCGGCCTGTT CAGCCCGGTA CGGATCGTCT CGGACGCCGC GGCCGCGCTC GCGGGGGCGA TCGGCATCCG CAACCGCCAG TCCTTCAACG ACCTGAGCGC CTACGTCCTG GAACTACAGG TGGAGAACGA CCGGTCGACG GACCCGGTCA CCGTCGACGT GCCGGACGTC GCGCCCGGGG CGACCGGGAC CATCGAGCTG CCCGACCCGA TCCGCAAGCT GCTCGGGTCC GCACCGCCGT TGGCCCTGAC CCTGACCGTG CGGACCAGGC AGGGCGCGCG GTGGGCACCG GCGGGCACCG TCGTCGCCGC ACACCAGATC ACCTTCCCGC GGCCGCCGGT CGCCCTGCTG TCCGCCCCGG TTCCCGACGC CCTCCGCGTC GACCAGGACG GGTCCGTCGT ACACCCCCTG CTGCGGCGCG CCCCGGCACT GTGCCTGTGG CGGGCAGTCA CCGACAACGA CAAGTCCTTC TCCCTCGACC AGCGCTTCGT CCGCTCCGGC TTCTTCCGCC TCACCCCCGG GGCAGTCACG GCCGAAGCGG ACGGGGGCAG CCTGACGATC ACCACCGCCT ACGCCACCGC GTTCGGCACC GAGGTGACGC ACCGCCGCGT CATCAGCGCG CTGGCCGAGC ACGACTACCG GTTCGACGAG CACGTCCAGC TGCCCGCGGA CACCGAGGAC GCGCTGCGGG TCGGCGTGGA GTTCGAGCTC ACGCCCGGTT TCGACGACGC CCGCTGGGTC GGGCTCGGGC CGTGGGAGAA CTACCCCGAC CGCCGCTCCT CCGCGCTCCT CGGCTCCTGG CGGGAACGCA TCGACGACCT GGCCGTGCCC TACCTCGTTC CGCAGGAGAA CGGCGGGCGC GGGGCGGTGA GCGAGCTGTG CCTGAGCGGT CCGGCCGGCA CCGTGCGCAC CTTCCACCCG ACGCCGGTGC AGATGGCCGT CGGCCGGCAT CGCGTCGATC AGCTCGAGGC CGCGGCCCAC TGGTGGGAGC TCCCGCCCAG CGACGTCACC GTCGTCCACC TCGACGTGGC GCACCGGGGT GTCGGCACCG CCCAGCTGGG CCCGGACACC CGTCCCCGTC ACCGCCTCAC CGACCACGAG TACACATGGA CGTGGCGGCT ACGGCTCGAG TCGGCCGCCG GGGCCGCGGT CTGA
Protein sequence | MTGTDFTAVP WEDSGFQALG RLPMHSIRRP AEVELDGVWD FQLLDSPTAP PGAEWKRADV PGLWTMSEAG DPAHYTNVRM PFDETPPRIP ARNPTGVYRR SFELRPAAGR RAILHVGAAE GLLRVFVNGR AIGVSTDSHL AAEFDITEAC VAGGRDRHTV ELVISKWSSV SYLEDQDHWW QSGITRSVYV YTLPEIRLAD LAVVADFDPE ARRGTLRLEV STAGLDHLPE LAWTVRVDVL GRRSTLPVTP CSPAPGLPPP WDDRSVRPEP RVPADFMSLV SQIAADAPIP PRWSAAVPSL RRSLAPRRPA GTATLHLDGL EVEPWSAESP HLEDLVVALV SPGGEVVDET RTRIGFRRVR IEGRDLLVNG GRILIQGVNR HDTDARTGRV LSARTMLAEL SLLKRFNVNA IRTSHYPNDP RLLELCDELG FYVVDEADIE AHAFANAICD DPRYLPAFLD RVSRMTLRDR NHPSVIVWSL GNETGYGANH DAAAGWLRRF DPTRPLHYEG AIALDWHGGR AATDIVCPMY PSFEALAAFS ADPRADRPVI LCEYAYSQGN STGGLAEYWE MFETLPGLQG GFIWEFKDHS LDPAGDGGYR YGGDFGDVPH DGATLVNGIV FPDLTPQPAL YEARGLFSPV RIVSDAAAAL AGAIGIRNRQ SFNDLSAYVL ELQVENDRST DPVTVDVPDV APGATGTIEL PDPIRKLLGS APPLALTLTV RTRQGARWAP AGTVVAAHQI TFPRPPVALL SAPVPDALRV DQDGSVVHPL LRRAPALCLW RAVTDNDKSF SLDQRFVRSG FFRLTPGAVT AEADGGSLTI TTAYATAFGT EVTHRRVISA LAEHDYRFDE HVQLPADTED ALRVGVEFEL TPGFDDARWV GLGPWENYPD RRSSALLGSW RERIDDLAVP YLVPQENGGR GAVSELCLSG PAGTVRTFHP TPVQMAVGRH RVDQLEAAAH WWELPPSDVT VVHLDVAHRG VGTAQLGPDT RPRHRLTDHE YTWTWRLRLE SAAGAAV