Gene Information |
Locus tag | Franean1_3192 |
Symbol | |
ID | 5671568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3766259 |
End bp | 3768694 |
Gene Length | 2436 bp |
Protein Length | 811 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242086 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001507506 |
Protein GI | 158314998 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
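The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for this illustration; they are not part of the source database.

```python
# Values copied from the Gene Information table above.
start_bp = 3766259
end_bp = 3768694
gene_length_bp = 2436
protein_length_aa = 811


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record: 3768694 - 3766259 + 1 = 2436,
# and 2436 / 3 - 1 = 811.
coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_consistent, protein_consistent)
```

Under these assumptions, the listed gene length follows from the coordinates, and the listed protein length follows from the gene length.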
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTACG ACGCCACGCT GACACCTGAA GCCCGGGCGG ACGCGCTTCT CCGGACGATG ACGGTGGAGG AGAAGGCGCA GCAGATCACG GGTCTGATGC CGGTCGGGCT GCTCGGCGTG GACGGCCTGG TCCAGACGGA GGCCGAACGC CGCCTGGGCA CCGGGATCGG CCACATCGCG CCGCTCGGGA TGCTCAGCCA CCGGACGCCC GCGAACCTGG CGAAGGCTGT CAACGAGATC CAGCGTTTCC TCGTCACCCG GACCCGCCTG GGCATTCCCG CGCTCTTCCA CGTGGAGGCG CTGAACGGTG TGGTGTCCCC TGGTCTGACC ACGTTCCCGA CCGCGATCGG GCTCGCCGCC ACGTGGAACC CCGCCGGGGT GACGGAGATG GCGGACGTCC TGCGCCGGCA GGCACGGGCC ATCGGTCACC CGCTCGTCCT GTCCCCGGTG ATGGACGTGG CCCGCGACGC CCGCTGGGGC CGGGTGCACG AGACCTACGG CGAGGACCCG TACCTGGTGT CCGCGATGAG CGTCGCGTTC GTCCGCGGCA TCCAGGGCGA CGACCCGCGC GAGGGCGTCA TCGCGACCGG GAAGCACTTC CTCGGCTATG CCCTCACCGA GGCCGGGCAG AACATGGCCC GCACCGCCGT CGGGGCCCGG GAGCTGTACG AGGTGTACGC CCGCCCGTTC GAGGCCGCCA TCAAACTGGC CGGCCTCGCG GCGGTCATGA ACTCCTACAG CACCGTCGAC GGCGTCCCGG TCGGTGCCAA CCGCGAGATC CTCACCGGCC TGCTGCGGGA CCGCCTCGGC TTCGCCGGCA CCGTCGTCTC CGACTACGAG ACCATCCGCC ACCTGTACAA GCGGCTCGGT GTGGCGCGGG ACGCCGAGGA GGCCGGCCGG CTCGCCCTCG CCGCCGGGCT CGACGTCGAG CTTCCCGTCG CCGACGGCTA CGGCCCCACG CTCGCGCGGG CGGTCCACGC CGGCACGGTT CCCGTCGATC AGCTGGACCA GGCCGTGTGG CGGGTCCTGC GGGACAAGTT CGCGCTGGGC CTGTTCGACC GGCCCTACGC CGACGAGGAT CCCGTCGTCG TCAACGAGGT CGCCCGCCAG GGTGTCGACC TCTCCTACCG GCTCGCCCAG CAGTCCGTGA CCCTGCTGGC GAACGACGGC ACCCTGCCGC TGTCCCGGGA CCTGCGCCGG ATCGCGGTCG TCGGCCCGCA CGCCGACGGG ATCTCCTTCG CGTTCCCGCC CTACACCTAC CCCGCCGCGC TGGAGATGCT CCGGGCCCGG TTCACCGGCG AGCGGGCCCA CATACCCGGC ACCGAGAACA TGGCCGGTGA CATCACCCCC GAGGCGGCCG CGCTGATGCG CCAGGAGCTC GCCGGCCCCA TCGGGACACC GATCGACGAC TACATCCGCG ACGCCTACGG CGCGCTGTCC CTCGCCGACG CCGTCCGGCG GGCCGTCCCC GGCGCGCAGG TGACCGTGGC CACCGGCTGC GGGGTCCTCG ACGAGGAGCC GGCCGACATC CCGGCCGCGG TCGCCGCCGC CGCGGGCGCC GACGTGGTGA TCCTCGCCCT CGGCGGGCGG GCCGGCTGGT TCACTCCCCG GATCACCGAG GGCGAGGGCT GCGACACCGC CGACATCGAC CTTCCCGCGA ACCAGATCGC GCTCGTCCAG GCCGTCGCAG GCACCGGGAC TCCCTGCGTG GGCATTGTGT ACACCGGCCG GCCGATGGCC CTGACCCCGA TCGTCGGGCT GCTCCCGGCG CTGCTCTACG GCTACTACGG CGGCCAGCAC 
GCCGCCACCG CCATGGCCGA CGTGCTGTTC GGCAGCGTCA ACCCGGCCGG CAGGCTCCCG ATCTCCATCC CGCGGCACTC CGGGCAGGTG CCCGTCTACT CCGGCCAGCC CACCGGCACC GGCTACCGGC GCACCGACCA GGACATGCAC CTGGGCTATC TCGACATGCC GTCAGGTCCC CTGTTCCCGT TCGGCCACGG GCTGAGCTAC ACCACCTTCG ACTACACCGA CCTCACGGTC AGCTTCCCCG AGGTCGACAG CGAGGGCGCC GTCACCGTCG GGCTGACCGT CCGCAACACC GGCGCACGGG CCGGCGACGA GGTCGTGCAG CTCTACTTCT CCGACCAGGC CACCGGGGTC ACCCGGCCGG CCCAGGAGCT GGTCGGCTTC ACCCGCCTCA GCCTGGACGC CGGCGCGGCC GCGACCGTGG CGTTCACCGT GCCCATGAGC CAGCTCGGCT ACGTCGCCCT CGACGGCGGC TTCGTTCTCG AACCCGGCCC CATCCAGATT CTCGCGGGCA GCTCGTCGGA CGACATCCGC CTGCGCGGCA GCTTCGACGT CGCCGGAAAA GTCGCCGAAC TGGACAGCCG CCGTTCCTTC CTCTCGGACG TCACCGTCAG CGACACACGC CCTTGA
|
Protein sequence | MTYDATLTPE ARADALLRTM TVEEKAQQIT GLMPVGLLGV DGLVQTEAER RLGTGIGHIA PLGMLSHRTP ANLAKAVNEI QRFLVTRTRL GIPALFHVEA LNGVVSPGLT TFPTAIGLAA TWNPAGVTEM ADVLRRQARA IGHPLVLSPV MDVARDARWG RVHETYGEDP YLVSAMSVAF VRGIQGDDPR EGVIATGKHF LGYALTEAGQ NMARTAVGAR ELYEVYARPF EAAIKLAGLA AVMNSYSTVD GVPVGANREI LTGLLRDRLG FAGTVVSDYE TIRHLYKRLG VARDAEEAGR LALAAGLDVE LPVADGYGPT LARAVHAGTV PVDQLDQAVW RVLRDKFALG LFDRPYADED PVVVNEVARQ GVDLSYRLAQ QSVTLLANDG TLPLSRDLRR IAVVGPHADG ISFAFPPYTY PAALEMLRAR FTGERAHIPG TENMAGDITP EAAALMRQEL AGPIGTPIDD YIRDAYGALS LADAVRRAVP GAQVTVATGC GVLDEEPADI PAAVAAAAGA DVVILALGGR AGWFTPRITE GEGCDTADID LPANQIALVQ AVAGTGTPCV GIVYTGRPMA LTPIVGLLPA LLYGYYGGQH AATAMADVLF GSVNPAGRLP ISIPRHSGQV PVYSGQPTGT GYRRTDQDMH LGYLDMPSGP LFPFGHGLSY TTFDYTDLTV SFPEVDSEGA VTVGLTVRNT GARAGDEVVQ LYFSDQATGV TRPAQELVGF TRLSLDAGAA ATVAFTVPMS QLGYVALDGG FVLEPGPIQI LAGSSSDDIR LRGSFDVAGK VAELDSRRSF LSDVTVSDTR P
|
| |
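The sequences above are shown in blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and summarised by GC percentage. It assumes the amino acid assignments of translation table 11 (which match the standard code) and does not handle alternative start codons or ambiguous bases. The names `clean`, `gc_percent`, and `translate` are hypothetical helpers, not functions of any particular library.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: amino acid assignments of translation table 11,
# which are the same as those of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACGTACG ACGCCACGCT GACACCTGAA"
print(translate(first_blocks))  # MTYDATLTPE
```

The translated fragment matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field in the Gene Information table.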