Gene Information |
Locus tag | Franean1_0435 |
Symbol | |
ID | 5668858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 514801 |
End bp | 517170 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641239367 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001504806 |
Protein GI | 158312298 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
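The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the stop codon. Both assumptions fit the numbers in the table but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 514801
END_BP = 517170


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2370, matching "Gene Length"
print(protein_length(length_bp))  # 789, matching "Protein Length"
```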
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCCCA CCTGGCGCGA CGAGCACCTG GAGGCCGGCC ACCGGGCCGA GCTGCTGCTG GCCGAGATGA CCACCGCCGA GAAATGCCAT CAGCTCACCG CCCGGATGGC GTGGTCGCTG GTGAAGGCCG ACGGCACCGA CGCCGACGGC GCCGAGGACG TGCTGCGCCA TCCGCCCGGG CATGTGGCGC AGCTGATCCT CGACGACCCG GCGCAGTTGG CCGCGATGGT CGGCACGATC CAGAAACAGG TGCTGTCCCG TAGCCGGCTC GGAATTCCGG TGCTCTTCCA CGCCGAGGCG CTCAGCGGCT TCCTCAACGG TGGGCACATG GTGTTCCCCT CCGGCACAGG GCTCGCCGCC GGATGGAGCC CCGACCTCGT CGAAGAGATG ACCGACCTGA TCCGGCGGCA GATGCGCCGC ACCGGTCTGA TGCACGCGCT GTCACCGGTG CTCGACGTGG CCCTCGACCC GCGGTGGGGA CGCGTGCACG AGACGTTCGG TGAGGACCCG TACCTCTGTG CGGCCTTGGG GGTGGCCTAT GTGCGCGGCC TGCAGGGGAC CGACGTCTCG CAGGGCGTGA TGGCCACCGG CAAGCACTTC CTGGGATACG CGGCGCCGCC GGGCGGCCTG AACCTGTCGG CCTTCGAGTC CGGCGCCCGG CGCACGCGTG ACCTGTTCGC CTATCCGTTC GAGGCGGCGA TTCGCCTGGC CGGGCTGCGG TCGGTGATGA ACTCGTACGC CGACGTCGAC GGCGTACCCG CCGGGGCCTC GCCGGAGGTG CTGCAGGACC TGCTGCGCGA CACGCTCGGC TTCGACGGCT TCGTCTCCTC CGACTACACC ACCCTCGAGC AGCTCGTCGA CCGGCAGCGG ATCGCGAAGG ACGCGGCCGA GGCCGGCCGC CTGGCCATCC GGGCCGGGCT CGACGTCGAG ATGCCCAATC CGTACGGGTA CGGCGACGTG CTCGCCGCCG AGGTCGAGCG CGGGGTCGTC GACGTGCGGC ACGTCGACAA CTGCGTCCGC CGGGTCCTGC GGGCGAAGTT CGAGGTCGGG CTCTTCGAGC ACCCGTACCC GGTGGAGAGC ATCGACGTGG CCGCGGCGGC CCGCGAGGGC GCCGACCTGT CGTTGGAGCT GGCCCGGCGC TCGATCGTGC TCGGCCAGAA CAACGGCATC CTGCCCCTGA CCCCCGGCGG GCTCGACATC GCCGTCATCG GGCCGCACGC CGACGCGCCG ACGTTCCAGT TCCCGACCTA TACGTACGTG TCGTGGCGCG AGGCCAACGA GGCCGTGATG CGCGGCGAGC TCGGCACCAT GAACGGCGCC GAGGAGACCG TGTCGGCCTG GTATTCGGCC CTGTTCGGAG CGTGGGACGG CGACAGCGTG GCTCCCTCGC TGGCGGCTGA GATCGGTGAT CACGCCCGCA CGGTCCGGGT CGAGCGCGGC TGCACCCTGA CCGAGGATCT AGGTTCTCTC GACCCGGCGG TCGAGGCGGC GCGGGCGGCG GACGTCGTCG TGCTGGCGCT GGGCGGGGCG AGCCTGTGGT TCACGGGGGA GCGGACCGAG GGGGAGGCCA GCGACACCGC GGACATCACC CTGCCGGCGG CGCAGGTGCG GCTGGCCGAG GCAGTGGCGG CCACCGGCAC CCCGCTGGCG GTCGTGCTCG TGCAGGGCCG GGCCTACGCG CTGCCGAAGG TCATCCAGGA CGCCGCGGCC ATCGTCGTCG CCCCGTACGC CGGACCGTTC GGCACCCAGG CGATCGCCGA CGTGCTCTTC GGCGTGGTCA ACCCCTGCGG CAAGATGCCG 
TACAGCCTGC CCCGGCACAG CGGCCAGATC CCGGTCTACC ACCACCAGAA GGCCGGGTCG GGCTACCGCA ATCCGTTGCC GCCCAGCGTC TCCCGCCACT ATCTCGACCT GGAGGCGACG CCGTTGTGGC CGTTCGCCCA TGGGCTGAGC TACACCACGT TCGCGCTGAG CGACCTCGAC TGCGGTGCCG ACATCGACGC GTACGGCACG GCCCACGTCA GCGCCACCGT CGCCAACACG GGGCCACGGG ACGGGGCCAC GGTGATGCAG CTCTACCTGC GGGTCAACAC CTCACCGGTG ACCCGGCCCG CCCAGCAGCT CGCCGGCTTC ACCCGGGTCG AGCTGGCCGC CGGCGAGAGC CGCCGCGTCA CGTTCGAGAT CGACGCCTCG CAACTGGCGT ACACCAGTCT CGCCCGCCAG GTGGCGGTCG AGCCGGCCCT GGTGGACGTC TTCGTCGGCC TCGACTCCGA CGACCGGTCG CTGGCGGGCT CGTTCGCCGT GGTGGGGTCC GCGCGGGTGG TCGAGGGCGC CGAGCGGTCC TTCCTCTCGC GAGCACGCCC GCACCCATAG
|
Protein sequence | MSPTWRDEHL EAGHRAELLL AEMTTAEKCH QLTARMAWSL VKADGTDADG AEDVLRHPPG HVAQLILDDP AQLAAMVGTI QKQVLSRSRL GIPVLFHAEA LSGFLNGGHM VFPSGTGLAA GWSPDLVEEM TDLIRRQMRR TGLMHALSPV LDVALDPRWG RVHETFGEDP YLCAALGVAY VRGLQGTDVS QGVMATGKHF LGYAAPPGGL NLSAFESGAR RTRDLFAYPF EAAIRLAGLR SVMNSYADVD GVPAGASPEV LQDLLRDTLG FDGFVSSDYT TLEQLVDRQR IAKDAAEAGR LAIRAGLDVE MPNPYGYGDV LAAEVERGVV DVRHVDNCVR RVLRAKFEVG LFEHPYPVES IDVAAAAREG ADLSLELARR SIVLGQNNGI LPLTPGGLDI AVIGPHADAP TFQFPTYTYV SWREANEAVM RGELGTMNGA EETVSAWYSA LFGAWDGDSV APSLAAEIGD HARTVRVERG CTLTEDLGSL DPAVEAARAA DVVVLALGGA SLWFTGERTE GEASDTADIT LPAAQVRLAE AVAATGTPLA VVLVQGRAYA LPKVIQDAAA IVVAPYAGPF GTQAIADVLF GVVNPCGKMP YSLPRHSGQI PVYHHQKAGS GYRNPLPPSV SRHYLDLEAT PLWPFAHGLS YTTFALSDLD CGADIDAYGT AHVSATVANT GPRDGATVMQ LYLRVNTSPV TRPAQQLAGF TRVELAAGES RRVTFEIDAS QLAYTSLARQ VAVEPALVDV FVGLDSDDRS LAGSFAVVGS ARVVEGAERS FLSRARPHP
|
| |
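The sequence fields can be cross-checked against the GC content and translation table entries. The following is a minimal sketch, not the database's own method: it strips the 10-base spacing used in the listing, computes GC percentage, and translates with the amino-acid assignments of NCBI translation table 11. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCCCCA CCTGGCGCGA CGAGCACCTG"))  # MSPTWRDEHL
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence, or passing it to `gc_content` and comparing with the GC content field, gives a quick consistency check on the record.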