Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4660 |
Symbol | |
ID | 5673003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5561378 |
End bp | 5562619 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243518 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001508934 |
Protein GI | 158316426 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.503774 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAT TCCCCGAAGG CTTCCTCTGG GGTGCGTCGA CCGCGGCGCA CCAGGTGGAG GGCGGCAACG TCAACTCCGA CATGTGGCAC AGCGAATGGG CGGAGAACTC GACGTTCGCC GAGCCGTCGG GAGACGCCTG CGACCACTAC CACCGGTACC CCGAGGACAT CGCGACCCTG GCCGGCCTCG GCCTGAACGC CTACCGGTTC GGGGTCGAGT GGGCACGGAT CGAACCGGAG GAGGGCTATT TCTCCCGGGC CGCCCTCGAC CACTACCGCC GGATGGTCGG CAGCTGCCTC GAGCACGGCG TCACCCCGGT CGTGACCTAC AGCCACTTCT CGACACCGCG GTGGTTCGCC GACGCGGGAG GATGGGGCGA CCCGGCGGCG GCGGACCGGT TCGCCCGGTA CGCGGGCCGG GTGACCGAGC ACATCGGTGA CCTCGTGCCC TGGGTGTGCA CGTTCAACGA GCCGAACGTC ATCTCCCTGA TGGTGCATCT CGGTGTCATC CCGGCCGCGT CCCGCGACGA GGCCCTCGGC CTGCCGACCG GCGACGAACG CCAGGATCCC GGCGGCGGGG CAGGGGCGGG CGGGGCACGG TCGGGCGCGG CGTGGGCCGC CCCGAGCGTC GAGGTGATGG CGACCGCGCA CCGCAAGGCC GTGGAGGCCA TCAAGTCCGG CCCGGGGAAC CCCGCCGTCG GCTGGACGCT GGCCCTCATC GACCTCCAGC CCGCCGACGG CGGTGAGCAA CGCTGGCAGG CGGTACGCCA GGCGGCCCTG CTCGACTGGC TCGACGTCTC CCGCGACGAC GACTTCGTCG GCGTCCAGAC CTACACCCGG GAACGCGTCG GACCCGACGG TGTCCTGCCC GTTCCCACCG GAGCCCCCAC CACGCAGACC GGCTGGGAGA TCTACCCGCA GGCGCTGGGC CACACCGTCC GCCTCGCCGC CGAACACGCC GGTGTGCCGA TCCTGGTCAC CGAGAACGGC ATGGCCACCG ACGACGACGA CGCCCGGATC GCCTACACCA CCGCCGCCCT CGACGGACTG GCCGGTGCCA TCGCCGACGG TGTCGACGTC CGCGGGTACC TGCACTGGAC GCTGCTCGAC AACTTCGAGT GGACGTCCGG CTACCAGATG ACCTTCGGGC TCGTCGCCGT CGACCGCACC ACCTTCGCCC GCACCGTCAA ACCCTCCGCC CGCTGGCTCG GCAAGGTCGC CCGCGCCGGC GGACTCACCT GA
|
Protein sequence | MSTFPEGFLW GASTAAHQVE GGNVNSDMWH SEWAENSTFA EPSGDACDHY HRYPEDIATL AGLGLNAYRF GVEWARIEPE EGYFSRAALD HYRRMVGSCL EHGVTPVVTY SHFSTPRWFA DAGGWGDPAA ADRFARYAGR VTEHIGDLVP WVCTFNEPNV ISLMVHLGVI PAASRDEALG LPTGDERQDP GGGAGAGGAR SGAAWAAPSV EVMATAHRKA VEAIKSGPGN PAVGWTLALI DLQPADGGEQ RWQAVRQAAL LDWLDVSRDD DFVGVQTYTR ERVGPDGVLP VPTGAPTTQT GWEIYPQALG HTVRLAAEHA GVPILVTENG MATDDDDARI AYTTAALDGL AGAIADGVDV RGYLHWTLLD NFEWTSGYQM TFGLVAVDRT TFARTVKPSA RWLGKVARAG GLT
|
| |