Gene Information |
Locus tag | Franean1_7197 |
Symbol | |
ID | 5675498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8789630 |
End bp | 8790811 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641246034 |
Product | cellulase |
Protein accession | YP_001511422 |
Protein GI | 158318914 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
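The length fields above follow from the coordinates. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information table.
start_bp = 8789630
end_bp = 8790811

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon (3 bp) is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1182
print(protein_length)  # 393
```

Both results agree with the Gene Length (1182 bp) and Protein Length (393 aa) fields.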
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGTCC GGGCGGAAGA CGGCAACGGG CCGAAGCCGC GGTTGCGCGA GCCTCAGGTC CGGCCAGGTC GGCCAGGTCG CACCCGCCTC CGGCGCCGGG CCTGTGCCGC CGGGGCGACG GTGGTCGCGG TGCTCGCCCT CGCCGCCTGC GGCGGGGGCT CGAGCCCCAC CCCGGTGACA CCCAGCGTGA CCGCGGCCGC ACCGAACGTT CCCACCGGTC CGCTGACAGC GGTCGGACCG CAATGCCCGG CCGGCGACCC GGCCGGCGCC CAGCGGCCGA CCGAGACCGT CGTGCCCGCC GATCCCGCTC CCGCCGCGCC CTTCCTCGTC GACCACGGCA GCCAGGCGGC CGAGGAGGCG CAGCGCAACC CCGCGCGCGC CCAGATCCTC GCGCCACTGG TCAACACCCC CACCGCCTAT CCGGTCGGTG ACTGGCTGAG AGATGTGCCC GGCGAGGTCC ACAAGCGCGC CTCCGCGAGC CGCGACACCG GCACGACCGC GACATTCATG ATCTATGCGA TTCCGCACCG TGACGCGGAG GCCGTCTACT CCGCCGGTGG CCTGCCCAAC GCCGACGCCT ACCGGACGTT CACCCGCCAG GTCGCCGGCG CGATCGGCGA CGCCCGCGCA GTGATCATCC TTGAGCCGGA CTCGCTCGGT CAGATGGACA GCCTGCCCGC CGACCAGCAG GCCGAGCGCT ACGCGCTGCT CAACGACGCG GTCGGCGTGT ACGGCGCGCT GCCCAACACC AGCGTCTACC TGGACGGCGC GAACTGCGGC TGGATGCCGG CGGGAGCGGC GCCGGTGATC GCCGAACGGC TCCTGCGGGC CGGGGTGAAG GGCGCCCGCG GCTTCGCGGT CAACGTGTCC AACTACTACC GGACCGAGGA CGAGACCGCC CGAGGCGAGA TCATCTCGGC CCTGACCGGC GGCACCCACT TCGTGGTCGA CACCTCGCGC AACGGCCGGG GCCCCGCCGA GGGCATCCAG AACCAGTGGT GCAACCCGCC GGACCGTGGC CTGGGCGTCG CCCCGACGAT CGAGACGGGC TCACCGCACG CCGACGCGTT CCTCTGGATC AAGACCCCTG GTGCCAGCGA CGGCGAGTGC GGACGCGGCA ACCCCGCGGC CGGCGCCTGG TGGCAACAGC AGGCGGAGGA GCTGGTCCGC AATGCGGCCT GA
|
Protein sequence | MTVRAEDGNG PKPRLREPQV RPGRPGRTRL RRRACAAGAT VVAVLALAAC GGGSSPTPVT PSVTAAAPNV PTGPLTAVGP QCPAGDPAGA QRPTETVVPA DPAPAAPFLV DHGSQAAEEA QRNPARAQIL APLVNTPTAY PVGDWLRDVP GEVHKRASAS RDTGTTATFM IYAIPHRDAE AVYSAGGLPN ADAYRTFTRQ VAGAIGDARA VIILEPDSLG QMDSLPADQQ AERYALLNDA VGVYGALPNT SVYLDGANCG WMPAGAAPVI AERLLRAGVK GARGFAVNVS NYYRTEDETA RGEIISALTG GTHFVVDTSR NGRGPAEGIQ NQWCNPPDRG LGVAPTIETG SPHADAFLWI KTPGASDGEC GRGNPAAGAW WQQQAEELVR NAA
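The gene and protein sequences can be cross-checked by stripping the 10-character spacing and translating codon by codon. The sketch below is a hedged illustration, not the database's own method. It assumes that translation table 11 shares its amino acid assignments with the standard code, and that the first codon is an initiator read as M (here GTG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene sequence are used.

```python
# Codon table in the conventional TCAG ordering.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

def clean(seq):
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate from the first base and stop at the first stop codon.

    Assumption: the first codon is a start codon, so it is reported as M
    even when it is GTG.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    if protein:
        protein[0] = "M"
    return "".join(protein)

# First 30 bases of the gene sequence above.
prefix = "GTGACCGTCC GGGCGGAAGA CGGCAACGGG"
print(translate(prefix))  # MTVRAEDGNG
```

The printed peptide matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would allow the same kind of check against the GC content field.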