Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5276 |
Symbol | |
ID | 5675758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6347184 |
End bp | 6349154 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641244133 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001509540 |
Protein GI | 158317032 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.337424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0935603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCATGC GGCGTGTGCG GGTCGTCGGG GTGGCGCTCG CGGTCGTCCT GGCGACGCTG GCCGGTGTCA CGGCGTTCGC CTGGGCGGTC CGCGGCACCG ACGCGGGCGG CTCGATGGCG GACCGGCTCA GCCTCGCCGA CGGCGACACC GCAGGCGGCT CCGCCGGGGC GGGGGACCCG GGCGGCGGCG CGGCGGGCGC GGAATCCGGG CGCGGCACCG CGGAGCACCC CGCCGACGGC CCGGAAACCG GCGCCGACGA GCTGTCGGCC GCCGACCCGG GCGGCGCGGG GAGCGGCCCC GGCGCCGGTT CGGCCGAGGA GAACAGCTGG GTCGAGCCCA CGCTGGCCGG CCTGTCGCTG GAGCAGCGCG TAGGCCAGAT GATGATGGGA TACGTCTTCG GTACCGCGGG CGCCGACCGG AGCCCGGCCG TTGTCACCGC GAACCGGCGG ACGTCCGGTG TGGACACGGC CGCCGAAGCC GTCGCGAGAC GGGGCCTGGG CGGCGTGATC TACTTCGACG CCGGCGGGAC GGGCCCGGGG GCGCTCCCGG ACAACATCGT CAACCCGAAC CAGGTCAAGA CGCTGTCCGC GGACCTGAGC GCGGCGGCCA GCATCCCGCT GCTGATCGCC GCGGACCAGG AGCAGGGAAC GGTGCTGCGC GTCCGGGACG GCGTGACCCT GCTGCCCGGC CAGATGGCAC AGGGCGCGAC GGGACGTCCC ACCGACGCGC GGGACGCCGC GCAGATCACC GGCGCGGACC TGCGCGCCCT GGGCATCAAC GTCGACTTCG CCCCGGACGC AGACGTCAAC AGCGACCCGG CGAACCCCGT GATCGGTGAG CGCTCCTTCG GCGACGACCC TACGGCGGTC GGGCGGTTCA CCGCGGCGGC GGTCGAGGGA TACCGGCAGG TCGGGGTGGC CGCGGCCGCG AAACACTTTC CCGGGCACGG CGCGACGTCC GTCGACAGCC ACGCCGACCT GCCGACGATC ACCAGAGACC GGGCGGCGCT GACGGCGCTC GACCTGCCGC CGTTCCGGGC GGCGATCGCC GCGGGCGTAC CGATGGTCAT GGTCGGTCAT CTGAACGTCC CCGCGCTCGA CCCGGCCGCG CCGGCGACGC TGTCGAAGCC GGTGGTCGAC GGCCTGCTGC GCCACGAGCT CGGCTTCGAC GGCGTCATCG TCACCGACGC GCTGAACATG GCCGCGATCA CCGAGCACAA CACGCCCGGC GGCGCGGCGG TGCGAGCCGT CCAGGCCGGC GTCGACATGC TGCTGATGCC GCCGGACCTG ACGCAGGCGC TTGATGCTGT GGTTTCCGCT GTGCGCTCCG GGGCGATCGT TCCGGAGCGG ATCGACGCCT CGGTACGTCG AATCCTGAGG ATGAAGTGGA GGCTGGCGCA CACCGAGCCC GCCGCGGCCC GAACCCCCGA GGAGGCGGCG GCCACCGCGG CCGCGATCGC CGAGCGGGCG ATCACGCTCC TTGACCAGCC GACGTGTGAC CTGCTCCCGC TCAGCCGCGG CACCGCCGGC GCCGGTGCGG GCGGGACGCA GCCGGCGGTG GAGGTCTCCG GGCCGTCGGG CGCCGCGAAG ATGCTCGTGG ACGCACTGGG TGCCCGGGGG ATCGGCGCCC GGTTGGCCGC CCAGGGCTCC GGCCGCCCGC CAGCGCCGCC CGCAGCGGGC GCCTCGGGAG CGGGTACGGC TGTGGTCCGG GTGGTGCTGG TCGGGAACTC GCCGCCGCCC GTCACCGACC GGCGGACCGT GGTCGTGTCG ACCGGGACGC CCTACCGTCC GCCGGTGGCC GCCGGGGCGT GGCTGGCGAG CTATTCACGC GACCCAGCGT CGATGAAGGC GCTGGCGGCG GTGCTGGCCG GCGCGGTGCC ACCCTCCGGC CGGCTGCCGG TCGTCACCCG AACCGCGACC GGCACCGCGT TGCCGCGCGG CGCAGGCCTA CCGACCCCGC GCGCCTGCTG A
|
Protein sequence | MVMRRVRVVG VALAVVLATL AGVTAFAWAV RGTDAGGSMA DRLSLADGDT AGGSAGAGDP GGGAAGAESG RGTAEHPADG PETGADELSA ADPGGAGSGP GAGSAEENSW VEPTLAGLSL EQRVGQMMMG YVFGTAGADR SPAVVTANRR TSGVDTAAEA VARRGLGGVI YFDAGGTGPG ALPDNIVNPN QVKTLSADLS AAASIPLLIA ADQEQGTVLR VRDGVTLLPG QMAQGATGRP TDARDAAQIT GADLRALGIN VDFAPDADVN SDPANPVIGE RSFGDDPTAV GRFTAAAVEG YRQVGVAAAA KHFPGHGATS VDSHADLPTI TRDRAALTAL DLPPFRAAIA AGVPMVMVGH LNVPALDPAA PATLSKPVVD GLLRHELGFD GVIVTDALNM AAITEHNTPG GAAVRAVQAG VDMLLMPPDL TQALDAVVSA VRSGAIVPER IDASVRRILR MKWRLAHTEP AAARTPEEAA ATAAAIAERA ITLLDQPTCD LLPLSRGTAG AGAGGTQPAV EVSGPSGAAK MLVDALGARG IGARLAAQGS GRPPAPPAAG ASGAGTAVVR VVLVGNSPPP VTDRRTVVVS TGTPYRPPVA AGAWLASYSR DPASMKALAA VLAGAVPPSG RLPVVTRTAT GTALPRGAGL PTPRAC
|
| |