Gene Information |
Locus tag | Franean1_3254 |
Symbol | |
ID | 5671628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3850564 |
End bp | 3851907 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242146 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001507566 |
Protein GI | 158315058 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
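The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon is a stop codon that does not contribute a residue. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3850564
end_bp = 3851907

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# a stop codon, which adds no amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1344 447
```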
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCAACG CCCGCGTCCC ACTCCACCGC GACTCCTCTC GACAGTTCCC CGCCGAGCCG GGGGCGCCCA CGGCGGCGCC GGCGACCACG GCTGCCTCAA AGGCCCCGAC GGACTCGTCG AGGGCGGTGT TTCCCGATGG CTTTCTCTGG GGGGCGGCCA CCGCCCCGCA TCAGGTGGAG GGCGGTAACG TCGGCTCGGA GATGTGGCGC TCCGAGTGGA TGCCGAACTC GACGTTCGCC GAGCCGTCCG GGGACGCCTG CGACCACTAC CACCGGTATC CGCAGGACAT CGCCACTCTG GCGGGGCTGG GCCTGAACGC CTACCGGTTC GGGGTCGAGT GGGCGAGGGT CGAGCCCGAG GAGGGGTACT TCTCCCGCGC CGCGCTCGAC CACTACCGGC GCATGGTGGC CACCTGCCTT GAGCACGGCG TGACACCGGT GGTGACCTAC AGTCATTTCT CGTTGCCCCG GTGGTTCGCC GCGGCCGGCG GGTGGAGCAA CCCGGCGGCC CCGGACCAGT TCGCCCGGTA CGCGGCCCGG TTGACCGCGC ACATCGGCGA TCTGGTGCCC TGGGTGTGCA CCCTCAACGA GTCGAACGTC ATCTCGTTGT TGCTGCACCT GCGGGTCGCG CCGGCTGCCG CCCGCGAGGA CGGTCTCGGG CTGGCGGAGG CCCTCAGGGC CCCGGCGCCC GCCGGGACAC CGAAACGCGG CGGTTGGCCG CCCCCGGACG TCGAGATCAT GGCCAAGGTG CACCGCAGGG CGGTGGAGGC GATCAAATCC GGTCCCGGCA ACCCGGCGGT GGGCTGGACG CTGGCCCTGA TCGACATCCA GGCGGCCGAA GGCGGCGAGC AGCGTCAGCT GGCGGTGCGC CAGGCGGCCG AGCTCGACTG GCTCGAGGTG TCCCGGGACG ACGACTTCGT GGGTGTACAG ACCTACACGC GGGAACGAGT GGGGTCTGAA AAGGTGCTCC CGCCGCCGGA GGGCGCGGCC ACGACGCAGA CGGGCTGGGA GGTGTACCCG CCCGCGCTCG GGCACACGGT CCGGCTCGCC GCCGAACACG CCAGGGTCCC GATCCTGGTC ACCGAGAACG GCATGGCCAC CGATGACGAC GACGCCCGCG TCGCTTACAC CCGCGCCGCC CTGCATGGCC TGGCCGCTGC CGTCGCCGAC GGCGTCGACG TGCGCGGCTA CCTGCACTGG ACGCTGCTCG ACAACTTCGA GTGGACGTCC GGCTTCGCGA TGACCTTCGG CCTGATCGCG GTCGACCGGA CGAACTTCGC GCGGGCGGTG AAGCCGTCGG CGCGCTGGCT CGGCGCGGTC GCGCGCGCCA ACGGACTCGT CTGA
|
Protein sequence | MPNARVPLHR DSSRQFPAEP GAPTAAPATT AASKAPTDSS RAVFPDGFLW GAATAPHQVE GGNVGSEMWR SEWMPNSTFA EPSGDACDHY HRYPQDIATL AGLGLNAYRF GVEWARVEPE EGYFSRAALD HYRRMVATCL EHGVTPVVTY SHFSLPRWFA AAGGWSNPAA PDQFARYAAR LTAHIGDLVP WVCTLNESNV ISLLLHLRVA PAAAREDGLG LAEALRAPAP AGTPKRGGWP PPDVEIMAKV HRRAVEAIKS GPGNPAVGWT LALIDIQAAE GGEQRQLAVR QAAELDWLEV SRDDDFVGVQ TYTRERVGSE KVLPPPEGAA TTQTGWEVYP PALGHTVRLA AEHARVPILV TENGMATDDD DARVAYTRAA LHGLAAAVAD GVDVRGYLHW TLLDNFEWTS GFAMTFGLIA VDRTNFARAV KPSARWLGAV ARANGLV
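The relationship between the gene and protein sequences can be illustrated with a small translation sketch. It assumes translation table 11 assigns the same amino acids to codons as the standard code and differs mainly in the start codons it accepts, so an initial GTG is read as methionine. `START_CODONS` lists only a subset of the accepted starts, and all function names here are hypothetical helpers, not part of any database API. A `gc_fraction` helper is included to show how a GC content figure could be derived from the same sequence.

```python
# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a partial list of start codons accepted under table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading a recognized start codon as M and stopping at '*'."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "GTGCCCAACG CCCGCGTCCC ACTCCACCGC"
print(translate(first_blocks))  # MPNARVPLHR
```

The printed residues match the first block of the protein sequence listed in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the protein sequence and GC content fields to be compared in the same way.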