Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0241 |
Symbol | |
ID | 5668666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 293991 |
End bp | 296393 |
Gene Length | 2403 bp |
Protein Length | 800 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641239170 |
Product | Kojibiose phosphorylase |
Protein accession | YP_001504614 |
Protein GI | 158312106 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.532549 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGTCA GGCCCTCCTA CCCCATCGAG TCCTGGTCGC TGACCGAGCA CGGGCTCGAC ATCGACGACC TGGCCCGCTC CGAGTCGCTG TTCTCGCTGT CCAACGGGCA CGTGGGCATG CGCGGGAACC TCGACGAGGG CGATCCGCAC GGGCTGCCCG GTACCTACCT GAACTCCGTC CACGAGCTGC GGCCGCTTCC GTACGCCGAA GCGGGCTACG GGTACCCGGA GTCCGGGCAG ACGGTCATCA ACGTCACGAA CGGCAAGATC GTCCGCCTGC TGGTCGACGA CGAGCCGTTC GACGTCCGCT ACGGGGATCT TCTCGCGCAC ACCCGCACGA TCGACTTCCG CGAGGGGGTG CTGCGCCGGG AGGCCGACTG GGTCTCCCCG GCCGGGCAGC GGGTCAGGAT CCGCACCCAG CGTCTCATCT CCTTCTCCCA GCGCTCCGCC GCCGCGATCC ACTACGAGAT CGAACCGGTG GGCGACACCG CGCGGATCGT CATCCAGTCC GAGCTGGTCG CCAACGAGCA GCTTCCGGGG CGCAGGGGCG ACCCGCGCGC CGCCGCCGTC CTGGAGTCGC CGCTCATCTC CGAACGGCAC CGCGCCCGCG AGACCATGGT CGAGCTCGTC CACCGCACCC GGCACAGCGA CATCCGGGTC GCCGCGGCGA TGGACCACAT CTTCGACGGC CCGCGTTCGC TCGGCGTCAC CTCGGAGAGC GAGCCCAACA CCGGCTGGGT CACCGCGACG GCTGTCCTCA AGCCGGGCGA GACCCTGCGG ATGGTCAAGT TCCTCGCCTA CGGCTGGTCC GAGCAGCGCT CCCTGCCGGC GCTGCGCGAC CAGGCCACCG CCGCCCTCGT CGCTGCCCGC CAGACCGGCT GGGACGGCCT CGTCGCCGAA CAGCGTGCGT ACCTGCAGGA CTTCTGGAAC CGGTCCGACG TCGAGGTCGA CGGCGACGCC GAGGTCCAGC AGGCCGTCCG GTTCGCGCTC TTCCACGTCC TGCAGGCCGG CGCGCGGGCC GAGCGCCGGG CCATCCCCGC GAAAGGGCTC ACCGGTCCCG GCTACGACGG TCACGCCTTC TGGGACACGG AGAGCTACGT CCTGCCCGTC CTCACCTACA CCGCGCCGGC CGCCGCCGCC GACGCGCTCC GCTGGCGGCA CTCGATCCTC CCGCTGGCCC GCGAGCGCGC CCAGTTGCTC AACCTCGACG GCGCCGCCTA TCCCTGGCGC ACCATCCACG GCGAGGAGTG CTCCGGCTAC TGGCCGGCCG GGACGGCCGC CTACCACGTC AACGCGGACA TCGCCGACGC CGTCCTGCGC TACCTGTGGG CCACCGAGGA CGAGCAGTTC GAGCTCGAGG TGGGCCTGGA GATCCTCATC GAGACGGCGC GGCTGTGGCG CTCGCTGGGC CACCACGACC TCTCCGGCCG TTTCCGCATA GACGGGGTGA CCGGCCCGGA CGAGTACTCC GCGCTCGCCG ACAACAACGT CTACACCAAC CTGATGGCCC AGCGGAACCT CATCGGGGCG GCCGACGCCG TCCGGCGACA CCCCGAACGC GCGCGCGCCT TCGGGGTCGA CGCGGAGACC GCGGCGAACT GGCGCGATGC CGCCGACGAC ATGTTCATCC CGTTCGACGA ACGCCTCGGG GTGCACCCGC AGTCCGAAGG ATTCACCGAG CACCAGGTCT GGGACTTCGA ACGGACCAGG CCGGAGCAGT ACCCGCTGCT GCTGCACTTC ACCTACTTCG ACCTTTACCG CAAGCAGGTC GTGAAACAGG CCGACCTGGT GCTGGCGATG CAGCGGCGCG GCGACGCGTT CACCGCCGAG CAGAAGGCAC GCAACTTCGC CTACTACGAG GCGCTCACCG TCCGCGACTC GTCGCTGTCC GCCTGCTGCC AGGCCGTCAT GGCGGCCGAA TGCGGGCACA TGTCCCTCGC GCACGACTAC CTGCGCGAAG CCGCGTTCAT GGACCTGAAG GACATCGAGC ACAACACCGG CGACGGCCTG CACATGGCCT CGCTGGCCGG CAGCTGGATC GCGCTCGTCG AGGGCTTCGG CGGGCTGCGC GACACCGGTG AGCTGCTCTC CTTCAGCCCC CGCCTGCCCG AGGGTCTCAG CCGGCTCGCC TTCGGCCTGC GCGTGCGCTC CCGCCAGCTC CGCGTCGAGG TGCTCGAATC GTCCGCCACC TACACGGTGC TCGAGGGGGA GGCGATCACC ATCCTGCACC ACGGCGAGAA GGCCCGCGTC TCACCCGACC AGCCCTGCAA ACTGGACGTC CCACCGGTGC CGGCGCAGGA ACGCCCGAGG CAGCCCGCCG GCCGGGAGCC GCTGCGGTTC CTGCACCACG GCAACGGCAC CGAGGTCACC CGCGCGGCCG ACCTCCACCC CGTCGACGGG TGA
|
Protein sequence | MIVRPSYPIE SWSLTEHGLD IDDLARSESL FSLSNGHVGM RGNLDEGDPH GLPGTYLNSV HELRPLPYAE AGYGYPESGQ TVINVTNGKI VRLLVDDEPF DVRYGDLLAH TRTIDFREGV LRREADWVSP AGQRVRIRTQ RLISFSQRSA AAIHYEIEPV GDTARIVIQS ELVANEQLPG RRGDPRAAAV LESPLISERH RARETMVELV HRTRHSDIRV AAAMDHIFDG PRSLGVTSES EPNTGWVTAT AVLKPGETLR MVKFLAYGWS EQRSLPALRD QATAALVAAR QTGWDGLVAE QRAYLQDFWN RSDVEVDGDA EVQQAVRFAL FHVLQAGARA ERRAIPAKGL TGPGYDGHAF WDTESYVLPV LTYTAPAAAA DALRWRHSIL PLARERAQLL NLDGAAYPWR TIHGEECSGY WPAGTAAYHV NADIADAVLR YLWATEDEQF ELEVGLEILI ETARLWRSLG HHDLSGRFRI DGVTGPDEYS ALADNNVYTN LMAQRNLIGA ADAVRRHPER ARAFGVDAET AANWRDAADD MFIPFDERLG VHPQSEGFTE HQVWDFERTR PEQYPLLLHF TYFDLYRKQV VKQADLVLAM QRRGDAFTAE QKARNFAYYE ALTVRDSSLS ACCQAVMAAE CGHMSLAHDY LREAAFMDLK DIEHNTGDGL HMASLAGSWI ALVEGFGGLR DTGELLSFSP RLPEGLSRLA FGLRVRSRQL RVEVLESSAT YTVLEGEAIT ILHHGEKARV SPDQPCKLDV PPVPAQERPR QPAGREPLRF LHHGNGTEVT RAADLHPVDG
|
| |