Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0456 |
Symbol | |
ID | 5675657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 538299 |
End bp | 540581 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641239387 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001504825 |
Protein GI | 158312317 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGAGG CACAGCATCT TGTTTCGACG CTCGCCCTCG AGGAGAAGGC CGAGCTGGTG TCTGGTGGCG GTCTCTGGGT GACCGCAGAG ATACCACACG CCGGGGTGCG GCCGGCGGTG CTCACCGATG GCCCGCACGG GGTCCGCATG ACGCGGGATG GCGTTGATAC CGGGCATATC GCGAACAGTT TTCCCGCTAC GGCATTTCCC ACGGCGGCGG CATTGGGATC GTCGTGGAAT GAAGACCTGT TGCGCGAGAT CGGCGCGGCG CTCGGCGCCG AGTCCCGTGC CCTCGGCGTG GACGTGCTAC TCGGCCCCGG AATCAACATC AAGCGCTCCC CGCTGTGCGG GCGCAACTTC GAGTACTTCG CCGAGGATCC GCTGCTGGCG GGAGCACTCG GGGCGGCGTG GGTCGAGGGT GTGCAGTCAC ACGGGGTCGG CGCCTCGGTC AAGCACTTCG CGGCGAACAA CCAGGAGACC GACCGTATGC GGGTGAGCGC GGACGTCGAC GAGCGCACCC TGCGCGAGAT CTACCTGCCC GCGTTCGAGC ACGTGGTCCG CCAGGCGCAC GTAGCCACCG TGATGTGCTC GTACAACCGG ATCAACGGCG TGCGCGCTTC CCAGAACCGA TGGCTGCTCA CAACGGTACT GCGAGAGGAA TGGGGCTTCG AAGGCTACGT TCTCTCGGAT TGGGGGGCCG TTCACGATCC AGTCGCCGCT CTGCAGGCCG GCCTTGACCT GGAAATGCCT TCCAGCAGAG GCCGCAGCGC CGCCGAGATC GTTGGTGCCG TCCGCGCCGG CGCGCTCGAC GAACGGTGCC TGGATCTGGC CGTAGAACGC CAACTGGCCA CGCACCAGCG GCTGTGGCTC GCCCGCGGCG ACGGCATGGA GACGCCCGAT TTGCCCGCAT ACCACGTGCT GGCCCGGCGC GCGGCGGCCG AGGGCGCGGT GCTGCTGAAG AACGATGGCG ATCTCCTTCC GCTCGATCCG GCCACCGGCG GCCGGATCGC GGTCGTGGGT GAGTTCGCCC GCAGCCCCCG CTATCAGGGC GCCGGTAGCT CGCAGGTCAA TCCGACCCGC CTGGACGACG CGTTGACCGC GATCCTTGCC GCGACCTCAC GCGAGGTCAC CTTCGCTCCC GGCTTCCGCC TCGACGGCAC CGCCGATCCG GTCCTCCTTG CCGAGGCCGT GCAGGCCGCC CGCGACGCGG AGTCGGTGGT GATGTTCCTC GGTCTGCCGG AGCAGACGGA GTCGGAGGGC TTCGACCGGA CCGACCTGGA CCTGCCCGCC ATCCAGGTGG AGCTACTCGA AGCCGTCGCT TCGGTCAATT CGCGTGTTGC CGTCATCCTC AGCAACGGTG GTGTCGTGCT GACCGATCCG GTGATCTCGC GCGCCGCCAC GCTGCTGGAG ATGTGGCTGT CGGGCCAGGC TGGTGGGAGC GCCGCCGCCG ACCTCATCTT CGGCCATGCC GCTCCCGCCG GTCGGCTGGC CGAGACCATC CCGCACCGCC TCCAGGACAT CCCGACCTAC GTGAACTGGC CTGGCGCCGA AGGGCACGTG AACTACGGAG AACGCCTGTA TGTCGGCTAC CGATGGTACG ACCGCACCGA TGAGGACGTC GCGTTCCCTT TCGGCTTCGG GCTGACCTAC ACCACCTTCG CCTATTCAGA CCTCGCCGTG CACCTCCCCG ACCCGGCCAG GCCGGAGGCT CGCGTCGAGG TTGTGGTGAC CAACACCGGC AGGCGGGAAG CGGCCGAGGT CGTCCAGCTG TACATCAGCG ACCCAGTCGC CGACGTTGAT CGCCCGGTGC GTGAACTGCG CGGCTTCCGC AAGGTCCGCG TGGCCCCCGG ACACAGCGAA CGGGTGGTGA TCGAGCTTGA CGCCCGTGCG TTCAGCTACT GGAGTACCCG ACGCGGGCAG TGGGTCGTCG AACCTGGGGA GTACGGCATC CATGTGGGAT CCTCCTCACG CGACCTGCCG ATGACCCAGA CGATCAACCT GGACATGGCG ATGCCGACAC CACCGCTCAC CGAAGAAAGC ACGCTGGCCG AATGGTACGA CCATCCGAGT GGTCGACACA TCGTGCGAGA ACTCCTGGGC AAGACGATGG GCGCGCAGGA CGGCCTGCTG GAGAACCTCG GAACCTGGCA GTTCGTCGCC CAGATGCCAC TCACCGCTCT GCTCAAAATG TCCCCATCCG CGGGCACCGA GGCCAAGCCG TTGAAGGACC TGCTGCACGC CGCGAACAAT GGAAACGCCG CGCAGCCAAC GGCGAACGTC TGA
|
Protein sequence | MIEAQHLVST LALEEKAELV SGGGLWVTAE IPHAGVRPAV LTDGPHGVRM TRDGVDTGHI ANSFPATAFP TAAALGSSWN EDLLREIGAA LGAESRALGV DVLLGPGINI KRSPLCGRNF EYFAEDPLLA GALGAAWVEG VQSHGVGASV KHFAANNQET DRMRVSADVD ERTLREIYLP AFEHVVRQAH VATVMCSYNR INGVRASQNR WLLTTVLREE WGFEGYVLSD WGAVHDPVAA LQAGLDLEMP SSRGRSAAEI VGAVRAGALD ERCLDLAVER QLATHQRLWL ARGDGMETPD LPAYHVLARR AAAEGAVLLK NDGDLLPLDP ATGGRIAVVG EFARSPRYQG AGSSQVNPTR LDDALTAILA ATSREVTFAP GFRLDGTADP VLLAEAVQAA RDAESVVMFL GLPEQTESEG FDRTDLDLPA IQVELLEAVA SVNSRVAVIL SNGGVVLTDP VISRAATLLE MWLSGQAGGS AAADLIFGHA APAGRLAETI PHRLQDIPTY VNWPGAEGHV NYGERLYVGY RWYDRTDEDV AFPFGFGLTY TTFAYSDLAV HLPDPARPEA RVEVVVTNTG RREAAEVVQL YISDPVADVD RPVRELRGFR KVRVAPGHSE RVVIELDARA FSYWSTRRGQ WVVEPGEYGI HVGSSSRDLP MTQTINLDMA MPTPPLTEES TLAEWYDHPS GRHIVRELLG KTMGAQDGLL ENLGTWQFVA QMPLTALLKM SPSAGTEAKP LKDLLHAANN GNAAQPTANV
|
| |