Gene Information |
Locus tag | Franean1_5502 |
Symbol | |
ID | 5673833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6666554 |
End bp | 6668044 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641244357 |
Product | hypothetical protein |
Protein accession | YP_001509763 |
Protein GI | 158317255 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4124] Beta-mannanase |
TIGRFAM ID | |
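The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 6666554
end_bp = 6668044
gene_length_bp = 1491
protein_length_aa = 496


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks should hold if the record is internally consistent.
length_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = coding_codons(gene_length_bp) == protein_length_aa
print(length_ok, protein_ok)
```

Under these assumptions, 6668044 - 6666554 + 1 gives 1491 bp, and 1491 / 3 - 1 gives 496 aa, which matches the Gene Length and Protein Length fields.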
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0681729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.325123 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACAGCG CGCGCGGCCA TCGCATCGTC CGCGCACCGA CCGGTCGTCG GCGGTCCTAC CTGCGATCGA CGATGCTGCG TCTCGCCGTC CTCGCGCCGA CCCTGTTGGC CGGGACGACG ATCGCCGGCA GCGGCGCGGA GGCCTTGCCG GTGCCGGGAG CCCTCATCGA GGACACGGCG ATCGGTTCGG GGGTCTCCCA GGTCAGCTAC TTCGGCGACT GGTCGGCGTG CACGCGGTGC GGCCCGGCCA CGCCCAACAA TAGTTACCGG GAGTCGGTCC AGCCGTCGAG CGGCGCGGTG CTCCGGTTCT CCGGGACACA GGTTGACGTC TACGGCGTCC TGGGCCCGTC CGGGGGCGTC GCGACGATCA GTGTCGATGG CGGGACACCG ACCGCCGTCG ACACCTACGC GGCGGCCGGT GCGGTGAGCC GCATCTACCA GTCCGGGTTG CTCAATCCCG GGATCCACAC CGCGGTGATC GTCAACATCG GCTGGCGCAA CCCGGCTTCC AGTGGTGTTC GGGTCGCCTT CGACCGGGCC CAGGTGTTCG TCGAGCAGGG CGGAGGGGAT CCGGGGAACC GGTCGGGTCA GCCGTGGCTC TCCGGGGCGA ACGGAGATCC GATCCAGAAC TCGGCGAACG TGGACACGTT CTGCGAGCGC CGGGGCAGTC CCTGCGACCT CGCTCATGTC TTCGTCTCCC GTAACAACTG GCAGAACATC GTGCAGCCGT CCTGGACCCA GGCGAACTTC GCCGGATGGC CGGGCCGCCT CGTCATCTCG GTGCCTCCCT TCCCCGAGAA CTCGGGGAGC ACACTCACCG CCTGCGCATC GGGTGCCTAC GACTCGCAGT GGCGCACGTT CGGCCAGACA CTGAACTCCA CCGGACGGCA GAACTCGATT ATCCGTATCG CGTGGGAGGC GAACGGGAAC TGGTACCAGT GGTCGGGTAG CAACCCGTCC GCCTATGTGG GCTGTTGGCG GCGGATCGCC GACGCCATCA ACTCCACGGC CGAGCCTGAC CCGCTGCTCG ACTGGACCAT CAACGCGCAC TACTCGCAGA ACCCCGCGAG CCATAACCCG CTCGACCTGT ACCCGGGCGA CGCCTGGGTG GACATCGTGG GCATCGACGC CTACGACCAC TACCCGCCGT CCCGTACCCT CGCCGAGTTC AACAACCAGG CGAACGCGGT CGGGGGCATC ACCTGGCTGT ACAACTTCGC CCGCGCCCAC AACAAGTTGT TCGGTGTCGG TGAATGGGGG GTCGTGAGCG GACGTAACGA GAACGGTGCC GGGGACAACC CGAACTTCAT CCAGTTCATG CGCGACTGGA TGAATGCGCG CGCTGGACAG GGAATGTTCT ACGAGAACTA CTACAGCACC TGCGAGCCGC CGAATGTCGG GTCCAACCTG TACCGGCCGA CCGGGCCGTC CTGCCTGTTC ATCAACAACG CCTCCGCCCA GCGCTACACC GATCTGTGGA GCAGCCCTTA G
|
Protein sequence | MDSARGHRIV RAPTGRRRSY LRSTMLRLAV LAPTLLAGTT IAGSGAEALP VPGALIEDTA IGSGVSQVSY FGDWSACTRC GPATPNNSYR ESVQPSSGAV LRFSGTQVDV YGVLGPSGGV ATISVDGGTP TAVDTYAAAG AVSRIYQSGL LNPGIHTAVI VNIGWRNPAS SGVRVAFDRA QVFVEQGGGD PGNRSGQPWL SGANGDPIQN SANVDTFCER RGSPCDLAHV FVSRNNWQNI VQPSWTQANF AGWPGRLVIS VPPFPENSGS TLTACASGAY DSQWRTFGQT LNSTGRQNSI IRIAWEANGN WYQWSGSNPS AYVGCWRRIA DAINSTAEPD PLLDWTINAH YSQNPASHNP LDLYPGDAWV DIVGIDAYDH YPPSRTLAEF NNQANAVGGI TWLYNFARAH NKLFGVGEWG VVSGRNENGA GDNPNFIQFM RDWMNARAGQ GMFYENYYST CEPPNVGSNL YRPTGPSCLF INNASAQRYT DLWSSP
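The sequences above are printed in space-separated blocks of 10 residues. The gene starts with GTG while the protein starts with M, which is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The following minimal sketch shows one way to strip the block formatting, compute a GC fraction, and check the start and stop codons. It uses only short excerpts from the gene sequence for illustration. The function names and the codon sets are assumptions made for this example, and `START_CODONS` lists only a subset of the initiators allowed by table 11.

```python
# Subset of table 11 initiation codons (assumption: the common ones only).
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the whitespace used to group residues into blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two and last three blocks of the gene sequence (excerpts only).
head = clean_sequence("GTGGACAGCG CGCGCGGCCA")
tail = clean_sequence("GATCTGTGGA GCAGCCCTTA G")

starts_with_initiator = head[:3] in START_CODONS
ends_with_stop = tail[-3:] in STOP_CODONS
print(starts_with_initiator, ends_with_stop, gc_fraction(head))
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` would give a value that can be compared with the GC content field, which is reported as a rounded percentage.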
|
| |