Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3155 |
Symbol | |
ID | 5671532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3714848 |
End bp | 3717070 |
Gene Length | 2223 bp |
Protein Length | 740 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641242050 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001507470 |
Protein GI | 158314962 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACCGC GCAACGATCC CGTCGACCTC ACCTTACAGC ACAAGGCATC CCTTCTATCG GGACACGACT TCTGGTCCAC AAAATCCCTG GACGACGCGG GGATCCCCTC GATAGTTCTC GCTGACGGTC CGCACGGCGT GCGCCTTCAG CGCGATAGAG CAGACCACGC GGGCATCTAC GACAGCTTGC CCTCCACCTG CCTGCCTCCG GCGGTCGCGG TCGGGTCCAG CTGGGATGTC GAGGTGGCGA GCCGAGTGGG TGCCGCAGTC GGGCGGGAGG CCCGCGCTCT GGGGGTCGCG GTCGTGCTGG GCCCGGGAGT GAACATCAAG CGGTCGCCTC TGTGCGGCCG CAACTTCGAG TACTACTCCG AGGATCCGCT GCTAACCGGC GTGCTGGCGA CCGCGCACAC ACAGGCGTTG CAGGCCCAGG GAGTGGGCGC GTCGGTGAAG CACTTCGCGG CCAACAACCA GGAGACCGAC CGGATGGTCA TCAGCGCCGA CGTCGACGAA CGCACCCTGC GGGAGATCTA CCTTCCCGCC TTCGAGCGGA TCGTCACCGA AGCGCAGCCC GCGACCGTTA TGGCCGCCTA CAACCGCGTC AACGGGGTCG CCGCGTCGCA GAACCGGTGG CTTCTGACTG ACATCCTGCG CACCGAGTGG GGCTTCACAG GGATCGTTGT CTCGGACTGG GGCGGCGTCT ACGACCGGAT CGCCGCCCTC GCGGCGGGAC TGGACCTCGA GATGCCAGGA TCCTCTGGTG TCAACGACGC GAAGGTCGCG CAGGCCGTCC GCGACGGGCT CCTGGCCGAG ACCGTCGTCG ACGCGAGCGT ACAGCGCCTG ATCACGCTTG CCGGTCTCGG TAAGGCCACC GAGGGCGATC TGCCCGCCGA GAGAGATCAC CACGCCCTCG CCAGGGAGGT GGCCGCCGAC TGTGCCGTCC TACTCAGAAA CGAACACGCG ACGCTTCCGC TCGACCCAGC CGCTCGCATC GCGGTCATCG GCGAGCTCGC CGTCTCCCCC AGGTTCCAGG GCGGGGGCAG CTCACGGGTG AACCCGACCC GGATCGACGT CCCGTTGGAG GCCATCACCG CCCTCGGGCA GAGCGTCGTG TACGCCCGCG GTTTCACCAC CGACGGCTCC AAAGACGCGC GGGAGCTTCG AGACGACGCC GTCCGCATCG CGCTCGACGC CGACGTCTCC ATCATTTTCG CCGGAGGAGA TGAAGACTCG GAAGGCATCG ACCGCGAACA CATCAACCTG CCCGCCGACC AGATCGACCT GATACGAGCA GTCGCCGCCG TGGCCCCGCG GACTGTCGTC GTCTTGTCCA ACGGAGGCGT GGTCTCGCTC GAGGGTTGGC ACGACGACGT GGACGCGATC CTGGAGGCCT GGTTTCTCGG ACAGGCAGCC GGCGGCGCGA TCGCCGACCT TCTGTTCGGC GTGGTCAACC CCTCCGGTCA CCTCGCCGAG ACCATTCCAC GGCAGCTGCA GGACACTCCT TCCTACCTCA CCTTCCCCGG AGAGCAGGGC CACGTCCGGT ACGGCGAGGG CGTCATGGTC GGCTATCGCT ACTACGAAAG CGTCCAACGC GCCGCCCGCT ACCCTTTCGG GCACGGTCTG AGCTACACGA AGTTCGCCAC CAGCGAGCTG CAGGTGACCG TGGACGGTGA CGATTCCGCG ACCGTCCGCG TCACCGTCAC GAACATCGGC GACCGAGCTG GTAAGCACGT CGTGCAAGTC TATGTTGCCA CCGACGCGGG ACGGGTTCGC CGCCCGGCAC GGGAGCTGCG CGCGTTCACC AAGATCGCAC TACAGCCCGG CGAGAGCCGA ACCGTCGAAC TGGCCCTCGA TCGGCGCTCC TTCGCCTACT ACGACATCAA ACAGGCCCGA TGGACCGTCG CGCCCGGTAG CTACTCGATC CAGATCGGCG AGAGCGCGAC GCGGATCATC GCCGAGCAGA ACATCACCCT CCCAGGCGAC ACCGTGACCC AGGAGCTTTC CCTGGACTCA GCCGTCTCCG ACTGGCTCGA CCACCCCATC GTCGGGCCCA TGTTCCGGCG CACCCTTGAC AAGGCCTCGC CTGACCGACG AAGCCTCCTC ACCGACCAGG CCTCGAGGGC CATCGAAATG GTCGCGTCGA TGCCCCTGCG CCAAATCATG CAGTTTCCCA CCGTGGACCT GCCCGTCGAC GCACTCACCC ACATGATGGA GCTGACCCGA TGA
|
Protein sequence | MTPRNDPVDL TLQHKASLLS GHDFWSTKSL DDAGIPSIVL ADGPHGVRLQ RDRADHAGIY DSLPSTCLPP AVAVGSSWDV EVASRVGAAV GREARALGVA VVLGPGVNIK RSPLCGRNFE YYSEDPLLTG VLATAHTQAL QAQGVGASVK HFAANNQETD RMVISADVDE RTLREIYLPA FERIVTEAQP ATVMAAYNRV NGVAASQNRW LLTDILRTEW GFTGIVVSDW GGVYDRIAAL AAGLDLEMPG SSGVNDAKVA QAVRDGLLAE TVVDASVQRL ITLAGLGKAT EGDLPAERDH HALAREVAAD CAVLLRNEHA TLPLDPAARI AVIGELAVSP RFQGGGSSRV NPTRIDVPLE AITALGQSVV YARGFTTDGS KDARELRDDA VRIALDADVS IIFAGGDEDS EGIDREHINL PADQIDLIRA VAAVAPRTVV VLSNGGVVSL EGWHDDVDAI LEAWFLGQAA GGAIADLLFG VVNPSGHLAE TIPRQLQDTP SYLTFPGEQG HVRYGEGVMV GYRYYESVQR AARYPFGHGL SYTKFATSEL QVTVDGDDSA TVRVTVTNIG DRAGKHVVQV YVATDAGRVR RPARELRAFT KIALQPGESR TVELALDRRS FAYYDIKQAR WTVAPGSYSI QIGESATRII AEQNITLPGD TVTQELSLDS AVSDWLDHPI VGPMFRRTLD KASPDRRSLL TDQASRAIEM VASMPLRQIM QFPTVDLPVD ALTHMMELTR
|
| |