Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3187 |
Symbol | |
ID | 5671563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3755431 |
End bp | 3758499 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242081 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001507501 |
Protein GI | 158314993 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCA ACGGAACCGG GACGTCCGCC ATCCAGCCGT GGGACGCGCC TGAGCTCACT TCCGCGAACC GCGTCCCGAT GCACGCCGTT CCGCACGCTG ATCGTCTCGT CCTGGACGGT GTCTGGGATT TCCAGCTCCT TCCCGGCCCG CTCGCGGAGC GCGGCGAGCA GTGGCGCACC GTCGAGGTGC CGGGTGTGTG GACGATGCAG GACAGCGGGG ACCTTCCGCA GTACACGAAC GTCGTCATGC CGTTCGACTC CCCGTTCCCG CACCCGCCCG AGGCGAACCC GACCGGGGTC TACCGGCGCG GCTTCACCGC CGCGGCGGAC TGGACGGGGC GACGGGTCGT CCTGCACGTC GGCGCCGCGG AAAGCGTCCT GCTGGTGCGG GTGAACGGAC GCGACGTCGG CTTCAGCAAG GACTCCCACC TCGCGGCCGA GTTCGACGTG ACCAGGTTCG TCCGGCCCGG GACGAACGAA CTCGAGCTGA CGGTCGTGAA GTGGTCGGAC GCCTCGTTCG TCGAGGATCA GGACCACTGG TGGCACGGCG GCATCACCCG CTCGGTCTAC CTCTACACCA CCGCTCCCGT TTACCTGGCC GACATCCAGG CGATGGCCGA CCTGCATGTG ACGGCAGCAG CAACCGGCAG CCTGCGGCTG GACGTGAAGA TCGGCGGCGC CGGGACGCGC ACCGCCGGCT GGACGGTGCG CTCCCGGATC GCGGGACTGT ACGAACCGGA CCCGCAGCCC GTCCTGGAAG CGGGTGCCGC GACGGGTATG CCCCAGGCCG GGGCGGGTTC GGGCTACCCG GATGAGCCGC CGCCCTCGCT GATTCCCGAC GGTCTGCTGG ACCTGCTGTC CCTCGGTGCT TCGGGGGCGC CGCTCCCGCC CGAGCTGAAG GCCCGGGCCC AGGCGATGCA GGAACGGGCG ATGCCGACCC GTGTCGGCCG TACGCGGTTC GAGGGCGAGG GCCTCGCCGT CGCGCCGTGG TCGGCGGAGA ACCCGCGTCT GTACCCGTTG GAGGTCGAAC TCGTCGCGCC GGACGGTGCG GTCGTCGAAC ACGCCACGAT CCGGGTCGGT TTCCGCCGGG TCGAGATCCG CGGTCGGAAC CTGCTGGTCA ACGGCGGCCG GGTGTGGATC CAGGGCGTGA ACCGGCACGA CTTCAACGCG CGGACCGGCC GGGTCATCAC CGCCGGGCAG CTGCGCGCCG AACTCGCGCT GCTCAAGCGG TTCAACGTCA ACGCGGTTCG CACTTCCCAC TACCCCAACG ACCCGCTGTT CCTCGATCTG TGCGACGAGT ACGGCCTGTA TGTCGTGGAC GAGGCGAACA TCGAGGCGCA CGCCCATGCC GGAACGGTCT GCGGGGACCC GCGCTACCTC GGCGCGTTCG TGGACCGGGT GTCGCGCATG GTGCTGCGCG ACAAGAACCA TCCCTGCGTG ATCTTCTGGT CGCTCGGCAA CGAGAGCGGT TACGGCCCCA ACCACGACGC CGCGGCTGGG TGGGCCCGCG CCTACGACCC CAGCCGGCCC CTGCACTACG AAGGGGCGAT CAGCGCCGAC TGGCACGGCG GCCACCGGGC CACCGACGTC GTCTGCCCCA TGTACCCCGC CTTCGACGCG CTGCGCGCCT ACGCGGCGCA CCCCGACGCC GACCGCCCCG TGATCCTCTG CGAGTACGCC TACTCCCAGG GCAACTCGAC CGGCGGGCTC GGCACCTACT GGGACCTGTT CGAGTCCACT CCCGGCCTGC AGGGCGGGTT CATCTGGGAG CTCTACGACC ACGGCCTCGA CCCCGACGGC GACGGCCGGT TCCGCTATGG CGGGGACTTC GGCGACCAGC CCAACGACGG CGTCGTCTGC ATCAACGGCA TCCTGTTCTC CGACGGCGCC CCCAAGCCCG CCTTCCACGA GGCGCGCCAC CTCTTCGCGC CCGTCCGGGT CCTGTCCGGT GCCACCGAGG CGCGTCTCGG CCGGGTGCGG CTGAGGAACC GGCAGACCTT CGAGGACCTC TCCGGTCTCC GCCTCGCGCT CCACGTCGAA CGGACCGACG GGCCGGGCGA TCCGACGCTG GTCGCGGCGC CGGCCATCCC CGCCGGCGGC GAAGGCGTCC TCGACCTCCC CGAGACAGTC ACCGCGCAGC TCACCGGGCC GGATGCCGTC GCCCTGGCCC TCGTCGTCCA GCTCGCCGAC GCGACGCCGT GGGCCGAGCA GGGCACCGAA CTCGCCCGGC TCCAGGTCCT GCTCGACGTC GATGTGCCTG ACCTGATCGA AACCCGTCCC GCCACCGGCG TGCTGCGCCT GGACGGCGAC GGGCTGCTCC AGCACCCCGT CCTGAGCGCG GCACCGGTGT TGTCCTTCTG GCGCGCGCCG ACCGACAACG ACACCTCCAT CGGCCTCGAC TCCCGGTTCG TGCGCACCGG TCTGTTCCGC GTCACCCGCA CGCTGGTCGA CCAGAAGATC ACCGGCTCCA CGGCGACGAT CGTCAGCCGG TACACCGCCG CCTACGGCGC GGAGATCGAA CACCGGCAGC GAATCACCGC CCTGTCCGAC ACCAGCTTCC GTTTCGACGA ACACGTCACC CTGCCGGAGG AGCTCGACGA CATCCCCCGC CTGGGCGTCA CCTTCGCCAC CAACCCCGGC TTCGAACACC TCACGTGGTT CGGCCTCGGG CCCCACGAGA CCTACCCCGA CCGGAAGAAG TCAGGACTTC TCGGCCGCTG GACCTCCCAG GTCGACGACC TGTTCGTCCC CTACCTGCTG CCGCAGGAGA ACGGCGGCCG CGCCGACGTC CAAGAACTCA CCCTCACCGG CCCCGACGGG TACGCAATCA CCATCAGCAC CGACCGACCC GTGCAGATGA ACGTGTCGCA CTACCAGGTA GCCGACCTGG AACCCGCCCG GCACACCTGG GAGCTCAGGC CCCGGGCGGA GACGTACGTT CACCTCGACC TCGCGCACCG GGGACTCGGC ACCGGAGCCC TCGGCCCGGA CACCCTGGCG TGGTACCGGG TACGCGGCGG CAGGTACGAG TGGTCCTGGC AGCTTGATCT CACCGCCCCG CAGCGCTGA
|
Protein sequence | MTVNGTGTSA IQPWDAPELT SANRVPMHAV PHADRLVLDG VWDFQLLPGP LAERGEQWRT VEVPGVWTMQ DSGDLPQYTN VVMPFDSPFP HPPEANPTGV YRRGFTAAAD WTGRRVVLHV GAAESVLLVR VNGRDVGFSK DSHLAAEFDV TRFVRPGTNE LELTVVKWSD ASFVEDQDHW WHGGITRSVY LYTTAPVYLA DIQAMADLHV TAAATGSLRL DVKIGGAGTR TAGWTVRSRI AGLYEPDPQP VLEAGAATGM PQAGAGSGYP DEPPPSLIPD GLLDLLSLGA SGAPLPPELK ARAQAMQERA MPTRVGRTRF EGEGLAVAPW SAENPRLYPL EVELVAPDGA VVEHATIRVG FRRVEIRGRN LLVNGGRVWI QGVNRHDFNA RTGRVITAGQ LRAELALLKR FNVNAVRTSH YPNDPLFLDL CDEYGLYVVD EANIEAHAHA GTVCGDPRYL GAFVDRVSRM VLRDKNHPCV IFWSLGNESG YGPNHDAAAG WARAYDPSRP LHYEGAISAD WHGGHRATDV VCPMYPAFDA LRAYAAHPDA DRPVILCEYA YSQGNSTGGL GTYWDLFEST PGLQGGFIWE LYDHGLDPDG DGRFRYGGDF GDQPNDGVVC INGILFSDGA PKPAFHEARH LFAPVRVLSG ATEARLGRVR LRNRQTFEDL SGLRLALHVE RTDGPGDPTL VAAPAIPAGG EGVLDLPETV TAQLTGPDAV ALALVVQLAD ATPWAEQGTE LARLQVLLDV DVPDLIETRP ATGVLRLDGD GLLQHPVLSA APVLSFWRAP TDNDTSIGLD SRFVRTGLFR VTRTLVDQKI TGSTATIVSR YTAAYGAEIE HRQRITALSD TSFRFDEHVT LPEELDDIPR LGVTFATNPG FEHLTWFGLG PHETYPDRKK SGLLGRWTSQ VDDLFVPYLL PQENGGRADV QELTLTGPDG YAITISTDRP VQMNVSHYQV ADLEPARHTW ELRPRAETYV HLDLAHRGLG TGALGPDTLA WYRVRGGRYE WSWQLDLTAP QR
|
| |