Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3783 |
Symbol | |
ID | 5672147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4485921 |
End bp | 4486907 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242662 |
Product | intradiol ring-cleavage dioxygenase |
Protein accession | YP_001508082 |
Protein GI | 158315574 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3485] Protocatechuate 3,4-dioxygenase beta subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.83161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGATC ACCGGAGTCC CAGCCGCACG CGGGCGTACG AGGGGCGCCC GCTGGCCCGG CCCGACGAGG AGATCGTTGA CCAGGGCCTC GCCTTCGACG TCGGCACACT GCTGAGCCGG CGGCGGATGC TGGCGTTCTT CGGCCTCGGC GCCGCGTCGG CCGGCCTGGC GGCCTGCGGC TTGGACTCCG CCGGATCCAC TGCCTCAGCC ACCTCGGGGA CGTCCGCCTC GGCGGCCTCG GCCGCGACGA CGTCGGCCAC GATCGGCGCG GCGGCCGGGG AGATCCCCGA GGAGACCGCG GGCCCCTACC CGGGCGACGG GTCCAACGGG CCGGACGTCC TCGAGCAGAG CGGTGTGGTC CGCAGTGACA TCCGGTCCAG CTTCGGCGAC TCGACCGGTA CCGCCGAAGG CGTCCCCATG ACGCTGGCGC TGACGGTCCG CGACCTCGCG AACAGCGGCA CGCCCTTCGC CGGGGTGGCC GTGTACGTGT GGCACTGCGA CCGAGAGGGC CGCTACTCGC TGTACTCCGA CGGCGTCACC GACCAGAACT ACCTGCGCGG GGTCCAGGTC GCCGACTCCG CCGGCATGGT CCGTTTCACC AGCGTCTTCC CGGCGTGCTA CTCGGGACGC TGGCCGCACG TCCACTTCGA GGTCTATCCC GACCAGGCCA GCATCACCGA CTCGTCCAAG GCCATCGCCA CCTCGCAGCT CGCGCTGCCG CAGGACGTCT GCGCCAAGGT CTTCACGCAG CCGGGCTACG AGGCGTCCGT GCGCAACCTG GCGCAGGTCA GCCTCGACAG CGACGGTGTC TTCGGGGACG ACGGGGCTGC CAGCCAGCTC GCCACCGTCA CCGGTGATGT CACCGGCGGC TACGCGGTCT CTCTCGCCTT GGGCGTCGAC ACGTCGACGG CCGCGGGCGG CGGTCAGATC TCCGGCGGCG GCGCGCCGGG CGGCGCACCC GGGGGCCGGG CGCCTGGCGG GCGGTGA
|
Protein sequence | MADHRSPSRT RAYEGRPLAR PDEEIVDQGL AFDVGTLLSR RRMLAFFGLG AASAGLAACG LDSAGSTASA TSGTSASAAS AATTSATIGA AAGEIPEETA GPYPGDGSNG PDVLEQSGVV RSDIRSSFGD STGTAEGVPM TLALTVRDLA NSGTPFAGVA VYVWHCDREG RYSLYSDGVT DQNYLRGVQV ADSAGMVRFT SVFPACYSGR WPHVHFEVYP DQASITDSSK AIATSQLALP QDVCAKVFTQ PGYEASVRNL AQVSLDSDGV FGDDGAASQL ATVTGDVTGG YAVSLALGVD TSTAAGGGQI SGGGAPGGAP GGRAPGGR
|
| |