Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1630 |
Symbol | |
ID | 5670032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1946364 |
End bp | 1948145 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240548 |
Product | hypothetical protein |
Protein accession | YP_001505974 |
Protein GI | 158313466 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.468389 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCACAA ACCCGGCGGG GGTGCTTTCC CGCGGCGTTC ACCCTGAGTC CACCGCCCGG CAGCCGCAGG CTCCGAGAAT TTCTCTCAGC CGCTCCTCCG ACGGGTCGGC GGGAGGTTTT CTCGTGGACC ACGCGGTTTA CCACGTGCTC GAGCAGAGCC GGGAATACGA GGTCGATCTT GTCGTCACCG CCGGCCAGTG GCCGGACGGG CTGACCGGGT TCGCATTTGT CGTCGGGCCG GCCCAGCCGA CGGTGCTGGA CTTCGCGCCG AGCGGACCGG GAATGCTGAC CCGGGTCGAT CTGGCGAAAC GCACCTGGCG GACCCGCCGA GTGGTCACCC CGGACCTGGC GATGCTCGGC GGGCTGCGTG CCGCACTGGC TCCGGAGGAG CTCCAGCGGC TGCTGACCGG CGCCCACCCG TCACTCAGCC ACACCGCGCC GCACTTCTTC GGCGACCGGC TGCTGCTCAC CGCCGACCGG CAGCGGCCGG TGGAGCTCGA CCCGGTCACG ATGACCTACC GGACGTTCCT CGGCGCGGTA CACGAGTACC CCCAGGTCGG GGCGCACCCG CTGTTCCCCG GGGTGCAGAC GACGTCCCAC CCGGTGGTCG ACCCCGACGA GGGCTGTCTG TGGTGGAGCA ACATCCACCT GCGCCCGCGC GGCCGGTCGA CCACCGACGT GGAGGGCCCG CTGTCGGTGG TGCGCTGGGA CGGCCACGGT GAGCTGGAGA CCTGGGAGGT ACCCGGGGCG CGCATCACCC AGGGCACCCA CGAGATCGCG GTGACCCGCG ACTACGTCAT CTTCACCGAG ATCGGCTTCC AGCCGGAGCC CGGCAGCGTC GCCGGCCGCG GCCGCACCAG GCCGCACCTG CCGTTCACCG ACATCTATCT GGTCGCCAAG CGGGACCTGA CCCGTGCCCG GGTCGGCTCC GCCGTGCCGG TGACGCACGC GCGGGTCCCC TACGAGTCGT TCCACGAGTT CGCCGACTAC GGCCAGGACG GCGACGACGT CACGATGTAC GTCGCGCACT CGAACGGCTG GGACATGAAC TACGCCATCA CCCGCTCGGA CACCGTTTGG CGCACCGGGG ACAGGCTGCG CAGCTGCCTG TCCGGGTTCA TGCCGACGCC GGTCGACGCC GCACCGGTCG GGCGGCACGT CATCGACGGC CGTACCGGGC AGGTGCGGCA GAGCAGGTAC TTCCTCGACC CGCAGCGGCA CTGGGGAACC CTGCTCTACG CCCGCGACAC CCGCCCGGCG GCGCTCGAGC GGGGCCGCCA CCTGTGGCAG GCGTACTGGG GCGCCACGCC CGACACGATG GTCTCGGCGA TCGTCGAGAT GTACGCCGAC CATCCGTTCC GGGTGGTCGG CGTCGACGAC CTGCCCACGA CCGAGATCCC GTCGTCGCTG GTCTGCATCG ACCTGGAGTC GATGACCGAG CAGTCGGCGT GGACGTTCCC GGCCGGGACG ATCTGCGAGT CCCCGGTCTT CGTGCCGGAC AAGGCCGGCG GCGATGGCTG GGTGGTGGTC TTCGTCAAGC ATGCGGACCG CACCGAACTG CAGGTCTTCG ACGCCCTGGC GCTGGATCTC GGCCCGTGCG CCGTGGTGAC GGCGCCAGGC CTGCGAATGC CCGTGCTGTT CCACTCGGGC TACACGGAGA CCATCCGCTC CCCCGGTACC GACTACCGGC GCTCGTTCGC CGCCGACCTC GGCACCGGAT GGCGCGACCT CTCCCCGGCC GCGCGCGCCA TCGTCACCGA GATCGTGGAG GCGTTCGGCT AG
|
Protein sequence | MGTNPAGVLS RGVHPESTAR QPQAPRISLS RSSDGSAGGF LVDHAVYHVL EQSREYEVDL VVTAGQWPDG LTGFAFVVGP AQPTVLDFAP SGPGMLTRVD LAKRTWRTRR VVTPDLAMLG GLRAALAPEE LQRLLTGAHP SLSHTAPHFF GDRLLLTADR QRPVELDPVT MTYRTFLGAV HEYPQVGAHP LFPGVQTTSH PVVDPDEGCL WWSNIHLRPR GRSTTDVEGP LSVVRWDGHG ELETWEVPGA RITQGTHEIA VTRDYVIFTE IGFQPEPGSV AGRGRTRPHL PFTDIYLVAK RDLTRARVGS AVPVTHARVP YESFHEFADY GQDGDDVTMY VAHSNGWDMN YAITRSDTVW RTGDRLRSCL SGFMPTPVDA APVGRHVIDG RTGQVRQSRY FLDPQRHWGT LLYARDTRPA ALERGRHLWQ AYWGATPDTM VSAIVEMYAD HPFRVVGVDD LPTTEIPSSL VCIDLESMTE QSAWTFPAGT ICESPVFVPD KAGGDGWVVV FVKHADRTEL QVFDALALDL GPCAVVTAPG LRMPVLFHSG YTETIRSPGT DYRRSFAADL GTGWRDLSPA ARAIVTEIVE AFG
|
| |