Gene Information |
Locus tag | Franean1_4133 |
Symbol | |
ID | 5672491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4915169 |
End bp | 4916608 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641243009 |
Product | pentachlorophenol monooxygenase |
Protein accession | YP_001508426 |
Protein GI | 158315918 |
COG category | [C] Energy production and conversion; [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
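The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(4915169, 4916608)
print(length, protein_length_aa(length))  # 1440 479
```

Under those assumptions the result agrees with the listed Gene Length (1440 bp) and Protein Length (479 aa).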
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.636421 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGATG TGATAGTGGT GGGCGGCGGA CCGACCGGCC TGATGCTCGC CAGTGAACTG CGACTGCATG GCATCGACGT GCGGATCCTC GAGAAGGACA CCGAGCCGAC CCCGGTCGTC CGCTCCCTCG GCCTCCACGC CCGCAGCATC GAGATCATGG ACCAGCGTGG CCTGCTCGAC CGGTTCCTCG CCCTCGGCAC CCAACACCCG GTCGGTGGCT TCTTCGCCGG CATCAGCAAA CCCGCACCCG ACCAGCTCGA CACCGCACAC CCCTACGTCC TCGGCATCCC ACAGCCCACC ACCGACCGCC TCCTCGCCGA ACACGCCACC GCACTCGGCG CCGACATCCG CCGCGGCCAC GAACTGGTCG GACTCAACCA AGACGACGAC ACGGTCACCG CACACCTGGC CGACGGCACA CACCTCCACG CCCGCTACCT CGTCGGCTGC GACGGCGGCC GCAGCACCGT CCGCAAACTG CTCGGCGTCC CCTTCCCCGG CGAACCCACC CGACACCAGA CGCTCCTCGC CGAAGCGGAA CTGACCGCCC CACCGGAGAC GATCAACGCC ATAGTGACCA AGGTCCGCGA AACCCACAAA CGGTTCGGCG CCGGCCCGCT GGGAAACGGG CTGTACCGCC TCATCGCACC CGCCGACGGG GTCGTCGAGG ACCGCACCCC ACCCACCCTC GACGAACTCA AACAGCAGAT ACGCAGACTC GCCGGCACCG ACTTCGGCGC ACACTCACCC CGCTGGCTCT CCCGCTTCGG TGACGCCACC CGCCAGGCCG AGCGGTACCG CACCGGCCGG GTTCTGCTCG CCGGCGACGC CGCGCACATC CACCCGCCGA CCGGAGGACA GGGACTCAAC CTCGGCGTGC AGGACGCGTT CAACCTCGGC TGGAAACTCG CCGCCACGAT CAACGGCTGG GCACCGGCCG GACTGCTGGA CACCTACCAC ACCGAACGAC ACCCGGTCGC CGCCGACGTC CTCACCAACA CCCGTGCGCA GATGGAGCTG ATGTCCCTCG ACCCCGGCGC GCAGGCAGTA CGCCGACTCC TGGTCGAACT GATGGACTTC GACGACGTCA ACCGACACCT CACCGAAAAA ATCATCGCGA TCGGGATCCG CTACGACTTC GGCGCCGGCC ATCATCTGCT CGGCCGGCGG ATGCGCGACA TCGCGCTCAA ACGCGGCCGC CTCTACCCGC TGATGCACCA CGGCCGCGGA CTGCTCCTCG ACCAGACCGG ACAGCTCTCC GTCACCGGCT GGGCGGACCG GGTCGACCAC ATCATCGACG TCAGCGACGA ACTCGACGTC CCCGCCGCGC TCCTACGCCC CGACGGCCAC ATCGCCTGGG TCGGCGACAA CCAGCAGGAC CTGCTCGGCC ACCTACCCAC CTGGTTCGGC ACCGCCACCA CCCGAGCACA ACGAAGCTGA
|
Protein sequence | MIDVIVVGGG PTGLMLASEL RLHGIDVRIL EKDTEPTPVV RSLGLHARSI EIMDQRGLLD RFLALGTQHP VGGFFAGISK PAPDQLDTAH PYVLGIPQPT TDRLLAEHAT ALGADIRRGH ELVGLNQDDD TVTAHLADGT HLHARYLVGC DGGRSTVRKL LGVPFPGEPT RHQTLLAEAE LTAPPETINA IVTKVRETHK RFGAGPLGNG LYRLIAPADG VVEDRTPPTL DELKQQIRRL AGTDFGAHSP RWLSRFGDAT RQAERYRTGR VLLAGDAAHI HPPTGGQGLN LGVQDAFNLG WKLAATINGW APAGLLDTYH TERHPVAADV LTNTRAQMEL MSLDPGAQAV RRLLVELMDF DDVNRHLTEK IIAIGIRYDF GAGHHLLGRR MRDIALKRGR LYPLMHHGRG LLLDQTGQLS VTGWADRVDH IIDVSDELDV PAALLRPDGH IAWVGDNQQD LLGHLPTWFG TATTRAQRS
|
| |
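The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own method, of how one might clean a grouped nucleotide string, estimate its GC fraction, and run a basic sanity check on a CDS. All function names are hypothetical. The start-codon test accepts only ATG, which is a simplification: translation table 11 also allows alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, terminal stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# The first two blocks of the gene sequence above, used as a short example.
example = clean_sequence("ATGATTGATG TGATAGTGGT")
print(len(example), gc_fraction(example))
```

Pasting the full Gene sequence field into `clean_sequence` would allow the reported GC content and the 1440 bp length to be compared against computed values.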