Gene Franean1_4133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4133 
Symbol 
ID5672491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4915169 
End bp4916608 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content69% 
IMG OID641243009 
Productpentachlorophenol monooxygenase 
Protein accessionYP_001508426 
Protein GI158315918 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.636421 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGATG TGATAGTGGT GGGCGGCGGA CCGACCGGCC TGATGCTCGC CAGTGAACTG 
CGACTGCATG GCATCGACGT GCGGATCCTC GAGAAGGACA CCGAGCCGAC CCCGGTCGTC
CGCTCCCTCG GCCTCCACGC CCGCAGCATC GAGATCATGG ACCAGCGTGG CCTGCTCGAC
CGGTTCCTCG CCCTCGGCAC CCAACACCCG GTCGGTGGCT TCTTCGCCGG CATCAGCAAA
CCCGCACCCG ACCAGCTCGA CACCGCACAC CCCTACGTCC TCGGCATCCC ACAGCCCACC
ACCGACCGCC TCCTCGCCGA ACACGCCACC GCACTCGGCG CCGACATCCG CCGCGGCCAC
GAACTGGTCG GACTCAACCA AGACGACGAC ACGGTCACCG CACACCTGGC CGACGGCACA
CACCTCCACG CCCGCTACCT CGTCGGCTGC GACGGCGGCC GCAGCACCGT CCGCAAACTG
CTCGGCGTCC CCTTCCCCGG CGAACCCACC CGACACCAGA CGCTCCTCGC CGAAGCGGAA
CTGACCGCCC CACCGGAGAC GATCAACGCC ATAGTGACCA AGGTCCGCGA AACCCACAAA
CGGTTCGGCG CCGGCCCGCT GGGAAACGGG CTGTACCGCC TCATCGCACC CGCCGACGGG
GTCGTCGAGG ACCGCACCCC ACCCACCCTC GACGAACTCA AACAGCAGAT ACGCAGACTC
GCCGGCACCG ACTTCGGCGC ACACTCACCC CGCTGGCTCT CCCGCTTCGG TGACGCCACC
CGCCAGGCCG AGCGGTACCG CACCGGCCGG GTTCTGCTCG CCGGCGACGC CGCGCACATC
CACCCGCCGA CCGGAGGACA GGGACTCAAC CTCGGCGTGC AGGACGCGTT CAACCTCGGC
TGGAAACTCG CCGCCACGAT CAACGGCTGG GCACCGGCCG GACTGCTGGA CACCTACCAC
ACCGAACGAC ACCCGGTCGC CGCCGACGTC CTCACCAACA CCCGTGCGCA GATGGAGCTG
ATGTCCCTCG ACCCCGGCGC GCAGGCAGTA CGCCGACTCC TGGTCGAACT GATGGACTTC
GACGACGTCA ACCGACACCT CACCGAAAAA ATCATCGCGA TCGGGATCCG CTACGACTTC
GGCGCCGGCC ATCATCTGCT CGGCCGGCGG ATGCGCGACA TCGCGCTCAA ACGCGGCCGC
CTCTACCCGC TGATGCACCA CGGCCGCGGA CTGCTCCTCG ACCAGACCGG ACAGCTCTCC
GTCACCGGCT GGGCGGACCG GGTCGACCAC ATCATCGACG TCAGCGACGA ACTCGACGTC
CCCGCCGCGC TCCTACGCCC CGACGGCCAC ATCGCCTGGG TCGGCGACAA CCAGCAGGAC
CTGCTCGGCC ACCTACCCAC CTGGTTCGGC ACCGCCACCA CCCGAGCACA ACGAAGCTGA
 
Protein sequence
MIDVIVVGGG PTGLMLASEL RLHGIDVRIL EKDTEPTPVV RSLGLHARSI EIMDQRGLLD 
RFLALGTQHP VGGFFAGISK PAPDQLDTAH PYVLGIPQPT TDRLLAEHAT ALGADIRRGH
ELVGLNQDDD TVTAHLADGT HLHARYLVGC DGGRSTVRKL LGVPFPGEPT RHQTLLAEAE
LTAPPETINA IVTKVRETHK RFGAGPLGNG LYRLIAPADG VVEDRTPPTL DELKQQIRRL
AGTDFGAHSP RWLSRFGDAT RQAERYRTGR VLLAGDAAHI HPPTGGQGLN LGVQDAFNLG
WKLAATINGW APAGLLDTYH TERHPVAADV LTNTRAQMEL MSLDPGAQAV RRLLVELMDF
DDVNRHLTEK IIAIGIRYDF GAGHHLLGRR MRDIALKRGR LYPLMHHGRG LLLDQTGQLS
VTGWADRVDH IIDVSDELDV PAALLRPDGH IAWVGDNQQD LLGHLPTWFG TATTRAQRS