Gene Francci3_2770 details

Gene Information

Locus tag: Francci3_2770
Symbol:
ID: 3906481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3266082
End bp: 3267503
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 71%
IMG OID: 637880093
Product: 4-hydroxyphenylacetate 3-hydroxylase
Protein accession: YP_481859
Protein GI: 86741459
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2368] Aromatic ring hydroxylase
TIGRFAM ID: [TIGR02309] 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component
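
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3266082
end_bp = 3267503

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1422 bp) and Protein Length (473 aa).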


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGGG AGCGGTACCG GGCGAGCCTC AACGACGGCC GCGAGGTGTG GCTCGACGGG 
GAGAAGGTCG CGAACGTGGC GACGCATCCA GCCTTCCGGG GCGCCGTTGA CGAGCTGGCC
CGGCTGTTCG ACCTGCAGCA CGACCCGGAT CTGCGCGACG TGCTCACCGT CCACGAACCC
GAGGCGGGAA CCCGCGTCGG CTGGTCCTAC CAGCTGCCGC GCACGCTGGA CGATCTGCGG
GCCCGGCGGC GCAGCACGGG CGTGTGGATG CGCGAGTCCT GGGGTCAGCA CGGCCGGTCG
CCGTCGTTCA TGGCGAACGT GGCGGTGGGT CTGCTCGACT TCCGGGAGCA TCTGGAGTGC
AACCAGGCCG GCTTCGGCGC CAACGCGGTG GCCTTCTACC GGCACTGTGC GGCCAATGAC
CTGGTCCTCG GCCACGCTTT GGGCGACCCG CAGATCGACC GGACCACCAG CCCCGTGGAC
GATCCCGATC TCGCGCTGCG CATCATCGCC GAGCGGGACG ACGGCGTCGT CGTCCGGGGC
GCCAAGCAGC TCACGACGCT GGCCCCACTG GCCCACGAGG TGCTCGTGTA CCTCTCGGCG
AGCTTCGCGC AGCGCGAGGC CGAGCAGTTC GTGATCTGGT TCGGCCTGCC GCTGGCTACC
CCAGGCGTGA AGATCCTCTG TCGGGAGCCG CTGGGGGCCC GGCCGTACGG GCACGCCCAC
GCGCTCGCGT CCCGGTTCGA CGAGCAGGAC GCCATGCTGT TCTTCGACGA CGTGCTGGTT
CCGTGGGACC GGGTGTTCCT GCTGCGCGAC GGCAACCTCG CCCGGGTCGG ACTCGGACGG
ATCAACGCCT GGAGCGCGGC CGCCGGCCAC ATCCGCTACC AGGAACGGCT GCGGACCCTG
CTCGCCGTCG GCACGCTGCT CGCCGACGCG ATCGGAGCCT CGGGGCTGCG TCACGTCCAG
GAGGACCTCG GGGAGTTGGC GAGCTACGTT GACCTCGTCG GGTACTTTCT CGACGCGGGG
GAGGCCAGGG CCGAGACGAC GGACGGCGGC CTGCTCGCGC CGGGTAACAC CGACGCGAGC
CGCGTGTGGT CGGCGCAGGT CGCCGGCCGG GCGGTGGAGA TCGTGCGGCG GATCGGCTCG
TCGGGTGTCC TCATGCAGCC CAGCGAGCGT GACCTCGCCG CCGCCGACCT GCGGCCCTAT
CTCGATCGGT ACATGCGGGG CGCGAACCTC CCGGTCGAGG AGAAGTCACG GCTGTTCCGG
CTGGCCTGGG AGCTCACGGC GGACAGCTTC GGGCAGCGTC AGGACCTCTA TGAGTTCGTC
CACCGCGGGG ACATCACCCG CAACCGGATC AACCTGCTCC GCCGGGATGA CCTGGGACCT
GTCACCGACC AGATCCGGGA GCTGATCACC CGTCCGCTCT GA
 
Protein sequence
MTGERYRASL NDGREVWLDG EKVANVATHP AFRGAVDELA RLFDLQHDPD LRDVLTVHEP 
EAGTRVGWSY QLPRTLDDLR ARRRSTGVWM RESWGQHGRS PSFMANVAVG LLDFREHLEC
NQAGFGANAV AFYRHCAAND LVLGHALGDP QIDRTTSPVD DPDLALRIIA ERDDGVVVRG
AKQLTTLAPL AHEVLVYLSA SFAQREAEQF VIWFGLPLAT PGVKILCREP LGARPYGHAH
ALASRFDEQD AMLFFDDVLV PWDRVFLLRD GNLARVGLGR INAWSAAAGH IRYQERLRTL
LAVGTLLADA IGASGLRHVQ EDLGELASYV DLVGYFLDAG EARAETTDGG LLAPGNTDAS
RVWSAQVAGR AVEIVRRIGS SGVLMQPSER DLAAADLRPY LDRYMRGANL PVEEKSRLFR
LAWELTADSF GQRQDLYEFV HRGDITRNRI NLLRRDDLGP VTDQIRELIT RPL
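
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the database's own method. It assumes the standard codon assignments, which translation table 11 shares with the standard genetic code (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and the final line of the gene sequence listed above.
first_block = "ATGACCGGGG AGCGGTACCG GGCGAGCCTC"
last_line = "GTCACCGACC AGATCCGGGA GCTGATCACC CGTCCGCTCT GA"

print(translate(first_block))  # MTGERYRASL
print(translate(last_line))    # VTDQIRELITRPL
```

Both fragments reproduce the corresponding start and end of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would extend the same check to the whole record.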