Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2454 |
Symbol | |
ID | 3905066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2861770 |
End bp | 2862837 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637879784 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_481550 |
Protein GI | 86741150 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.19659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATTC ATTCGATCGA CCACATTGAG CTGTTCGTCG AGGACGCCGC GCAGGCTGCG GCCGAACTGT GCGATTCCTT CGGTTTCACC GTCACCGGCC GAGGCGGGCC GCGCACCGGG CTCAAGGGCT GCGAGTCGGT GCTGCTGCGG CAGTGCGACA TCACTGTCGT CGTCACCGCC GCGACCAGCT CCGACCACCG CGCCGCCGAG TTCGTCCGTC GGCACGGCGA CGGAGTGGCG GTCATCGGCT TCGCCGTCGA CCAGGCGCAG GCCGCGTTCG CCGAGGCGGT GAATCGTGGA GCGGTGCCGG TCACGCCGCC CGAGACCCTG GGAACGCCGG GCGGCCGCGT GACCTTCGCG TCGGTGGCCG GGTTCGGCGA CGTGGAGCAC CGCTTCACCT CCCGGGAGGC GGTCGAGGGG CCCTTCTCGC CCGGCCTCAT CGAGGAGACC GTCCCAGACC GCTCCAACGA AGGCCTGCTC AGGGCCATCG ACCACGTCGC GGTCTGCCTG CCCGCCGGCG AACTGCACCC GACCGTACGC GCCTATCGGG ACGTGTTCGG CTTCACCCGG ACCTTCGAGG AGCGCATCGT GGTCGGCTCT CAGGCCATGG ACTCCCAGGT GGTGCGCAGC CCGTCCGGCA AGGTCACCTT CACCATCATC GAACCGGACA CCACCCGCGC CCCCGGCCAG ATCGACGAGT TCGTCCGCTC GCACGGCGGG GCGGGAATCC AGCACATCGC GTTCCGCACC GACGACATCA CGGCGGCGGT CCGGGACAGC GCGAAGCGCG GGGTGCGGTT CCTCACCACC CCGGCGAGCT ACTACGAGGC GCTGCCGGCG CGGCTCGGCC CGGTCGGCGT CCCGGTGGAG ACGCTGCGTG AGCTCAATAT CCTGGCCGAT CGCGACCACG GCGGCGTCAT GCTGCAGATC TTCACCGCGT CCCGGCACCC CAGGCGGACC TTTTTCCACG AGCTGATCGA CCGCCGCGGC GCCCACACGT TCGGCAGCAA CAACATCAAG GCCCTGTACG AGGCCGTCGA ACGCCAACGG GCCGCCGAGA GCGCCTGA
|
Protein sequence | MDIHSIDHIE LFVEDAAQAA AELCDSFGFT VTGRGGPRTG LKGCESVLLR QCDITVVVTA ATSSDHRAAE FVRRHGDGVA VIGFAVDQAQ AAFAEAVNRG AVPVTPPETL GTPGGRVTFA SVAGFGDVEH RFTSREAVEG PFSPGLIEET VPDRSNEGLL RAIDHVAVCL PAGELHPTVR AYRDVFGFTR TFEERIVVGS QAMDSQVVRS PSGKVTFTII EPDTTRAPGQ IDEFVRSHGG AGIQHIAFRT DDITAAVRDS AKRGVRFLTT PASYYEALPA RLGPVGVPVE TLRELNILAD RDHGGVMLQI FTASRHPRRT FFHELIDRRG AHTFGSNNIK ALYEAVERQR AAESA
|
| |