Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_55069 |
Symbol | FAO1 |
ID | 7198195 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 94995 |
End bp | 97341 |
Gene Length | 2347 bp |
Protein Length | 767 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | peroxisomal bifunctional enzyme |
Protein accession | XP_002184394 |
Protein GI | 219128384 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0170037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTGTATGTA GCATGACCCG TTCGCAGGTC GTTTCATCCT CGTCGCAGGG TCTCCCCACG GATACCACGG CAGTAATTTG TCTAGCGGAC GCTGCCGCCG GAGCGTCGCC GTTGCATCCA CTCAACGCGA AACTCCGTCT CTATCTACTC CAGCAACTCC GTGCGGCGGA AGACAATCCC GCCGTGACGT CTCTCATCCT TACGGGTGGA GGTCCAAACT TCTCGGCCGG TGCCGATTTG ACCGAATTCG CCTCGTTGAT ACCAGCAGCA GCAACAACAG TTCCAACACC CAGTGACGTC GGGCACCACC ATCCCCACGC TGCCCCTTCC TTGATTGACG TCGTCACGGC GATAGAAGCC TGTCGCAAAC CCATCGTGGC AGCGATTGAC GGAGTCTGTT TGGGTGGCGG TTGTGAACTG GCGCTCGCCT GTCACGCACG GGTCGCCACG GCCCGGGCTC GTCTCGGACT ACCCGAAGTA CACGTCGGGG TCATACCCGG GGCCGGAGGA ACCCAACGAT TGCCCAAACT CGTCGGACTT CGGCAAGCCT TGCCTATGAT TCTACAAGGA TCCACCGTGT CGGCTGCTAG TGCCTTGCGC ATGGGATTGT TGGACGCGAT TGCTCCGGAA GCGACCGCGG ACTCCTTGCG AACGACCGCA CGGCAGTGGG CGGCGTACGG AGAAGTCTTA CCCACCATTC GGCGGACGGG AGACTTGCAG CGTCCGGAAT CACCCGCGGA AGCGCACGCA CTCCTACACG TGGCGGAATT GAGTCTCCCC CCGCTCGGCT CGCACGGCAT GCGGGCCGCC ATTCAAGCTC TCCGAGCCTC CGGTACCCCC ATCCAGCACG GAATGCGAGT GGAAACCACA CACTTTCTCG AAACTCTAGC GGAGTCCGCA AGGACGTGCC CGACGGCACG TCTTTTTTGC CACCAGGCGA GCACAAAAAG TGTGGAGTCC GGTGTCGAAC GCTGTCCCGA CGGCCGGACC TCCGCACGGG CTCTGGTCCA AGGACGACGC ACAGTCGGCC GTGCCCGTCG CCGTGGTCGG AGCCGGTACC ATGGGCAGTG GTATTGCCCT CGTCCTGCTC CAAGCCGGCT TTCACGTGAC ATTGGTCGAC GTCCACGCGC CAGCTTTGGC CAAAGGAATG GAATTTCTAA AACGCACCCT CGCCTCTATG GTTCAACGTC GTCAACTCAA ACCGACTCAA TTAACCGCAC TCGAGGCAAA GTTACGGGCA ACGTCCAATT TACAAGAATT GTCGCAGTGT CTACTCGTGG TCGAGGCCGT CGTGGAAAAA CTTATCGTCA AACAGAGCAT TTTTGCCACG CTCGACAAAG TCACTCCCCC GACCGCTCTG CTCTTGAGCA ATACCAGCAC GCTAGATATT GACGCCATGG CGTCGGCCGT TTCCAGCCGC CGGCGCGGAT TGTTCGCCGG TTGGCACTTT TTCTCCCCCG CCCATCGCAT GAAACTCGTC GAAATAGTCC GTGGATCGGC CACGTCCGAC GACACCACGG CGCTGCTACA AGCCTTGACG AAGCAAATTG GCAAAATCGG GGTTGTGGTG GGCAACTGCG ATGGTTTCTG CGGCAATCGC ATGCTTAAAC CCTACTCGGC CGAAACCGTA CTCCTCTTGA CGGAGACACA GACCACTGTA GCCGCACAAG ATTACGCGAT TCGGGGAGTG TACGGAATGG CCTTGGGACC TTTCGAAATG GCCGACTTAG CCGGAAATGA CGTCGGCTAC AACATTCGCG TAGAACGTCA GTGGGCCCGC AGAGCCAGGG ACGATCCCCT TCCGCCCAAC CGCCCCGCGC GGTATACGGA ACTGCCCGAC GTGATGATAT CTGATTACGG TCGACTGGGA CAAAAGGTAG GAAAGGGATG GTACGACTAC GATTCGAATA TTGGCAAAGG TCGTAAGCCG TTGCCGTCGT CCGAAATGGA TGCCTTGATT CGACGGTACC TGGCCCCCAA GCCATCGCCA GCTTTGGTCG CTGCAGAAAT AATAGAGCGG GTGTTGTACC CACTGATCAA CGAGGGCTTC AAGTGTCTCG AGGAAAGCAT TGTTCGCCAA CCCAGTGATA TTGATGTGGT GTACGTGTAC GGTTACGGCT GGCCCGTGTG GAGGGGTGGT CCCATGTATG CGGCCGATCA CGAGATTGGC TTGCCTCGTC TGCTGCGAAC CCTTCGTGAA CTGTCGAAGC AATTTCCCAC TACGGAACAT TACGTCCCGT CGGCGCTACT CGTTGAATGC GTCGCACGGA AAGTAACCGT GGAAGAGTAC TATCAGAAGA ACTATCACAC GGCATCGACT GGATCAGCAA TGCTCTCCAA GCTTTAG
|
Protein sequence | MTRSQVVSSS SQGLPTDTTA VICLADAAAG ASPLHPLNAK LRLYLLQQLR AAEDNPAVTS LILTGGGPNF SAGADLTEFA SLIPAAATTV PTPSDVGHHH PHAAPSLIDV VTAIEACRKP IVAAIDGVCL GGGCELALAC HARVATARAR LGLPEVHVGV IPGAGGTQRL PKLVGLRQAL PMILQGSTVS AASALRMGLL DAIAPEATAD SLRTTARQWA AYGEVLPTIR RTGDLQRPES PAEAHALLHV AELSLPPLGS HGMRAAIQAL RASGTPIQHG MRRSPQGRAR RHVFFATRRA QKVWSPVSNA VPTAGPPHGL WSKDDAQSAV PVAVVGAGTM GSGIALVLLQ AGFHVTLVDV HAPALAKGME FLKRTLASMV QRRQLKPTQL TALEAKLRAT SNLQELSQCL LVVEAVVEKL IVKQSIFATL DKVTPPTALL LSNTSTLDID AMASAVSSRR RGLFAGWHFF SPAHRMKLVE IVRGSATSDD TTALLQALTK QIGKIGVVVG NCDGFCGNRM LKPYSAETVL LLTETQTTVA AQDYAIRGVY GMALGPFEMA DLAGNDVGYN IRVERQWARR ARDDPLPPNR PARYTELPDV MISDYGRLGQ KVGKGWYDYD SNIGKGRKPL PSSEMDALIR RYLAPKPSPA LVAAEIIERV LYPLINEGFK CLEESIVRQP SDIDVVYVYG YGWPVWRGGP MYAADHEIGL PRLLRTLREL SKQFPTTEHY VPSALLVECV ARKVTVEEYY QKNYHTASTG SAMLSKL
|
| |