Gene PHATRDRAFT_55069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_55069 
SymbolFAO1 
ID7198195 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp94995 
End bp97341 
Gene Length2347 bp 
Protein Length767 aa 
Translation table 
GC content58% 
IMG OID 
Productperoxisomal bifunctional enzyme 
Protein accessionXP_002184394 
Protein GI219128384 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0170037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATTGTATGTA GCATGACCCG TTCGCAGGTC GTTTCATCCT CGTCGCAGGG TCTCCCCACG 
GATACCACGG CAGTAATTTG TCTAGCGGAC GCTGCCGCCG GAGCGTCGCC GTTGCATCCA
CTCAACGCGA AACTCCGTCT CTATCTACTC CAGCAACTCC GTGCGGCGGA AGACAATCCC
GCCGTGACGT CTCTCATCCT TACGGGTGGA GGTCCAAACT TCTCGGCCGG TGCCGATTTG
ACCGAATTCG CCTCGTTGAT ACCAGCAGCA GCAACAACAG TTCCAACACC CAGTGACGTC
GGGCACCACC ATCCCCACGC TGCCCCTTCC TTGATTGACG TCGTCACGGC GATAGAAGCC
TGTCGCAAAC CCATCGTGGC AGCGATTGAC GGAGTCTGTT TGGGTGGCGG TTGTGAACTG
GCGCTCGCCT GTCACGCACG GGTCGCCACG GCCCGGGCTC GTCTCGGACT ACCCGAAGTA
CACGTCGGGG TCATACCCGG GGCCGGAGGA ACCCAACGAT TGCCCAAACT CGTCGGACTT
CGGCAAGCCT TGCCTATGAT TCTACAAGGA TCCACCGTGT CGGCTGCTAG TGCCTTGCGC
ATGGGATTGT TGGACGCGAT TGCTCCGGAA GCGACCGCGG ACTCCTTGCG AACGACCGCA
CGGCAGTGGG CGGCGTACGG AGAAGTCTTA CCCACCATTC GGCGGACGGG AGACTTGCAG
CGTCCGGAAT CACCCGCGGA AGCGCACGCA CTCCTACACG TGGCGGAATT GAGTCTCCCC
CCGCTCGGCT CGCACGGCAT GCGGGCCGCC ATTCAAGCTC TCCGAGCCTC CGGTACCCCC
ATCCAGCACG GAATGCGAGT GGAAACCACA CACTTTCTCG AAACTCTAGC GGAGTCCGCA
AGGACGTGCC CGACGGCACG TCTTTTTTGC CACCAGGCGA GCACAAAAAG TGTGGAGTCC
GGTGTCGAAC GCTGTCCCGA CGGCCGGACC TCCGCACGGG CTCTGGTCCA AGGACGACGC
ACAGTCGGCC GTGCCCGTCG CCGTGGTCGG AGCCGGTACC ATGGGCAGTG GTATTGCCCT
CGTCCTGCTC CAAGCCGGCT TTCACGTGAC ATTGGTCGAC GTCCACGCGC CAGCTTTGGC
CAAAGGAATG GAATTTCTAA AACGCACCCT CGCCTCTATG GTTCAACGTC GTCAACTCAA
ACCGACTCAA TTAACCGCAC TCGAGGCAAA GTTACGGGCA ACGTCCAATT TACAAGAATT
GTCGCAGTGT CTACTCGTGG TCGAGGCCGT CGTGGAAAAA CTTATCGTCA AACAGAGCAT
TTTTGCCACG CTCGACAAAG TCACTCCCCC GACCGCTCTG CTCTTGAGCA ATACCAGCAC
GCTAGATATT GACGCCATGG CGTCGGCCGT TTCCAGCCGC CGGCGCGGAT TGTTCGCCGG
TTGGCACTTT TTCTCCCCCG CCCATCGCAT GAAACTCGTC GAAATAGTCC GTGGATCGGC
CACGTCCGAC GACACCACGG CGCTGCTACA AGCCTTGACG AAGCAAATTG GCAAAATCGG
GGTTGTGGTG GGCAACTGCG ATGGTTTCTG CGGCAATCGC ATGCTTAAAC CCTACTCGGC
CGAAACCGTA CTCCTCTTGA CGGAGACACA GACCACTGTA GCCGCACAAG ATTACGCGAT
TCGGGGAGTG TACGGAATGG CCTTGGGACC TTTCGAAATG GCCGACTTAG CCGGAAATGA
CGTCGGCTAC AACATTCGCG TAGAACGTCA GTGGGCCCGC AGAGCCAGGG ACGATCCCCT
TCCGCCCAAC CGCCCCGCGC GGTATACGGA ACTGCCCGAC GTGATGATAT CTGATTACGG
TCGACTGGGA CAAAAGGTAG GAAAGGGATG GTACGACTAC GATTCGAATA TTGGCAAAGG
TCGTAAGCCG TTGCCGTCGT CCGAAATGGA TGCCTTGATT CGACGGTACC TGGCCCCCAA
GCCATCGCCA GCTTTGGTCG CTGCAGAAAT AATAGAGCGG GTGTTGTACC CACTGATCAA
CGAGGGCTTC AAGTGTCTCG AGGAAAGCAT TGTTCGCCAA CCCAGTGATA TTGATGTGGT
GTACGTGTAC GGTTACGGCT GGCCCGTGTG GAGGGGTGGT CCCATGTATG CGGCCGATCA
CGAGATTGGC TTGCCTCGTC TGCTGCGAAC CCTTCGTGAA CTGTCGAAGC AATTTCCCAC
TACGGAACAT TACGTCCCGT CGGCGCTACT CGTTGAATGC GTCGCACGGA AAGTAACCGT
GGAAGAGTAC TATCAGAAGA ACTATCACAC GGCATCGACT GGATCAGCAA TGCTCTCCAA
GCTTTAG
 
Protein sequence
MTRSQVVSSS SQGLPTDTTA VICLADAAAG ASPLHPLNAK LRLYLLQQLR AAEDNPAVTS 
LILTGGGPNF SAGADLTEFA SLIPAAATTV PTPSDVGHHH PHAAPSLIDV VTAIEACRKP
IVAAIDGVCL GGGCELALAC HARVATARAR LGLPEVHVGV IPGAGGTQRL PKLVGLRQAL
PMILQGSTVS AASALRMGLL DAIAPEATAD SLRTTARQWA AYGEVLPTIR RTGDLQRPES
PAEAHALLHV AELSLPPLGS HGMRAAIQAL RASGTPIQHG MRRSPQGRAR RHVFFATRRA
QKVWSPVSNA VPTAGPPHGL WSKDDAQSAV PVAVVGAGTM GSGIALVLLQ AGFHVTLVDV
HAPALAKGME FLKRTLASMV QRRQLKPTQL TALEAKLRAT SNLQELSQCL LVVEAVVEKL
IVKQSIFATL DKVTPPTALL LSNTSTLDID AMASAVSSRR RGLFAGWHFF SPAHRMKLVE
IVRGSATSDD TTALLQALTK QIGKIGVVVG NCDGFCGNRM LKPYSAETVL LLTETQTTVA
AQDYAIRGVY GMALGPFEMA DLAGNDVGYN IRVERQWARR ARDDPLPPNR PARYTELPDV
MISDYGRLGQ KVGKGWYDYD SNIGKGRKPL PSSEMDALIR RYLAPKPSPA LVAAEIIERV
LYPLINEGFK CLEESIVRQP SDIDVVYVYG YGWPVWRGGP MYAADHEIGL PRLLRTLREL
SKQFPTTEHY VPSALLVECV ARKVTVEEYY QKNYHTASTG SAMLSKL