Gene PHATRDRAFT_24353 details


Gene Information

Locus tag: PHATRDRAFT_24353
Symbol: AroA
ID: 7196691
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 278070
End bp: 279983
Gene Length: 1914 bp
Protein Length: 500 aa
Translation table:
GC content: 51%
IMG OID:
Product: 3-deoxy-7-phosphoheptulonate synthase
Protein accession: XP_002177054
Protein GI: 219110605
COG category:
COG ID:
TIGRFAM ID:
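
The start and end coordinates above can be cross-checked against the listed gene length. The sketch below assumes the coordinates are 1-based and inclusive, which is an assumption and not something the record states; the helper name `gene_length` is hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields.
length = gene_length(278070, 279983)
print(length)  # 1914

# A 500 aa protein needs 500 codons plus a stop codon.
coding_nt = 500 * 3 + 3
print(length - coding_nt)  # 411 nt not accounted for by the coding sequence
```

Under that assumption the result matches the listed 1914 bp. The 411 nt not covered by the coding sequence would be non-coding sequence such as introns or untranslated regions, which is plausible for a gene marked as spliced.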


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACCAATCAG TCAACGCCTC TATAGTCGGT ACCTCCCTAC CGTATCCTTT CGCGTGACCA 
TGATGCTGAA ATTAGCAAGT GCCCTATTGC TGCTGGCATC GTCAGAGGCT TTTACGCCCC
AGTCGTTAGT GGGGTAAGTT ATGATCCGAT TCTAATATTC CGTGACTGTG ACCGATTTGT
TCCTTGGAAT GCAATAGTGA TAGGTTTTGC ATTTTTCGCC AAAACTTTAT TTGTCGAAGC
GAATCTTGTA TTCACGTGTA AGGACACATA TCGTTGGCAA TTGTTCCCAG CGTAAGACTC
ATGGTGCTTC GATCTCGTGT TGTTCTTGTT CATTAGACGC CGTCAATCGG CGTTGTCGGT
GGCGACGGAA CCGCCCGCTT CGACGGCTGC CACTACCGGG GGCAATACCG ACTGGAGTCC
TCGTTCGTGG AGAGAAAGAG AAGTTCAACA AGGACCAAAC TACGAAGATG AAGAAGAATT
GGAACAAGCC ATCGACACCA TAAAGAAGTT TTCTCCTTTG GTCTTTGCGG GTGAAGTTCG
TTCCTTGCAC GAGCAGTTGG CTCGTGCCTG TTCGGGGCAA GGCTTCTTGC TGATGGGCGG
CGACTGTGCC GAAGCGTTTA ACGAATTTAA TGTCGATCAC GTCCGTGACA GTTTCCGTGT
CCTTTTGCAA ATGGCTCTGG TGTTAACCTT TGGTTCGGCC ATGCCAGTCA TCAAGGTCGG
TCGCATGGCT GGTCAGTTTG CCAAGCCTCG CAGTGAACCG GATGAGGTCC GTGACGGTGT
GGCGTTGCCT TCGTACCGTG GAGATATCAT CAACCGCGAA GAGTTCACAC CGGAAGCCCG
TCGCCACAAT CCCAATAATA TGGTCGAAGC GTACCACCAA TCGGCGCAGA CATTAAACAT
TCTCCGTGCC TTTTCAACCG GAGGATACGC TGATATGAGT AGGCTACACG CCTGGAATTT
GGACTTTGTT GAGACGACGG ATGAGGGTAG CCGGTACGTC CGTCTTTGCA GCCAAATGTG
TATGCCGCGT TTTGGATCCA GAAACTCACC GCTCGATATG CAAAATTTTA TTTCCTTCTC
TCAGCTACCG AAAGTTCGCG ACCAAGGTTG ACGAGTCGCT CCGTTTCATG AAGGCCATTG
GTGTCGACAC CAGCAGCCCC ACCTTTACCA AAACCGAGTT TTACACGGCT CACGAATGTC
TTTTACTGCC GTACGAAGAA GCCCTGACCC GCAAGGACTC AACTACCGGA CGCTACTACG
ATTGCTCAGG CCATATGCTG TGGGTTGGAG AGCGCACTCG TCAATTGGAT GGTGCCCACT
TGGAATTCGT ACGCGGAATT GGAAATCCGC TGGGAGTCAA GATTTCGGAC AAGTGCACAC
CGGAAGAACT CATCCGCATC ATCGATACTA TGAATCCCCA AAATATTCCC GGGCGTCTGA
CGATTGTCGT CCGCATGGGG GCCGAGAAAG TTCGCAAGAA TCTTCCAGCC TTAATCCGTG
CCGTACAACG TGAGGGCAAG TCGGTCTTGT GGATTTCCGA CCCTGTTCAC GGCAATACTT
ACAAGACCGA TTCTGGTATC AAGACGCGCA ACTTTGACGC AATCCGTGAC GAGCTTCGTG
CTTTCTTCGA CGTGCACGAC GAAATGGGCA GCCATCCCGG TGGCGTGCAT TTGGAAATGA
CCGGAGAAGA TGTGACCGAG TGCACGGGAG GAATTAGTGG CGTGTCTGAG GATACTCTGA
ACGATCGCTA CCACACGTTT TGTGATCCTC GCTTGAACGG AGCTCAAGCT TTGGAGCTGG
CCTTTTTGAT TGCCGAGCGA ATGCGTCTGC GAACTGGACT ACCACCGATC GAGTAAATTG
TGTAAGGACT ACATAAACAG AAACTTACAA AGAATTAGGA TTTGTGGAGC GAGA
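
The gene sequence is displayed in space-separated groups of 10 bases, so the spacing has to be removed before computing a length or GC content. A minimal sketch of that step follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the page itself computes its GC content figure is not stated.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base groups of the gene sequence, used as a short example.
example = "CACCAATCAG TCAACGCCTC TATAGTCGGT"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))  # 30 0.5
```

Passing the full sequence block through `clean_sequence` and checking the result against the Gene Length and GC content fields is one way to confirm that the sequence was copied intact.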
 
Protein sequence
MMLKLASALL LLASSEAFTP QSLVGRRQSA LSVATEPPAS TAATTGGNTD WSPRSWRERE 
VQQGPNYEDE EELEQAIDTI KKFSPLVFAG EVRSLHEQLA RACSGQGFLL MGGDCAEAFN
EFNVDHVRDS FRVLLQMALV LTFGSAMPVI KVGRMAGQFA KPRSEPDEVR DGVALPSYRG
DIINREEFTP EARRHNPNNM VEAYHQSAQT LNILRAFSTG GYADMSRLHA WNLDFVETTD
EGSRYRKFAT KVDESLRFMK AIGVDTSSPT FTKTEFYTAH ECLLLPYEEA LTRKDSTTGR
YYDCSGHMLW VGERTRQLDG AHLEFVRGIG NPLGVKISDK CTPEELIRII DTMNPQNIPG
RLTIVVRMGA EKVRKNLPAL IRAVQREGKS VLWISDPVHG NTYKTDSGIK TRNFDAIRDE
LRAFFDVHDE MGSHPGGVHL EMTGEDVTEC TGGISGVSED TLNDRYHTFC DPRLNGAQAL
ELAFLIAERM RLRTGLPPIE