Gene PHATRDRAFT_28056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_28056 
Symbol 
ID7201888 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp942142 
End bp944703 
Gene Length2562 bp 
Protein Length757 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181110 
Protein GI219120756 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAAATTGCA GCAGATCTTT TGTGACACGT CCATAAATTT TGATAGAGCT TTATGCCAGC 
CGTCAAACTT CCAAAAAGCT ATCTCGATCA ACAATGAAAA TCACAACCAC CGCTACGCTT
GGATTCCCTC GGATGGGGCC GAATCGCGAG CTCAAGTTTG CTCTGGAAAA GCATTGGAAA
GGATCGCTGA GTGAAGTGGA TCTTATCAAG GTTAGCGAAG ATGTTGAGAC TATGGGTTGG
AAGCTTCAGA AGGAAGCCGG TATCGACCTG ATCACAGTGG GTGATATGTA CCTCTACGAT
TGTGTTTTGT TTTGGATTGA GTCGCTGGGA ATAGTGCCTC ATCGCTTCGA GAATTTGGCG
GCTGGGACGA CGCGTATGTT TTCCATGGCT CGAGGTGTTG ACGGTGCCGA AGCTCTAAGT
AAGATGCTCT TTCTTGATTC GCCTCATATT ATCCATTCGA CACGAGAATC CTCCTAACAA
GCCTCTTTCT TTTCCATGTT GTGCTAATGT GAATAGGCAT GAAAAAATGG ATCACGAGCA
ACTACCACTA CATGGTTCCT GAGTTCGACA AGTCATCTAA GATTGCGCCT GACTTTTCCT
CCTTCATTTC CAGTGTTGAG CGAGGCGTCG GGGTTCTCGG AGCTGAATGC GCCATACCTG
TTGTGTTGGG ACCCGTTTCC ATCTGCTTTT TTGCGCGCAT CGTCGACGAC GCTCTGACCA
CGCATGAGTT GATTGCTTTG TTGATTCCTG TGTACAAAAC TCTTCTGCAG AAGGTTGCCG
ATCTGGGAGT GAAAGAAATT CAGATTCATG AACCTGCAAT CGTCCTCGAA GAAGTTGGGC
TTATCGCCTC TTTGAAGACC GTTTATCCAG CCATCCTGCC GAAAGGCCCG AAGATTAATT
TCGTTACCGC TATGGACGAT GTAGGGAAAG CTAATTATGA TTGGCTTATC TCGGAGTCGA
ACGGACTTAA CATCCTTTCG ATGGACTTCA CCCGTGGAAA TACACTTGGA TTGATCGAAA
AGCGAGGATT TCCCTCCTCT AAGATGCTTG GCGCGGGCCT CGTTGACGCT CGCAATGTTT
GGAAGGTCGA CCCCGGTAAA GTCCTTCCGC TCACCGACAA GCTGAAGTCT CTAGGGATTG
AGTTTCGTGT ACAACCTTCT ACATCTTTAC AATACACTCC TTGGGACTTG GATCGTGAAA
TGCAATTGGA CAACCACCCG GCAGCTCAGG TTCTCTCCTT TGCAAAGCAA AAGTTAGACG
AACTGGTGCT TCTAGCGCAG GCAATATCGG GTGAGAAGTC AGCACTCGAT ACGCATACGG
CAGCTTGGAC CACCTTCCAC TCTGCCCATA CGGGCTCTAC TTTGACGAAG GAACGAATCA
AGCACTTGAA AGAGACTGAT TTTCGCCGTC CCGAGCCGTA CAAGCAACGC CGTCCGAAGC
AATTGGTCGG CGTACCACTC CTACCCACGA CCACTATTGG ATCCTTCCCA CAAACCAGCG
AAATCCGGCG ACTTCGATGG GAATGGAAAA AGGGTAGGCT GACCAACGAG GCGTACGAGA
GGGCCATGGA CCAGCACATT GCTTACTGTA TTGGTATCCA GGAAGCTATC GGCATAGATA
TCTTGGTGCA TGGTGAGCCT GAACGGACGG ACATGGTTGA GTTCTTTGGT CAACAAATGG
AGGGAATTCT CTTTTCGGAA CATGGGTGGG TCCAGTCATT CGGCTCTCGT TGCGTCCGTC
CTCCTATTAT CTGGTCAGAT ATTCAGCGAC CTAAGGCGAT GACAGTGCGC GAGTTCAAGG
TTGCCCAGGA TTTAACGTCA AAACCGGTGA AAGGCATGCT GACTGGTCCA ATTACTATTC
TGAATTGGTC TTTCCCTCGT GTCGACGTGT CTCGCAAAGA GCAAGCGTAC CAACTTGCTC
TTGCTATTCG GGATGAAGTA GCTGATCTGG AAAGTGCTGG CTGCAAGGTC ATTCAGGTGG
ACGAGCCTGC TCTTCGTGAG GGAATGCCAC TCCGAACCGC CCAAAAGGAA GAGTATTTGA
CGTGGTCTGT TGACGCCTTT CGCCTGGCAA CTGCGGTTGC TGCGAGCGAG ACACAGATTC
ACACTCACAT GTGTTACTGT GAGTTTAACG ACTGTATGGA AGCAATTGAT AGGCTCGACA
CGGACGTAAA CTCAATCGAG AACGCACGCA GTGACAACGC GACCTTAGAA GCCTTTCAGC
GCGTCGGATA CGAAAAAGGT TTTGGCCCTG GCTTGTACGA CATTCATTCA CCTGTTGTCC
CACCAATCGA CATCATGTAC GAAAAACTCA GCAGTTTTTT GAAGGTTCTT GACGTCGAGC
ACACCGTCGT CAACCCCGAT TGTGGTTTAA AGACGCGAGG GTGGCCAGAG ACGATCTTGG
CCTTGAAGCA CATGGTTGCA GCTGCTCATA ATGTTCGCAG AAATCTCGGA ATCGAAACGC
AGTAAGGTGA TGCCAGCAAT TATGTAGACG AGCATTGGTC TGACAGTTAA GATGTTTACG
TCAGATTCAT GGAGGCAAAA GTAGACCCAT TGGTGATTTC TC
 
Protein sequence
MKITTTATLG FPRMGPNREL KFALEKHWKG SLSEVDLIKV SEDVETMGWK LQKEAGIDLI 
TVGDMYLYDC VLFWIESLGI VPHRFENLAA GTTRMFSMAR GVDGAEALSM KKWITSNYHY
MVPEFDKSSK IAPDFSSFIS SVERGVGVLG AECAIPVVLG PVSICFFARI VDDALTTHEL
IALLIPVYKT LLQKVADLGV KEIQIHEPAI VLEEVGLIAS LKTVYPAILP KGPKINFVTA
MDDVGKANYD WLISESNGLN ILSMDFTRGN TLGLIEKRGF PSSKMLGAGL VDARNVWKVD
PGKVLPLTDK LKSLGIEFRV QPSTSLQYTP WDLDREMQLD NHPAAQVLSF AKQKLDELVL
LAQAISGEKS ALDTHTAAWT TFHSAHTGST LTKERIKHLK ETDFRRPEPY KQRRPKQLVG
VPLLPTTTIG SFPQTSEIRR LRWEWKKGRL TNEAYERAMD QHIAYCIGIQ EAIGIDILVH
GEPERTDMVE FFGQQMEGIL FSEHGWVQSF GSRCVRPPII WSDIQRPKAM TVREFKVAQD
LTSKPVKGML TGPITILNWS FPRVDVSRKE QAYQLALAIR DEVADLESAG CKVIQVDEPA
LREGMPLRTA QKEEYLTWSV DAFRLATAVA ASETQIHTHM CYCEFNDCME AIDRLDTDVN
SIENARSDNA TLEAFQRVGY EKGFGPGLYD IHSPVVPPID IMYEKLSSFL KVLDVEHTVV
NPDCGLKTRG WPETILALKH MVAAAHNVRR NLGIETQ