Gene PHATRDRAFT_54405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54405 
Symbol 
ID7200682 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp167932 
End bp170838 
Gene Length2907 bp 
Protein Length623 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179594 
Protein GI219117604 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTTCCGTCC AGCAAAATGC CACCCAAGCA CGAAAGTCAA GAAGACCTCA AGATGTCAAC 
ATCTAAACAA GATGAAGACG AAGTCCGCAC CATTGACTTT CTCGACCACG ACGATGGTAA
CCAAGGCAAC GGCTGGGGCC GTGGTATCGT GAAAGACTTC CGCAAGACTG TTGGAACGCA
CTGGGTCAAC GAAATGACAA ACTTTAACCA GAAGTCGATT GCTGTTTCCT TTTTCATCTT
CTTTGCGGCT GTAGCTCCCG CGATTACTTT CGGTGCCGTC TATTCCAAGG TGCGTTTTCC
GACTTGTGTG TGTGTATGGT ATTGGTCATC GCCACGTGTT CGTCTCTCGA TAGTATCGTG
TGTTCTTCGG GGTATCACCG AGTCACCGTC ACCTTGACGC TTCCTCACAT CCTCTTGTCT
TATCCGCATT TCCATAGACC ACCAATGACG CGATTGGTGC CGTCGAGATG CTCATTGCGA
CTGCTTGGTG CGGAATTGTC TACGCACTTA TTGGAGGACA GCCCATCATG ATCAACGGTG
GAACCGGTCC CGTTCTTGCC TTTAGTGCCG TGCTTTTTGA TATTGCGGAC AACATGGACG
TCAACTTTTT GACTTTGAAT GCCTGGACTG GACTCTGGGT TGCAGGATTC TTGATCATTG
CGGCTTTCGT TGATTTGAAC CGTCTAATGA AGCATGCTAC GCGCTTCACC GACGAAATCT
TTGCCCTGTT AATTGCGTCC ATCTTCGTGA TTGATGCACT TGGTAGTCCC TTTTCTGATG
TAGGTATTTA CTGGTACTTC ACCCGCAGCC ATGATTCGCA TGACGAATTC GAAGACCAGG
AAGACTACTC ATACATGGCC ACAGCGTTTC TCAGCGCCGT TCTCTGTCTG GGAACAACCT
GGTTGGCCTT CTTCCTGAGG GATATTAAGT TTTCGCCCTA CTTTCCCAAC GATTCTTGGC
GCACTCTCAT CTCCGATTTT GCCGTGGTTG CCTCCATTCT GATCTGGACT TTGATCGCCA
ACGGACTCTT CGACAATGTT GAAGTGGAGC GCCTCAATGT CCCGGATAGC ATCACGCCGA
CTCAAATCTG CTGCACCGCC GATTGCATGA CGTCGTTCCC CGATGACTGT CCCGACATTA
CACCGTACGG ACGCCGTTCC TGGATTGTGG ACCTTGGTGC CGTCAACGGA AAGTCCTGGA
TCCCTTTTTT TGCCGCCATT CCGGCTCTTT TGGCATTTAT CCTTGTTTTC TTGGATGATG
GTATCACCTG GCATTTGATC AATCACCCGA GCAATAAGCT TACTCACGGA GACGCTTACA
ATTGGGACAC GGTTGTTATT GCTGCTATGA TCGCCGTCAA CTCTATGCTT GGTCTTCCCT
GGTTGGTCGC CGCCACTGTC CGATCCCTCA CCCACGTCAA TGCTCTCGCC GAACGTAGTG
AGAACGGCAA AATTATCAGT GTGCAAGAAA CACGCTTGAC GCATTTGGGA ATTCACTTGC
TTGTGCTTGC TGCTCTCTTT GCACTGGATG TGCTCAAGCT CATCCCTGTG CCAGTCTTGT
ACGGAGTCTT CTTATATATG GGAGTGGCCA GTTTGGCATC CAATCAATTC TTCCAGCGCT
TCCTCATGTT TTTTATGCAG CCCTCCAAGT ACCCCCACGA GCCACACACT AAGTACATGG
CTCCTAAGCG CATGCACTTG TTCACAGGGA TCCAGCTTGG ACTTTTCGTA ATTTTGACAG
TATTTCGATC TATTTCTGTC ATTGCCATTG CTTTTCCGAT TGTCATTAAG GCTTGTATTC
CAGTCCGGAT GTACATCTTG CCTCGCTACT TCACCTCCGA AGAACTTCTC ATGATCGATA
CCGATGACAG CATCGTCAAC GAATATCTCG AGTACAAGGA GTCCAAAGGC GAGAAAGTAC
CCGTCCGTCA TTGTGGCGAA CAAGAGCCCG AGGAAGTTCC GATGCTTCAA GTGACTCAGC
ATCCTATCCG AATCGACGAT GGCAGCGATG AAGAAGGGTC CGTCGAACAG GTTTAATGTT
TTTCTGTTAT GTAACGGAAT GGAATTTCTA CTAGCTTTCA AAAGAGAAGG CTTAAAGTTT
CAGACCGTAG TATTGATTGC CACGTGTCCA CTAGAGCCAT CTAAACGCCT TCCTTCGCTG
CGAAAAAATG AATTTGAGCA TCCTTGTAGC CAGTAACCAT TAAAGGCGCT TGACCACCAG
TTGGGACATC TCAACATGAC AGTGGTAGTC TCGGTTTGTT GAAGCCGATG CTTACTAGCT
GTAAAGAGAC CCTCAATGCT CAAAAGGGAT GCTCTTTGGT ATCAACAGAT ATGTTTTCGA
ATGTGCTTAT TAATAGACAA CATAAACGAA AGTACTTTCA AGATACAATT TGTATACAAT
GTCTGCAGTT GAAAAAGGTG GACTCACGTC CCATCGTGGC CCAAATCGAT CTACACTGCA
ACTCCAGATA GGAAACCGTT TTTCTGAGAG TAGTATGCGG TGGAGAGCTT TGCTTCTCGC
CAATACGATT CTCCAACAAC TCTACGCAGA CGAGGTACTA CCGCTTTATG GAGTGGCGTA
CATAGAGGAT CACCTTCTCT GTTAAGTTGG ATGCAATCAG AGATAACTGC CTTGGCTCGA
TTCCGAAGAG CCGCGTCGCA CGGATCCCTA TCAAGAGCAG CGAAGAGTTT TCTTACGAAA
ATTAGAAATC GCTGTCGTTC AGACAAGACT CGTCGGATTG GCATAGGAGA TCTACCCGGA
AGGTGGGAGG TACGCTCGGA AATATCGTTT GGCAACATGT TTTACCGGAC TTAAAGAAAA
ACCTTCGCTT ACGCAGTCTT TGGTTGGGGC GAGCTACGTA GAGAATTCTA ATGGAGCGTT
TGTTGATTAT TGAACTCGGT CTGCTTT
 
Protein sequence
MPPKHESQED LKMSTSKQDE DEVRTIDFLD HDDGNQGNGW GRGIVKDFRK TVGTHWVNEM 
TNFNQKSIAV SFFIFFAAVA PAITFGAVYS KTTNDAIGAV EMLIATAWCG IVYALIGGQP
IMINGGTGPV LAFSAVLFDI ADNMDVNFLT LNAWTGLWVA GFLIIAAFVD LNRLMKHATR
FTDEIFALLI ASIFVIDALG SPFSDVGIYW YFTRSHDSHD EFEDQEDYSY MATAFLSAVL
CLGTTWLAFF LRDIKFSPYF PNDSWRTLIS DFAVVASILI WTLIANGLFD NVEVERLNVP
DSITPTQICC TADCMTSFPD DCPDITPYGR RSWIVDLGAV NGKSWIPFFA AIPALLAFIL
VFLDDGITWH LINHPSNKLT HGDAYNWDTV VIAAMIAVNS MLGLPWLVAA TVRSLTHVNA
LAERSENGKI ISVQETRLTH LGIHLLVLAA LFALDVLKLI PVPVLYGVFL YMGVASLASN
QFFQRFLMFF MQPSKYPHEP HTKYMAPKRM HLFTGIQLGL FVILTVFRSI SVIAIAFPIV
IKACIPVRMY ILPRYFTSEE LLMIDTDDSI VNEYLEYKES KGEKVPVRHC GEQEPEEVPM
LQVTQHPIRI DDGSDEEGSV EQV