Gene PHATRDRAFT_21513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21513 
Symbol 
ID7202384 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp255069 
End bp258441 
Gene Length3373 bp 
Protein Length1035 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181518 
Protein GI219122368 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.275387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCCTTTTTC GACCAGAAAT TGGTTGTGTC ATCGCCTTCT GCGCTGTTGT TGCCCGACAG 
CATAGTAGAA GTCAACTCTG CTTGCTGTGA AAGAAACAAT TGTCTTCCTC GGACAGAGCA
CTTTCTGGCT TTGATCGCCT CACGCACGCT CAAGCCATCT TACTATCCAC TCCCGTGTCA
TGCCCAGCAT CCTCCGCAAG CCTTCCTCGT CATCGTCTCG TCCCCGTAGG AATAGTATGG
TGACAATGTG GCTTTATTTT TCCACCGGCC GGACGGCTCT GGCGTTGGTA CCAACTATTA
ATTTTAGCGG TGCACGGAGC CTTTCGCGAG AAATCCCTTC ATTTGGTGCG GCTTTTGCGA
GGACATCTCA TCTTTCGACC CGCCGGTCTC GAGTCGGCCA AGCCTTTATG TCCACCACCG
CCGACCCTGA TACATCGAAA ACGACCGAAA CCGACAGAGC CTACGATATA ATCAACAAGG
AGCGTGGACT AGACAAATAC GAGCCTGCTT CCTTCGAGTC CGACATTTAT CGTTGGTGGG
AAACTGCGGG TTGCTTCCAA CCCGACGCCA AGCAAAAGGC GATAGACGAC AACGACGACA
GCACATCCAC CGCACCGTAC GTCCTTCCCA TGCCCCCGCC TAACGTGACG GGGCGCTTGC
ACATGGGGCA CGCCATTTTT GTCGCCCTCC AAGACGTTCT GGCCCGCTTT CACCGCATGC
GCGGTCGACC CGTGCTGTGG TTGCCCGGCA CCGATCACGC CGGTATCGCT ACGCAACTGC
AAGTCGAAAA ACTTTTGATT GCGGAAGGAA CAACGCGAGA AGAAGTGGGT CGCGACGAGT
TTTTGCGACG CGTTTGGATG TACAAGGAAG AACAGGGAGG ATTCATAACG TCGCAATTGC
GGTCACTAGG GGCGTCGGCG GATTGGAGTC GGGAGCGCTT CACGATGGAT GACGATTTGT
CGCAGGCAGT TGTCGAAGCC TTTTGTCGCC TACACGAGAA AGGTCTTGTA TACCGTGGGG
AATACATGGT CAACTGGGCA CCTTTACTTC AGACAGCCGT TAGTGACTTG GAAGTAGAAT
ACAGCGAGGA GGAAGGTAAA TTGTACTACT TCAAGTATAT GGTTGAAGGC AGTGAAGGTA
CGTAAAGACA AGTGCAAAAT GTCTACAAAA TTAGCTGCTA TCTCACAAGT ATTTGGGTTT
CAGAATTTAT ACCAGTCGCT ACGACGAGGC CCGAAACCAT TTGTGGAGAC ACGGCTGTTT
GCGTGCACCC CGAGGATGAG CGGTATAAGC ATCTAGTTGG AAAAGCTTTG GTGGTACCAA
TGAGCGGAGG TCGTACCGTG CCCGTGATTG CTGACGAGTA CGTGGATATG GAGTTCGGGA
CGGGAGCGCT CAAAATCACT CCAGGTCACG ATCCCAACGA CTATACCCTC GGTAAAAAGT
TTGATTTGCC GATTATCAAC ATAATGAATA AGGACGGTTC GATGAACGCC AATGCGGGCC
AGTATGATGG TCTCGATCGC TTCGAGTGTC GTCAACAGTT GTGGACCGAC ATGGAAACCG
AAGGTCTCGT AATAAAGGCC GACCCGCACA CGCAACGAGT TCCGCGATCG CAACGCGGCG
GAGAGATAAT TGAACCTTTG GTAAGCAGCC AGTGGTTCGT CAAAACAGAA GGGATGGGCG
CCAAAGCTCT GAAAGCTGTG GAAGATGGTG ACATCAAGAT AGTTCCGCAG CGCTTCGATA
AAATTTGGAA TAATTGGTTG ACCGACATTC ACGATTGGTG CGTATCACGA CAGCTATGGT
GGGGTCACCG TATTCCGGTC TGGTATGTTG GCGAGACAGG CGAAGACGAG TTTATAGTGG
CGCGGAACGA GAAGGAAGCT CGTGAAAAGG CGGTGGCAAA TGGTCACTCC GCAGACGTTG
TACTCCGACA AGAGGAAGAT GTGCTCGACA CGTGGTTCAG CTCAGGCCTG TGGCCGTTTG
CGACGGTTGG CTGGCCTCAA AACGAAGGAG TCAAGGGTTC GGATTTTGAT CGCTTTTTCC
CTGCTTCTTG TTTAGAAACA GGCTACGACA TCATCTTCTT TTGGGTAGCT CGTATGGTCA
TGATGGGTAT TGAGCTCACC GGGAAGAGTC CATTCAGTGT GGTGTATTTG CACGGCCTTG
TCCGTGCCGC TGACGGAAGT AAAATGTCCA AAACCAAAGG CAATGTGCTG GATCCTTTAG
ATACTGTTGC TGAATTCGGC GCTGACAGTT TACGCTACTC TTTAGTTACG GGTGTTACTC
CCGGACAAGA TATTCCGTTG AACATGGAAA AGATTGAAGC GAATAGAAAT TTTGCCAACA
AGCTCTGGAA TTGTTGTAAG TTTGTTACGG GAAACGCACT CAAAGATCTT TCAGACGAGG
ACTTGGCAAG TCTGGCCGTA TCCGGTCCAA TCGAGCAGGA AGAGTTCGAT AGCCTTTTGC
TACCGGAGCG ATATATCATC TCAAAGTGCC ACACTTTGGT AGCAAGCGTT ACACAAGACA
TTGAGAAATA TCAACTCGGA GCTGCCGGTA GCAAAGTATA CGAATTTTTG TGGGATCAGT
TTGCCGACTG GTACATTGAA ATTTCCAAGA CTCGCTTGTA CGAGGGCGCC GGTGGGGGTG
ACAATATTGA GGAAGCACAA GCCGCTCGTC GAGTTTTGGT GTATGTTTTG GACACCAGTT
TGCGTCTGCT ACATCCCTAC ATGCCGTACG TAACCGAACA GTTGTGGCAC CACTTGCCTC
GTGCCGACGC TGGCCCGGAC CAAGCTGCAC ACGCACTCAT GTTGGCGAAC TGGCCGCAAA
TGAACGACAA CGTGCTGACC ACGAGCGAGG CCGCTGTGGC CCAATTTGAA TCTTTCCAGG
CATTGACCCG AAGCGTGCGC AATGCCCGCG CTGAATATAA CGTGGAACCG GGCAAACGTA
TTGCTGCTGT GATCGTGGCG CGCGGCAAAT TGAAACAAGC GATTGAAAAA GAGCTCAAAT
CGCTCATTGC ATTGGCGAAA CTGGATCCGG AACAAACGCT AATTTACGAA GCAGGGTCGG
AAGAAGCGAG ACAGGCAACG CAGGTGGAAT CAGTCCAAGT CGTAGTCCAG GACGGTGTAG
AAGCCTTTCT GCCGCTGTCG GGATTAATCG ATCCGGAAAA GGAACGTTTG CGTCTCGAGA
AACGCCGCGA GAAGCTGGAG AAGGAAATCC AAAAACTTGC AGGGCGCTTG CAGTCAAAAG
GATTCGTGGA CAAGGCCCCC GCCGATGTTG TGGAGAAGGC CCAGGCAGAA CTGGCCGAGC
TGGAGGATCA AGCTGGTAAG GTACAAGCTA GCTTGGAGAC TCTGACCCAA TAGTAAAAAC
AGATTTTTAC TCC
 
Protein sequence
MPSILRKPSS SSSRPRRNSM VTMWLYFSTG RTALALVPTI NFSGARSLSR EIPSFGAAFA 
RTSHLSTRRS RVGQAFMSTT ADPDTSKTTE TDRAYDIINK ERGLDKYEPA SFESDIYRWW
ETAGCFQPDA KQKAIDDNDD STSTAPYVLP MPPPNVTGRL HMGHAIFVAL QDVLARFHRM
RGRPVLWLPG TDHAGIATQL QVEKLLIAEG TTREEVGRDE FLRRVWMYKE EQGGFITSQL
RSLGASADWS RERFTMDDDL SQAVVEAFCR LHEKGLVYRG EYMVNWAPLL QTAVSDLEVE
YSEEEGKLYY FKYMVEGSEE FIPVATTRPE TICGDTAVCV HPEDERYKHL VGKALVVPMS
GGRTVPVIAD EYVDMEFGTG ALKITPGHDP NDYTLGKKFD LPIINIMNKD GSMNANAGQY
DGLDRFECRQ QLWTDMETEG LVIKADPHTQ RVPRSQRGGE IIEPLVSSQW FVKTEGMGAK
ALKAVEDGDI KIVPQRFDKI WNNWLTDIHD WCVSRQLWWG HRIPVWYVGE TGEDEFIVAR
NEKEAREKAV ANGHSADVVL RQEEDVLDTW FSSGLWPFAT VGWPQNEGVK GSDFDRFFPA
SCLETGYDII FFWVARMVMM GIELTGKSPF SVVYLHGLVR AADGSKMSKT KGNVLDPLDT
VAEFGADSLR YSLVTGVTPG QDIPLNMEKI EANRNFANKL WNCCKFVTGN ALKDLSDEDL
ASLAVSGPIE QEEFDSLLLP ERYIISKCHT LVASVTQDIE KYQLGAAGSK VYEFLWDQFA
DWYIEISKTR LYEGAGGGDN IEEAQAARRV LVYVLDTSLR LLHPYMPYVT EQLWHHLPRA
DAGPDQAAHA LMLANWPQMN DNVLTTSEAA VAQFESFQAL TRSVRNARAE YNVEPGKRIA
AVIVARGKLK QAIEKELKSL IALAKLDPEQ TLIYEAGSEE ARQATQVESV QVVVQDGVEA
FLPLSGLIDP EKERLRLEKR REKLEKEIQK LAGRLQSKGF VDKAPADVVE KAQAELAELE
DQAGKVQASL ETLTQ