Gene PHATRDRAFT_50551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50551 
Symbol 
ID7199382 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011699 
Strand
Start bp89639 
End bp93175 
Gene Length3537 bp 
Protein Length564 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185518 
Protein GI219130744 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0179423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCGAAAGCTC GGCAACCTCT CATATTCCAT CAAGCCGAAG GGATTGGAAG GATTCGTTCA 
TTCTTCACAG TCAGCCGAGA CCGTTGGTAC GGTTATCGTC TCCATTTGCG GAAAGGCTGA
GCACAACCCT CGATCTCGGT CGAGCTTTTC TGCAACTATC ATCACTAAAT TCTATCCTTA
TAAGCATGTC AACCCCAACA TCTTCCTCGT CCTTTGCGTT GGCGAAAGTT CTTCCCTCGT
CGGTCACGGC ATGGATGGCC GATCGCCCAC ACGCCGTCGA CACGATCATG TTGTTTGTGG
CCTTCCAGAT CGCCTACGCC GCGACGAATC CCAGCATACA GTGGCAGTAC ATGGCGATTT
ACGGTCTCGG TCTGTTGCTC GTAACGAAGG TGGCTCATTC GCCCTTGGAG TTCTTCAAAG
GTGGGATCGC TGACACGGCT ACCGATCGCA GTTCCTACGC AATCTTGGCT GGTAGTACCT
TTATCAGTTG GATCTTTGCC AAGAGCATCC AAAACGCCAG TATCCTCGGA GCCAGATACG
GTATCCTCGG CGGCTTTGCC TACGGCACTT GGTACATAGC CTTCCTTTCC GTTGGCGTTG
TTTGCTACTA TTTGCGAACC AATCAGGGCT ATACATCCCT GCAAGAAGCA ATCTTTGAGC
GGTACGGGTC GATCGCATCC ATTTCCTACA GTCTTGCTGT ACTTTTTCGT CTCTACCAAG
AAATCTGGAG CAACTCGCTC GTGGTGGCTT CTTTCTACGG CGACTACAAT ACGGCTTCCT
GGTGGATTGC GGCTCTTTTG TCCACATTCA TTCCCTTTGT CTACGTTTCC TTGGGAGGAC
TCCGGTCCTC GCTAATCTCC GACGTCATTC AAGCTCTTCT AGCTGTCATT CTTCTGGTGA
CGGTACTCGG TGTGATTGGC AAGCAAGTTA ACGAACTGTC AGACGAGTGT GAGGTCGCTG
GACGGGGCGA CTGCAACCTA TTCCAGTGGG ATACCAACGT GGGCGTTGCT ACCAATACTT
TGGAAGGGGG TTGGGACTTG GCGATAGTCG GCTTGATTCA GGGATTGTTC AGCTATCCTT
TCTTTGATCC AGTTCTCACG GACCGTGCCT TTTTAGCGAG CCCTAAAACA ATGTTGCGAG
CCTTCCTTAC TGGCGGCGTC ATTTCCTTCC TGTTTATATT TTTCTTCGGT TTCATTGGCA
TTTTCGGCAA TCTTGCTGCA ACCGTGGATG AGACAATTGA CCCTGCTCTC CTTACCGGAA
TTAGCACGGG AATTCCTGCC GACGTGGCTC GCTACCTCGG TACCGGCGTC TTTACCATTA
CCAATATTAT TTTCATGACA ACTTCTATCA GTACGCTGGA TTCGGCGTTT GCTTCGACGG
CCAAACTTTT TGCCGAGCTA CGCACATTCT TTTGGAAATG GAAGCCGGAA AAACTCGCCA
ACGTAACCGA TCAACACGTT GCGCTGGGCC GCTTTGCTAT TCGTTTTATT GCCCTCTTGG
GCACTTTACC TTTGTTACAG GACCCCAGTG CCCTCGACGC GACGACGGTC AGCGGTACAG
TCGTGCTCGG ATTGGGACCA CCTATCATGG CGCTCTATGT TCTTACCGCG TGAGCCCAAC
AGCTACTACC CACTGGCCTT TTTGAGTTCG TTTTGGTTGG GAGCGTTGTT GGGCTTGTTG
TTTCAGTTGA GTAACGAAAA CCCCAACGCA CTAGATTGGC AACACCTCAC TGTCGGCGAG
GGATCGTACG CCAAACTCCT ATGGTTCAAT TTAGTTGGCT CTGTGGCTAC CTTGGGAGCT
TTTGTCGTTT TCTTTGGCCT AGAAAAGTAT GTGCTCTCTA AAGTGGTTCC CTTGTACGCT
TGGCACGCTC AAGTCCTCGA AGTACACGAG GGGAAGTCGG TGCACGGGAT GGGCAACGAT
GAGCTGTCAC ACAAGTTGGA TCGATCCAAC GAAGATCACG ACAAAGAATT GGAGAGCATC
GACAGCAGTA GCGAAGAAGT CGCCTCGGGT GGGAGCGCTG GAGAAGAAGA CAACGACACG
AGCACCAAGC TGGAGGCCGA CAAAGTTTGA AACGTGATGG CCGGGAATTG CTGGTGTTGG
CCTACGGAAG CAGTGCACGA ATGCAAGCAG GTTAACGACA GTCAACAAAC ATAAATGTGT
TTGTATATAC ATTACCGATT TACAATACAA AGCTAAAGCT TGTAGAGTTC TATAGAAATC
GTCAATCAAT CAAGTGCAAT CATAGTCCAA GAGCTTCTTT TTGTAGTCGA AATAGGTCAA
CGGCCGCCTC GTATAGCGCC ATGTCGAGCT CATTGTGTTT CCGAATGAGC TGTGCGGTTG
CGTCGTCGGG TCGCGACGGC AAGTCCCAGT GCGTGCTACC ACTTCGGTCC CTTGCGTTCC
CGGTACGGCT TCCGCAATGA TTGTTTTGTG GCGACGCGTT GGAGTGCGGC AAGCCGCACA
CGGTGCCGTT CACGTCCGTG GCCAACCAGG GAAAAACGCG CCCCACCATG GCGGCGGTCG
TGTTCAACTC TTCCGTCAGG CCCACCATAG TAAAGAAATC GTGCATGTTG CGAATCGCTT
CCGATATGAT TTGCTGTCGT TGTTGCGGAG TCCCGCGGGC TGTGGCGATG GTGGTTTCGT
TGAAATTTGT CGAACTCAGG AGATTGTTGG TTTGGTGATT CTGCAGTTGC AGGGCACACA
TCCGATCAAG AGTGGAGTTA CCGCTGTCGA TTTCCGCGTA GACGTCCTTC AGATCCCGAC
AGCCGTAACA CGCTTTGGTT CGGAAGCGAT ACATGCTCCA GACACGGTCT ACGGGATGAC
GCAGGACGGT CACGGCCCGA ATGGGCGTCG TATCAGTGGA AGGAACGATA CCGCCTTCCT
CACCATTATC GGATCCACCA CTGGTGGTCC GAGTCCACTG AAACGTAGTC AGATCGCGGA
GCGGGGCACA GTAACTCATA ATAGCGGCTT CTTGAACCTT TCCGACACAC TGTGTATCGT
CTCCGGACAA ACAGCGCGCG TAACGCGCCG CACTGCATTC GTGTATGTTT TGGTAAGGAA
GTGTCCCGTG TTGCGATCGA TAGCGTTGCA TAGCGCAGCG AATCAGTCCA TCCATCGAGG
TCCCACCCGT CTTCATGTGG TGCAGGTGCA GAAATTGTCG CGGTTGTACG GGGCCTTCCT
CCGTGTCGGA CGTGCGTAAC AGATCGTGCG GTATCCCCAG TTTGGCTCGG AGTTGCTCCA
CGACTTGTTG TCCCTGGGTA TTACCATCCG GATCCAAAGG CACCGCCTGT GCGAGTGTCT
CGGAAACGGT GGACGGCTGG TAGAGACGTG TGGGTGTGGA AGAAGAGAAT ATATGTGTAC
GCGATTGGTA CGTGGCAAAC AAGACAGCCA GCGATACCGC GAAGGTAGCA ACGGCCACGA
CGGGACGCAT CGGAAGTGTG ACCAGTCTCG TACGGTGTGG CACAGAACGC CGTCGTTGGA
GACAGGAATG GGGTGGACAG TAATGAACAA GGCAAGCGAG ATTTGTGTAG CAGCTTC
 
Protein sequence
MSTPTSSSSF ALAKVLPSSV TAWMADRPHA VDTIMLFVAF QIAYAATNPS IQWQYMAIYG 
LGLLLVTKVA HSPLEFFKGG IADTATDRSS YAILAGSTFI SWIFAKSIQN ASILGARYGI
LGGFAYGTWY IAFLSVGVVC YYLRTNQGYT SLQEAIFERL AVLFRLYQEI WSNSLVVASF
YGDYNTASWW IAALLSTFIP FVYVSLGGLR SSLISDVIQA LLAVILLVTV LGVIGKQVNE
LSDECEVAGR GDCNLFQWDT NVGVATNTLE GGWDLAIVGL IQGLFSYPFF DPVLTDRAFL
ASPKTMLRAF LTGGVISFLF IFFFGFIGIF GNLAATVDET IDPALLTGIS TGIPADVARY
LGTGVFTITN IIFMTTSIST LDSAFASTAK LFAELRTFFW KWKPEKLANV TDQHVALGRF
AIRFIALLGT LPLLQDPSAL DATTVSDWQH LTVGEGSYAK LLWFNLVGSV ATLGAFVVFF
GLEKYVLSKV VPLYAWHAQV LEVHEGKSVH GMGNDELSHK LDRSNEDHDK ELESIDSSSE
EVASGGSAGE EDNDTSTKLE ADKV