Gene PHATRDRAFT_46320 details


Gene Information

Locus tag: PHATRDRAFT_46320
Symbol:
ID: 7201504
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 916898
End bp: 920214
Gene Length: 3317 bp
Protein Length: 1022 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002180731
Protein GI: 219119962
COG category:
COG ID:
TIGRFAM ID:
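
The Start bp, End bp and Gene Length fields above are consistent with 1-based, inclusive coordinates. A minimal sketch of that arithmetic follows; the function name `gene_length` is hypothetical and the inclusive-coordinate convention is an assumption inferred from the listed values, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints belong to the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(916898, 920214))  # 3317
```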


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAAAGGATCC CTTGGTCGTC GAGAAGACTG GCTGGTGTTA TCATCGTCGG TGTGTAGTAT 
CGTTATCGTC ATCATGGCGA CTTCCAACTT TTTCCACAAG CCCGAGTTGG CCTTGCGACG
CGCGCAGGAA CTCGAAGGCA TCCATCAGGA CGACGCCGCC TTGGCCATGT TGCACGAAGT
GCTATCCTCA CGCCGTCATC GTACCTGGAG TCCAGCCTTT GAACAAATCA TGATCGTGTA
CGTTGATCTC TGCCTCAAGC TGTACAAAAC ACGCGAAGCC AAAGATGGTC TGCATCAGTA
CCGTAATTTG AGTCAAACAC AGGCTCCGGG ATCGTTGGAA AAGGTGATTC GCTACTTACT
CACCGCGGCG GAGAAAAAGT GTACCGAAGC CAAAGCTTCC GCGGACGCCC AGCAGGATCA
GATTTTGGAA GAGAATGGGA ATGGCGACGA CGAAGACGGT TTCCGGGCTT CTCCGCAGGC
AATCTTGCTA TCCACCATGT CCACGGACCC GGCCAAATCG CAACGCGATT CCGCCCTCCT
CGTGCCGTCC CTCAAGTTTC TCTGGGAAAC CTACCGAGCC GTTCTGGACA TTTTGCGATC
CAACTCCAAA CTAGAACACG TCTACCACTT TGCGGCACAG AGCGCCTTGC AGTTCTGTCA
AGATTACAAA CGTCGAATGG AGTTTAGGAA TCTTTGTGAC ATGCTCAGAC TCCATCTCGG
CAATCTGCGC CAGTACGGCA ATCTCGACGC CAACGATGAC GGAAAAACCA ACAACAAGGT
ACGTATGCCA CGAATTGACC TTTCGCTCGA CAATCCCCCG AAACGGTGCA CTCACACTGC
TCTTAAACTT TGGTGGGTCT CGCGTTTGCG CCATTAGGTC CGCGGTTGGG AAGGTTGGAC
TACGGAATCG ATCGAGTTGC ACCTACAAAC GCGCTTCATA CAGCTCGAAA CCGCCAGCGT
CCTGCACCGC TACACCGAAG GATTCCGGAC ATGCGAGGAT ATTTTCAATA TTCTCCAAAT
CAGCCAGGCT CGTCGCAAGC ACAATCCCGA CGTGCCCGCA CCCAAGGCCA AGCTCATGGC
ATCCTACTAC GAAAAGCTCA CGACCCTGTT TTGGGTTTCG GAAAATTACC TTTTCCACGC
CTTTGCCTGG TACAAGTACT ATACCTTGTG CAAAGAGTTC AATCGTGGCA TGTCGGAAGA
CACGAAGCGT ATGCAGGCGT CGGCCGTCCT ATTGGCCGCC TTGTGCATCC CTTCCGTCCC
CGCGAACGAA AGCAAGGCCT CGCATTCCAA CCAACATACT ATTGCCACCA CGGTCGAAGA
CGGGATAGTG AAGCAAAAAA TGGCCCGCAT GGCTACCTTG TTGGGCTTTC ATACGCGCAA
CCCGACCCGC GACGCCTTGC TGGCGGAAAT CCGTAGCAAG GGGATTCTGG AACAGGCTCC
GGCCTACTTG AGAGAACTCT ACGAGCTTTT GGAAGAAACC AACGATCCGC TTATTATGGT
TCAAAAGGCC AAGCCGCTTT TGGAACAGCT GCAGCAAGAG CTCGGGGCCA CCACGTCTAA
CGATGTCAAG AACGACGACG TCGATGACAC GACCTTGGGT CGCTACGTTA AGCCCATCAC
CAACGTGCTG TTACTGAAAC TCATTCGCAA CTTGTCGGCA GCTTACCACA CGGTATCAAT
GGATCATTTG AAATCTCTCA CGTCTGGTCT CGATCTGAGC TTCGAACAGG TGGAAAAGAC
CATCGTCACG AGTTCAAAAA CGCTGGCTGT GCGTCTGGAT CACCGTGCCG GCTGTCTACG
TTTCGGTAAC GTGCAACTGG AATCGGACGC AATGCGCTCG CAGCTGGTAA ACTTATCGAA
GCAGCTACAG GCCGTGTCCA ACGTTCTGAC CCCACCGGAT CGGCAAAGCG TGTTGCAGTC
CCGACTGTCC ACGTACCAAT CGGTCCGCGA AAATCTCCAC GCCGAACACG CGGCCGTGCT
GGAACGTAAA AACTTGATCG AAACGCGCAA AGAAGAAACC GAACGAGTAG CGCAAGAAAA
GTCTAAGCAA GAAGCCCGTG TCAAGGCGGA AGAGGAAGCA GCGCGTAAGG CCGAAGAAGA
ACAGCGCATC GTGCGGGAAC AGCGCTTGCG CGAGATTGAA AAGCAGCGCA AAATTCAACA
AGAGTTGGAC AATCAAGAGA AGAAACGGTT TCTGGCCGCC ATGGGAAAAA AGACGGAGGA
TATTTCGGAA GAGCAAATCG CCAAGATCGA TACGGAAGCC TTGCAGCGGG AGCACGAAGC
AAAGATCAAC AAGGAACGTG AGGAAGCCGA ACGCAAAACT CGTGAGACGG CCAAGAAACT
GGATTATTTG GTCCGGGCGA TTCGTATCGA GGAACTGCCT CTGATCAAGA AGAAGTACGA
AGAAAAGACG AAATTGGACA AGGAGCGGTA CGAACAAGAG AACATCGAGA AGGCGCAAAA
GGCTAAGTTG CAATGGGAAG CCGATGTGAA AGACAAGGCT GTGTTGGAAT CCCACAACGT
CTTTGCCTAC TGCTCTGAGT TTGAAAACTC CGTAATGGTG GGACGCCAAG CCGAGCATGA
CGTAATTTGC CAAAGAGCCG AAGAAGAGGC AGAAATGGAA GCCGAAAAGG CCAAAATTTC
CCGAGCTCGC AAGCGGAAAG CAGCGGAGGA GAAGCTTATG GCTGCCGAAG CTGCCCGTGA
GGCAGAGGAA GAAGCTCGCC GGAAGGAAGA GGAGGAGAAG CGCAAGAAAG ATGAGGCTCG
CCGGGAGCGA GAAGCCAAAG AGGAAGAGCG CCGGCGGGCA GAAGACGAAC GAATGGAGGA
AGAGCGTCGA AAAAAGGCAG GTCCTGCCAA GTACATTCCT CCATCACAAC GCTTGGCTAG
CGGTGGCGAA CGTGGTGGCG GCGGTGGAGA AGACCGCCCC AGTCGATTTG GTGGTGCCGG
ATCCTACCCT GGAGGTGGAC GTTATGAAGG CCGTTCGGAC GATCGAGGCG GTGGCTGGCG
CGGTGGTGGC GACCGAAGTG GTGATTATCG TAGAGGTGGA GATGATCGCA GAGGTGGAGA
TGATCGCAGA GGTGGAGATG ATCGCAGAGG TGGAGACGAT CGCAGAGGTG GAGACGATCG
TAGAGAAGGA GACGATCGTA GAGGAGGCGC ATACGGAGGA GATCGTCGTG GCGGCGCAGG
GAGTGGTAGC TACAACGATC GTCGTGGGCC CCCTTCGGAC GGCAACAGTC GTTGGCGCTA
AGGAATTTCA TTGGTGGCCG TGAAGAAACA AAAGTAGCTT TTACACAGAG GTTAGACGTC
TTTCTATTTT ACGAGTA
 
Protein sequence
MATSNFFHKP ELALRRAQEL EGIHQDDAAL AMLHEVLSSR RHRTWSPAFE QIMIVYVDLC 
LKLYKTREAK DGLHQYRNLS QTQAPGSLEK VIRYLLTAAE KKCTEAKASA DAQQDQILEE
NGNGDDEDGF RASPQAILLS TMSTDPAKSQ RDSALLVPSL KFLWETYRAV LDILRSNSKL
EHVYHFAAQS ALQFCQDYKR RMEFRNLCDM LRLHLGNLRQ YGNLDANDDG KTNNKVRGWE
GWTTESIELH LQTRFIQLET ASVLHRYTEG FRTCEDIFNI LQISQARRKH NPDVPAPKAK
LMASYYEKLT TLFWVSENYL FHAFAWYKYY TLCKEFNRGM SEDTKRMQAS AVLLAALCIP
SVPANESKAS HSNQHTIATT VEDGIVKQKM ARMATLLGFH TRNPTRDALL AEIRSKGILE
QAPAYLRELY ELLEETNDPL IMVQKAKPLL EQLQQELGAT TSNDVKNDDV DDTTLGRYVK
PITNVLLLKL IRNLSAAYHT VSMDHLKSLT SGLDLSFEQV EKTIVTSSKT LAVRLDHRAG
CLRFGNVQLE SDAMRSQLVN LSKQLQAVSN VLTPPDRQSV LQSRLSTYQS VRENLHAEHA
AVLERKNLIE TRKEETERVA QEKSKQEARV KAEEEAARKA EEEQRIVREQ RLREIEKQRK
IQQELDNQEK KRFLAAMGKK TEDISEEQIA KIDTEALQRE HEAKINKERE EAERKTRETA
KKLDYLVRAI RIEELPLIKK KYEEKTKLDK ERYEQENIEK AQKAKLQWEA DVKDKAVLES
HNVFAYCSEF ENSVMVGRQA EHDVICQRAE EEAEMEAEKA KISRARKRKA AEEKLMAAEA
AREAEEEARR KEEEEKRKKD EARREREAKE EERRRAEDER MEEERRKKAG PAKYIPPSQR
LASGGERGGG GGEDRPSRFG GAGSYPGGGR YEGRSDDRGG GWRGGGDRSG DYRRGGDDRR
GGDDRRGGDD RRGGDDRRGG DDRREGDDRR GGAYGGDRRG GAGSGSYNDR RGPPSDGNSR
WR
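
The sequences above are printed in space-separated groups of 10 characters, 60 per line. A minimal sketch of how such a block might be joined into one string and checked against the Gene Length and GC content fields is shown here. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first 30 bases of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (assumes only A/C/G/T)."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First three 10-base groups of the gene sequence, split over two lines.
sample = "CAAAGGATCC CTTGGTCGTC\nGAGAAGACTG"
seq = clean_sequence(sample)
print(len(seq), gc_fraction(seq))
```

Pasting the full gene sequence block in place of `sample` should give a length matching the Gene Length field, and a GC fraction that can be compared with the GC content field.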