Gene PHATRDRAFT_47371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47371 
Symbol 
ID7202521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp431275 
End bp435915 
Gene Length4641 bp 
Protein Length1546 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181554 
Protein GI219122443 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCAAAC GCCGTCGCAG TAAATCGGCC GAAGGAACGG CGAGCACGAA AAGTGCTACG 
TCGACGAGAG CCTCCCCGAA AATGAGCCTG GATGGTACGC CTCCATCACA GACACGAGCG
CCTACGCCGG CGCGCTCCAA CCCGTTTACG GACCTGTTTG TGGAGTCCGA AAATGTTGTA
AACGATGGCA TTCTGGACAG CACACTGGTG TTTCTGGAAA AGGGAATCCT CACCGAGCCT
GGCTTTCGTA CAGCGGCTAC GGAGTTGATG GACTACGTCG AGGCTTGCCA AGATGACAAC
ACGTCCACAA CGCCCACAAC AGTGACATCG TGTTCCACCT TGACGACGCC GCAAGTTTCC
CTTGTGCTCC TGCCATGGGC AATTCGTAAT ATTTTGGGCA CGGAAGTATC CACGGACGAA
TTGGTGTGGC GTGCACTTTC CGTTTGTCTC CAATTGGTCG CAGCTACCTC TCACCAGAAC
GATTTGCAGA GAGATGATCA ATGGGAAAGT ATTTTGACCA AGTCGACTTT ATTCAAGCTT
GTTCCCAGAA TTGCTGTTTT CTGTGTACGC AATGAATGCA TCCTGGAAAC CGACGATACG
TCGGAAATAG AGAAGGCCCG AGCATTTGGT TGTCAAGCGT ACGAAACAAT TCTGGCGTCA
AACTTGTACG CACCGACTTT GGACGTCGCC TGCCACAAGG TGTGGATTCC CCTAGTGGAG
TGTCTAACCA GAAACGACCA GGGTTCTTCT AGCGCCAAGC GCACGACCTG TGCACTGCAG
GCCACGGTAC GATTTCTGCG CAAGCTCCAA TGCTCCGGCA AGGGCAATCC GAAAACGATT
TTTGGTTTAG TGGCAACCCG AGAAGTCCTG GTGGCAGCAT CGCGGACGTA TACGTTGTTG
GATACAAACC CTGATGGGCG AACAACGCTG CGAGACTTTC TTTACCAAAG TCTCTTTGAC
GTGTCGCAGC ATATGGATGG ATTTCGATCG TTGCTCTCCA AGGGCACAAT CCCTGTTCAA
GAAGATAACA GTATGGACAC AGGATCTCCA CTCAGCGAAC GAGACAAACC TACTCTTTTC
TTTCGATGCT ACCAAGATTC TCTGCTGAAG ACGATAACAG AAAGCATTGT TATGGAGGAC
GTCGATGTCA TTCAGACCGT GCCCGTCTTA CTCCGTGGCT TTTTGCATGA ATCCATGGCT
TGGGAGGTAA GGGAGAGAGA GAACAAGAGG CAGAAAGGCT CACTTAATTC CGAAATCTCG
AATACGGTTC TCTTTCAAAT GTTCGTATTT CTGACTGTTC CCCTTCGTAA GTTGCTGGTT
TCCTCAAAAG TACCAGAAAC CATCAGTCCC GCGACGCAAG CTCTACGCGA GTGTCTGGAG
TCTTTGTTCG AGCAAGACGC CTACCTTCCA TCACAAGACG TGGACGGAAA ACAACTCTTG
TATTTGGGCA TTATCACCAA TGAGCTGAGT TCGTTATCGG CTACCCAGAT GGTGCAGCCA
GCTGACAATG CACTTGCTGC TAATTGTATA CACTCTTTTC GAACACTGAT GCAATTGAAT
CACAATCTTC TTCATGAACA GATATCTTCC ATTACGGTCC TCTTGTTTCA GTTTGGTGGC
GATTACCGAC TTGGAAACGA GATAACCCAG TTTCTGGTCG TAGTTGTGGA GACGTATACC
AAGCTAAGAC GACAAGGCTA TTTAATTCGT GCAATTCTGC AAGTGGTCGG AGTCCTTACC
CATACAAAGG GAGGTGAGAA TGCTGTATCA CTTCCGTCTC TACTACAACA TACATCATTG
ATGACGGCTT TCGCCAACTC TGCTCAGTAC AGTCCGGTCT TTCAGGTGAA AGAAATATTT
GAAACCATCC AAAAGTTTAT CTTGGGTTTG AAAGAGAGTG ATAGTTCCTT GGAAAACACC
GTTCGTGCGT TGGATTCAGT CGTAGAGCTG ACAATCGTCA TGGTACAAAA TGTGAGAGTT
GACAGTGGGA CAGCTTCCGA GGTGGCATCA TTGTGTCAGG AGGTCGTCGA CACTGCGCTT
TTAAGACTCA TCAGTCCTTC ACTGGGTTCT CTCACTGGTG CAGGGCTACG GCTTTGTGGG
TGGTTTATTG AATTACACGC AAGGTGTGCC TTTTGGTTGG GCCCGGAAAC CCAACTGGAG
ATTCCACCTT CAGTCATGGA GATTCTATCA ACTGCATCGC AATACGCCAA AGGAGGAGAA
GATTTGGCGG GCTACGAGCA AATACTGGAC GAGCTCCTCT TTTTGGCATT GCATCGACTA
CGCCAACTAC ATTCGCTGAT TCATGAGCAA GAGCGCATAA ATTTGGGGGC TCTGAGGTCG
ATAGAAGGAA ACAATGTTTT CACCATAGAA GCTAGCAAGT TAGCGTTTTT TGCTGGATTT
GTTGCGAAAC AGAATGAGGA CTCTCCCTTG TCAGGTTCTC GATGGAGTAA AGTTGCTCGC
GCTTTCGCTT CATGGTCGCC ATACGCTGAA GACGAAGACG TCCGACTCTT TTTGGAATGG
ATGATTGGAG TTTTGGCTAC TGACGAAGCA ACGACAGAAT GTCATCTCGA TTCTTGTAAG
ATTCCTCAAA GCAATGCAGC CGTACGAGAA AATCTACAAA CAGCAAGAGC ACTTCTGTAC
GATTCCTCGT TTTTGGAGGA TGCTCGCGTA GCTTCCAAGT TTGCCCTTGC CGCCCTCTCC
TGTACATCGA TTTCGGTAAC GTCGGCAATC GAAACGCTCG GAAGTATCCG CTTCCACGAA
ACGAAGTCAT CGGGTACTTC TCCTTTACCC CTTGGAATCA ACAGCAAGGA CAAATCTGGT
CAGCTTGATG CGCATCTCGT GACAGAAATG CCGCTTTTCG ACTCGACTAC TGCATTGTCG
AAGTCAAGTC GGAAGACAAT GCTGGCATGT TTGCAAAAGA CTGGAAGGCC TTTGATGTTT
GTTAACAGCT TAGCCTCACT GTATTGCCAC CTAGAGGATC CCATGGATTT CGTCGACTCG
TTGTTGAGGT TGGATCAAGT CTGTAGATCC ATGGCGATGT TGAGTTCCGA TATATCTCAA
AAAGCTCTTC ATCTAGTGAA AGTGTTAAGG TTTGCTGTGG CCAGCACCCT TACTCGCATC
GACGCGGGGT CTCTTTTTCA CGTGTTGGGT AATTCTCAAG AAATCACGGA GGTCTTGAAG
ACAATAGTGC TGTCAGTAAA GCATTTGTGT CTTGAGACCT CCCTTTCAAC GTCGGAAGTG
ATGGCAGAAA TTTTGACGGC ATCTTCTGCA CTGACGGAGC AGCTCGTCCG TGTATCCGGA
GTTTTCGAGG AGCAGATAAA ACAAAGATTT GAGCAACTCA TCAGAACAGC ATTCTGCATC
GATAGTTCAA TCAAGTCAGA ACGGGTTCAT GAGATTGGTG TAGTTGCGTG CCTAGGACGT
TCCATCCTGA AAGGCATGAA ATCAAATCAA CTCTTTCATG AGGAGACGTC GGATTCAACT
GCTTTCAGAA ACATTCGCGG TTTGTTGTTT CCATTGGTTG CAGAACTCTG CTTAGGAAGC
GAAGATAGTA CATGTAGCCG TCATGGATAT CTCTTGTTTG GAGATCTCAT TCGATTCACC
ACTGATTCGG AAAACTGTCC CTTTGCCGAA ACAAGAAGGG ATATTGAATC TGTTTGCATA
GCCTCGCTTT GCAACGCGTC TCTCTCTAAA GATGCCTTAC ATTGCAGGCG GTACGTAGTG
GCATGTCTTG TAGAGACAAG ACCATCTCCT GCAGTTGCCC GGCAAATTCT CGACCAGATC
CTTAATTGCC AGGTTTCGTT TCCGCTGTTG GATACTAGTT TTTGTCAGTT GGTCGGCAGT
CTTCATGAGC AAGCTCTGGA GGAAATGTTT GAACGTCTTG TATCTGACCA GCAATTGTTG
GTAAGATCTA GGCCACTGAA TCTTCGTCTT TCACGATACG TTGTGCAATG TGTGAAAGAA
ACGGACCAAA TCGAAGTGGC ATCGAAGTAC GGCCAAACTC TATTTCGAGT TGCCGTGGAT
TCAATTTACG GTTTTACTCC CGGTGCTTCA TGGCAACATG ATTTTGAGGA GTCAGTAAAC
TTGGTTGTAG AATTGATTTT GAGACGAGAC GTATTTGCCT GTCGGGAGCT GGACTTGGCT
CATTTGTTGT GTCGGCTGAC GGATGTGCTC CGTCCAAATG GAAAAGTGAA GTATGTGACG
GATCATATAT TTGCCTCTTG CGGACGAATA ATCATGACAA TCTTTCTGCG CTACTCCAAA
CAAGTGTACA GTTGCGTTCC GTCGATGATA CAGGTCCTGC GCTCGTTACA ACGACATGTT
TTGTACAGAA CCGGCGAGAA TGGTCTGGAT ATTGCGGACC GGGCGCAGCG GCTGACACGG
TTGTACGAGC AAGTGTACGC GCACCGAGAC GTTTTTAAAA AGCACGTGCT GGGCTTGCTA
CTTGACTTTG TCTACTGCCT CCAACAGGAC ACCAATCCGA CTGTCAAGGA AAGTATGACA
CCTGCCGTAT ACTGTCTTTT GGATACACTC TCCAAATACG AAACGAAGCA ACTGAAGGGT
CTCATGGATT TGAAGGCGAA GGCAGTGTTC AAGGCCGTCT ATCAAGGCTA TCATGTGCAT
CATGCATACA AAGGGCAATA A
 
Protein sequence
MPKRRRSKSA EGTASTKSAT STRASPKMSL DGTPPSQTRA PTPARSNPFT DLFVESENVV 
NDGILDSTLV FLEKGILTEP GFRTAATELM DYVEACQDDN TSTTPTTVTS CSTLTTPQVS
LVLLPWAIRN ILGTEVSTDE LVWRALSVCL QLVAATSHQN DLQRDDQWES ILTKSTLFKL
VPRIAVFCVR NECILETDDT SEIEKARAFG CQAYETILAS NLYAPTLDVA CHKVWIPLVE
CLTRNDQGSS SAKRTTCALQ ATVRFLRKLQ CSGKGNPKTI FGLVATREVL VAASRTYTLL
DTNPDGRTTL RDFLYQSLFD VSQHMDGFRS LLSKGTIPVQ EDNSMDTGSP LSERDKPTLF
FRCYQDSLLK TITESIVMED VDVIQTVPVL LRGFLHESMA WEVRERENKR QKGSLNSEIS
NTVLFQMFVF LTVPLRKLLV SSKVPETISP ATQALRECLE SLFEQDAYLP SQDVDGKQLL
YLGIITNELS SLSATQMVQP ADNALAANCI HSFRTLMQLN HNLLHEQISS ITVLLFQFGG
DYRLGNEITQ FLVVVVETYT KLRRQGYLIR AILQVVGVLT HTKGGENAVS LPSLLQHTSL
MTAFANSAQY SPVFQVKEIF ETIQKFILGL KESDSSLENT VRALDSVVEL TIVMVQNVRV
DSGTASEVAS LCQEVVDTAL LRLISPSLGS LTGAGLRLCG WFIELHARCA FWLGPETQLE
IPPSVMEILS TASQYAKGGE DLAGYEQILD ELLFLALHRL RQLHSLIHEQ ERINLGALRS
IEGNNVFTIE ASKLAFFAGF VAKQNEDSPL SGSRWSKVAR AFASWSPYAE DEDVRLFLEW
MIGVLATDEA TTECHLDSCK IPQSNAAVRE NLQTARALLY DSSFLEDARV ASKFALAALS
CTSISVTSAI ETLGSIRFHE TKSSGTSPLP LGINSKDKSG QLDAHLVTEM PLFDSTTALS
KSSRKTMLAC LQKTGRPLMF VNSLASLYCH LEDPMDFVDS LLRLDQVCRS MAMLSSDISQ
KALHLVKVLR FAVASTLTRI DAGSLFHVLG NSQEITEVLK TIVLSVKHLC LETSLSTSEV
MAEILTASSA LTEQLVRVSG VFEEQIKQRF EQLIRTAFCI DSSIKSERVH EIGVVACLGR
SILKGMKSNQ LFHEETSDST AFRNIRGLLF PLVAELCLGS EDSTCSRHGY LLFGDLIRFT
TDSENCPFAE TRRDIESVCI ASLCNASLSK DALHCRRYVV ACLVETRPSP AVARQILDQI
LNCQVSFPLL DTSFCQLVGS LHEQALEEMF ERLVSDQQLL VRSRPLNLRL SRYVVQCVKE
TDQIEVASKY GQTLFRVAVD SIYGFTPGAS WQHDFEESVN LVVELILRRD VFACRELDLA
HLLCRLTDVL RPNGKVKYVT DHIFASCGRI IMTIFLRYSK QVYSCVPSMI QVLRSLQRHV
LYRTGENGLD IADRAQRLTR LYEQVYAHRD VFKKHVLGLL LDFVYCLQQD TNPTVKESMT
PAVYCLLDTL SKYETKQLKG LMDLKAKAVF KAVYQGYHVH HAYKGQ