Gene PHATRDRAFT_37268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37268 
Symbol 
ID7201931 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp558455 
End bp564344 
Gene Length5890 bp 
Protein Length1865 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181227 
Protein GI219121758 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.645144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCGCC GGCGCACGCG GCAGAAAGGT ATCGGTAGGA TGCCGGGGGT GAGTGTGAGT 
GTACCCAGCG AACGGTACAT CCTCAGTTCG AAGCGCTGTG GGCATTGGGG CAAGGAGTCG
GATCTGTGAA CAAGCTTGTG CGCCGTACTG TAAATGACAG TGAGTCGTTC AAAGTTGTAG
CGTCCCAAGT TGATTACAAA CGTTACGTAC GGCACCGCAA GGCAACGGCG ACCCCGTTGA
GACCCATCGT TCCATCATGT TGGGGAAACT CGCTACGGTA AGTGGACGTC GCGGTAGTTA
CGCTCGGTAG TGGATACGGT AGTCGAGGCT TATCCGTCGT CTCACACACG TCCCTCTTTC
TCTTACTTAT CCCTTACTTG GATATTCTTC TATGTATCCC ACTACATATT TGTTCCCCCG
TTCAGCAATT GGCCCAGCAG CGTCTAATGT TTGTGGAAAA ACCCGCGACG GACGACGCGT
CGCGACGACC CCGAAGCCAC CGTGACACGC GGTCTCTGGG CAAACACGCG GCCGTCAAAC
AGCGCATGAA GGCGCAACAA CAAAAGCAAG GACAGACACC ATCACCGTCC CAAAAGCGTT
TTCGAACGGT GCTTAGCATT GAAACGGCCG ACGCGACCGC GTCGCCCTCC TTGGCGGAAC
CCGAGCCGCA CACGGAACCC TTGCGGGTCG CCGTGACCGT GGACGAAGAC GACAATCAAG
GAGCGGCTTT GCCGACGTCT ACCCCCGTTT TGGGTAATAG TGCACCCTCC GACTTGTTCC
GGGACGTGGA AGCGGCGCTG CACACCGACT GCGATATTCA CGTCTCGCTC GACGACGAAC
CAGAGGTCGC ATCGCAGACT CGGGATGGAT CCTACTGGGC CATGTTTGGG GATACTCCGG
AACGGTGGAC CGGTATTGAT ATGACCGAAG AAGTCGCCAT GAGTCTACTA GACACGTCCC
TGATTAGTGT CTCGACAAAA GAGCCGGAGG AGGACGACTA CGACGGCAAA GATAGTCACG
GTGACGACGC CCCAGCCACG GTCAACAAGA CAATTACCGC TGCCACGGAC ACGACGCCTT
TGCATTTTTG GAAAACCTGG CGATCGCAGC CAGTGACTCC CGCATTGGTC CGGATTCAGT
CGCCGTCGTT CACTGCTGAA GGGGCTACGC CCGACGACCC CGAACGGCAT TTCCGCAACG
TAGACAGCGA ACAACCTCTG TACGCTGCCG ATCGGGTAGC CTTTACCACC TTTCGGCAAA
ATTCCACACA GATTGCCTTG CAGGAAAGCA AGACCTATCT CGATCGGATC GCACAATTGG
AAGAGGCCTT GGCCAACGCC GAGTACGAAC GGGATTGTGA AACGCAGCGG CGCATAGAAG
CGGAATCTAC CGTGTCGGTG CTGCAGTCGC CGTCGTCGTC GGTTCCCAAC CACCCGGCAC
CAACCCCCAA GGCAATGCTC CATCCTCCGG TCCCATCCAC TCCGTCATCC CTCCCGGATG
TGGAGTCTCT CTGGGAACGC AACAAAACAC TCGTCAAGGA AGTACGCTTT GCCGATCAAA
CGTGTATGGA ATTGTCTTCC GAGAAGACGG CCTTGGAACA GAGGATCGAG CACTTGAGCG
AGAACATTAC ACGGTTGGAT CAAGAAAACA CATTGTTGCG CGAGCAACTC GGACGGGCCG
AAGAGACGGT ACGGCAGGTC AAGACCGACA CCGAAACCGA CAATGCCGCC TCTCCCTCAG
ACAAGGATCG TACACGACGG GCCTTGGAAG AGTTCCCTCG TCCAATCGAT CGTAAGGAAG
GAATGGGCCC GTCCCTTGGC GAGCACATCG CGAATATGGC TTCCCGAGAA CCGTCCACGA
TGGAGAACCC GTCCGATCGG TGTCACACTG CCCAGTCGGA GAATCCATCG CTACGGGAAC
GGTTGGAGGT CACCTTGGAA GAGCGAACGG CGGCGCTTCG CCAATGCGAA AGTTTATCAG
CCCGTGTCTC GGAATTGGAG GCATTGGAAG GCCGTCACCA CAGAGATTTG GAAACGACAG
CGGCCCGGTA CGATGATCTC TTCCGACAGC TTGAAGAGGC TCACCAGAAG TTGACTGTGG
CCAACCAAAC GATAGTCGAG GGTGACGAGT GTCGGGAATT GCTGCAACGC CGATTGACGA
GTGACGCCCA ACGGACTGAC GAACTAGAGC AGTCGCGCTT GTTATTGCAA AAAGCCAACG
ACGACGTGGC TCTTTGGAGG ACCAAGTTTG CAGACGGCGA GGGGGAAATT CGCTGTTTAG
AGGGTCGCAT TCGGGAGCTC ACCTCCAGCG GTCACGACAG CTCAAGGGCT GCCGGCCTGC
TTCGAAAGGA ATTGGGAAAG GTTGCGGAGG AAAAACGCGA GCTGTTGGTC GAGCTGGAGG
CGGCCAACGA GAAACTCGCT GTATGTGAGA CCAAATTGGA ACAGTTCGAA AAGGATGGGC
AGGAACTCGA GGCAACGCGA AGTGAGCTTG ATGCCCTCCG TAACGAAGAG ACGTTACGAA
ACGAGCAACA CACTGAACGG GAACGTCAAT TCTCATCAAA ACTCGCCGAT ACCCTCTCGC
GGATCGAGAC TTTAGAGCTA GACGCTGCAT CTGCGATTAA ACAAAAAGAC GAACAATCGC
AAGAGCTCGA TGCGGCACTT GGGCAACTCG ATTGTTTAAG AAGAGAAATT GACGATCTAA
AGGAAGCACG CTTGAGTGAT GAAAACTTGT TCCAGCAGAA GCTGAGTCGC GCTAGCTGTG
ATTTGGAAGT AGCCCACACA GATATTGCCG TTGCCCGAGA CGCATGGCGC CAGAAGGAAA
GCGAACTGTT ATGGGCGGTA CAGAAGGTAG TTGCCCAGAC CGAGAGCCAG GCCACTATCT
TGGAAGAGAG ACTTGTCAAC CGTCACGGAG AACTCGTAGC TCGAATGGGT CAGGCAGTGG
AGGCAGTCTC GTACGTGAGA GAATCAATCA TTTTTGGAGA CAGTACTACT GCTGTCGATA
CTGTGAGTGC TGCCCATGGG CTGGCGGAAT CGGCACCCGC GACACCACTT CCTAGTGACG
GTGCTTTAGA TGCAACATTC GATAATCCCG GAGGCGCATC GACAGATCTA GAGTTAATGG
AAGAAGCTAG GCCTCATGGT TTCGCCGAGT CGTATGGCGT CCACGAGACC GAATTCGATT
TTACCGTGTC ACTACCGCTA GACACCGACA CGATGTCTAG CATTGCCGGT ATTTCTCATT
TGTTTTCTTT ATCGCCGGTT TCTATCGATC AATCCCCCAT GGAGAATACC CACAATACCA
CAACTTTACT GCGCCCGCGA GGCAAGGAAC TCCAGAGCCC GAGGCGACGG ATCAACGAAT
TGATTTGCCG TACCTCGGAA CTGGAAGAAC AGAAAGAGAT TGCACTGCAG GACGTCATCG
TCTTACAGCA ACGGGTGGAA GAACTCGAAT CGGAGCTCCA TGCAATGGCC AAGGATCTCT
CACTAGCATG CAAAGAGAGA AACGCCGTAG CAAAAATCAG CAATTTTCAC AATGGCTGCA
TAGAAGCAAC CCTGAAAGAA AACGACACAA ACCTTTGTCA AGTAGAGAAG ACCGACTCTG
ATACCACTGC ATCGGCAGTT TCAATCCATG TCAACGAGGT CATAGAAAGC ATTTTCTCTC
GCAAAGGAGA GACCTTTATC TCCGAGTCTG AAGTCGCCGA CAAGGTTTCC CGAAGCGAAA
GAGACATCGA GGGACTTGGA TTGGACCGAA GCATTTTGGA TCAGCGCCTA ACGACAACCA
ATGACAAGAA AGGAGAGATA GCGGTTGCAT TGGTGGCTGA CAACGAACAG ATCGACGATA
ATACCTCTGA GATTGTGTCG TTGAGATTGC ATTGTGGATC TCTTTCGAGT GATACGGACA
AAGTGGTGGT GCTTCGGAGA TTGGAAGACT TGGAAGCCAA ACGTGACAGT ATGCGGGATT
CATTTTCCAG CATCCAAGAT TTACCTAAAA GGAACGAGAG GAAATTTTTT GAATCCAGGT
TTGAGTCAAT GACGAATGAT TGTGCGAAGG CTAAGGAGAG AATCGAAGAG CTTGAAGCAC
TGCTAGGGGA AAAGACAAAG CAATGCCAAG AGCTCGAGGC TCACATTGCT ACAATTCAAG
AATCGTATGC GGATGCCAAA GTAGAAAAGC AAATCATTCA AGAGTCGATG GAAATAAGTC
TGTCTCAAAA GATAGCCGAG TCGCAGCAGG CAGCCCAGAT ACGACTATCT GCATCGGAAG
AACAGCTTGC CATCGCTCGA GATGAAATCG CTTTGCTTCA TTCGCTTCAC AAATCGGTCA
CATCAGAGAA AAAGGAAAGT GCCACTAAGA TTGCAAGTCT GCAGGAGGCT TCCAGCAAGC
AACTGACCGA ATTGAAGAAG GCGAATGAAT CTTTAGACAT AATTGGCCAA GAACGAGATG
ACTTGTTGAA GAGCTTGTCG AAGCTTCAGT TCGAAAATGC CGACGCCAAC AGCGAAATGG
AAAAACACCA GGAAGCCCTC TTCGAGACTC AAGAAAGCCT TACCGAGTGC CAGCTGAAAT
TATCCGTCCT AGATGCCATC ACCACCGAAA GAGACTCCTT GCTCAACAAA ATTACTGAGC
TTGAGCTGGA ATGCACTGGG CTGAAAACTA CTATGACTGA AAGTTCGGCT ACCGAACGCA
AGTCTTTGGA ACACACGCTC TCGTCACTAG AAGAAATTCA GGTCGAAAGA AAATCTCTTC
TGGAAAATTT GGATTCCGCG AAAATCGAAA ATGGACTCGT CAGGAAACAG CTAGATTCTA
TGAAGGCTGA GCTTGAAGAC GACAAGCGAA CGAGCGCTAA ACTTCGCGAA TTGTACGAGC
AGCAAGTTGA AGCCAACGCT GAATTGCGAA GGGAAGTCTT CCACTGCAAA GGTCTCGTGG
ACGATTCTGA TTTTGCAATG CAGGATCTCA AAGAGAAATA CCTCGAGTGC AATGAGAAGC
TTGCGAATTT TTGTTTTTTA CACAATTCTA ATGAAGAATC CAAGCTGATG TATGATCGTG
CTCGTGAATC CGCTGAAGGG CTTGCAATCG AAAGCCAAAG GCATCTCGCA AAAGCCCGGG
AAGACCTGGA AACAGCACTT TCCGAAAATG GCAAAATGCA AGCAGAGTAT GAAAAATCCC
AAGAGGTCCT CTCCGATGTC CGACGAGAGC TAGCAGAGCG AAAGGAAGCT ATCAAGGATT
TAGAAGTGTC GCGAGAAGCA GCTATTATGG GGCTTGCTGA ATACAAAGAA CAACTCAATT
CGCTGGAAAT TTCTCTCGAT AAACGGACGC AGACTGTGAA TCAGCTCCAA GCCGACGTTA
AGGAACGTGA CGACGCTCTT TCGAAGTTGG ACCAACATAA TGCTGAAATC AACGTGTGGG
AGAAACGCAT ACAGGAGTCA AACGATGCTC TTTCCAGACT GCAGGATCAG CTTGACGAAA
GCACGGCATC CCTGCGTACT ATGACTAACG AATTCAAAAT GGCATCGACG CGAAGCGATC
ATTTAGAACT AAAGTGTTCC CGTCTTCGAG ACTACATTCG GAAGGTGACT GGCAAGTGTG
ACCAGTGGGA AGACTTTTAC GATCGACAGG CTGAGGTTGT GGAGGGCCTG AAGCGTGCCA
ATGAGCGGAC TCGTCAAAAG ACTGCTGAGC TCGCTCGTCG GTACCAGGAA CGAGACCAAA
TTCATGACAA AGAGCGTGCC GTTTGGACAG CGCAGAAGTG CAATCTGGAC TTTATACATT
CGCAGCTAGA AGAAGAGTTG CATGGGATCG CCAACGAGCT AGCGCATGTC GAAAGCCGGC
CAGTTTCGAG TTAATGCTTA ACTGTGAAGT CAAATGATTA CCTAATCTCC AGCCCAGTTC
CGCTGTTTAA
 
Protein sequence
MSRRRTRQKG IGRMPGVSFE ALWALGQGVG SVNKLVRRTV NDSNGDPVET HRSIMLGKLA 
TQLAQQRLMF VEKPATDDAS RRPRSHRDTR SLGKHAAVKQ RMKAQQQKQG QTPSPSQKRF
RTVLSIETAD ATASPSLAEP EPHTEPLRVA VTVDEDDNQG AALPTSTPVL GNSAPSDLFR
DVEAALHTDC DIHVSLDDEP EVASQTRDGS YWAMFGDTPE RWTGIDMTEE VAMSLLDTSL
ISVSTKEPEE DDYDGKDSHG DDAPATVNKT ITAATDTTPL HFWKTWRSQP VTPALVRIQS
PSFTAEGATP DDPERHFRNV DSEQPLYAAD RVAFTTFRQN STQIALQESK TYLDRIAQLE
EALANAEYER DCETQRRIEA ESTVSVLQSP SSSVPNHPAP TPKAMLHPPV PSTPSSLPDV
ESLWERNKTL VKEVRFADQT CMELSSEKTA LEQRIEHLSE NITRLDQENT LLREQLGRAE
ETVRQVKTDT ETDNAASPSD KDRTRRALEE FPRPIDRKEG MGPSLGEHIA NMASREPSTM
ENPSDRCHTA QSENPSLRER LEVTLEERTA ALRQCESLSA RVSELEALEG RHHRDLETTA
ARYDDLFRQL EEAHQKLTVA NQTIVEGDEC RELLQRRLTS DAQRTDELEQ SRLLLQKAND
DVALWRTKFA DGEGEIRCLE GRIRELTSSG HDSSRAAGLL RKELGKVAEE KRELLVELEA
ANEKLAVCET KLEQFEKDGQ ELEATRSELD ALRNEETLRN EQHTERERQF SSKLADTLSR
IETLELDAAS AIKQKDEQSQ ELDAALGQLD CLRREIDDLK EARLSDENLF QQKLSRASCD
LEVAHTDIAV ARDAWRQKES ELLWAVQKVV AQTESQATIL EERLVNRHGE LVARMGQAVE
AVSYVRESII FGDSTTAVDT VSAAHGLAES APATPLPSDG ALDATFDNPG GASTDLELME
EARPHGFAES YGVHETEFDF TVSLPLDTDT MSSIAGISHL FSLSPVSIDQ SPMENTHNTT
TLLRPRGKEL QSPRRRINEL ICRTSELEEQ KEIALQDVIV LQQRVEELES ELHAMAKDLS
LACKERNAVA KISNFHNGCI EATLKENDTN LCQVEKTDSD TTASAVSIHV NEVIESIFSR
KGETFISESE VADKVSRSER DIEGLGLDRS ILDQRLTTTN DKKGEIAVAL VADNEQIDDN
TSEIVSLRLH CGSLSSDTDK VVVLRRLEDL EAKRDSMRDS FSSIQDLPKR NERKFFESRF
ESMTNDCAKA KERIEELEAL LGEKTKQCQE LEAHIATIQE SYADAKVEKQ IIQESMEISL
SQKIAESQQA AQIRLSASEE QLAIARDEIA LLHSLHKSVT SEKKESATKI ASLQEASSKQ
LTELKKANES LDIIGQERDD LLKSLSKLQF ENADANSEME KHQEALFETQ ESLTECQLKL
SVLDAITTER DSLLNKITEL ELECTGLKTT MTESSATERK SLEHTLSSLE EIQVERKSLL
ENLDSAKIEN GLVRKQLDSM KAELEDDKRT SAKLRELYEQ QVEANAELRR EVFHCKGLVD
DSDFAMQDLK EKYLECNEKL ANFCFLHNSN EESKLMYDRA RESAEGLAIE SQRHLAKARE
DLETALSENG KMQAEYEKSQ EVLSDVRREL AERKEAIKDL EVSREAAIMG LAEYKEQLNS
LEISLDKRTQ TVNQLQADVK ERDDALSKLD QHNAEINVWE KRIQESNDAL SRLQDQLDES
TASLRTMTNE FKMASTRSDH LELKCSRLRD YIRKVTGKCD QWEDFYDRQA EVVEGLKRAN
ERTRQKTAEL ARRYQERDQI HDKERAVWTA QKCNLDFIHS QLEEELHGIA NELAHVESRP
PSSAV