Gene PHATRDRAFT_49250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49250 
Symbol 
ID7195545 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp372110 
End bp376106 
Gene Length3997 bp 
Protein Length1179 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183980 
Protein GI219127517 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0121535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGAAAGGAAC GACCTATCGC TCAAGCTGTG TATTGTGGCA GAAACTGTTG TTGAGTAGCA 
GGTCGTAATC TATGCACTCA CACCTATCGG ATGAAAGACC AAAGCTATAG CTTTTCCCGG
CACACTGTAA GGGAACGTTC TGAGGTTTGT TGGTGTGCTG GTTGCCCCTA AAATGCAAGT
TTACGAATTC GAGGTCGGGC CGGGCGCTCG GATTGTCATT CATTCTGAGA TTGGAGGTGT
TTTGGCTTCA AGTCCACCAT TTTATGATCA AGAGACTGAT ATGATCACCA TTGATTTAGC
GAATTTTACC GGAAGACTGA TGATGCACAG TAGGGAGAGG CGCCCATCTC CCAGCATTCT
ACACGAAAAG GCTGAGATTT CCTCCGGACT ACTTCCAAAT CGAAGCCAAA CTCCACCTAG
CGTCAGTAGT GATATCAGAG CTGCCCAGCA GGATGTCATT GTCCCTCCTG GAGCTAAATA
CGGGGACGAT GCTCATCTTG CTCACGACGA AATATCCACA CCACCTCAAA GACAGATACT
TCCTTCGGGA GCCAGAAACT GGAAGGATGC ACACCTTTCG CAGGACAGAA ATTCTACTCA
ACGACAAGTT TCCGAACCCG CTTCGACAGT CCCTTCAACT TCTTCTTGCT CGAAAGATAT
ATTTCCAATT GCTTCATCAA GTGCCACTCC CTATTCGCCC GTTTCAGAAA GAGAAAGAGC
CGTCTCACCC ATTGAAGCGC GTGATAGTGT TACTTCGTTA GAATCCGAGG AAGACAAAGT
GACTGACCAG AACATGTTCG GAAACGTGGA AAACGACGGG CAAGGAAACA ACAGTATGGA
CGGCCGAGAA GTGCAGCTAC CAAAAAAGGT TTCGGAACCA GGTTTTGTTG CTCCTATCGA
AAGCGATGAG GGCAAACAAA GGACGTCCTC AAACGTTTTC CAGCATGTCA AAGACAGCTT
TCTTGACAAG GCCAAAAAGA TCAATTTCGT TGGTGGGGCT GTGAAAGATA CAGTGTCCGA
CGATGCAAAC TGCATCGGTA CGGAAGAATC AACGTCGATT CTTGACGGGG AGCTGCCTGC
TCTCAAAAGG CTAGGTGCGA ATTGGGGACA CATGCTAATG GTCGCTTCGA TGCGTAACCG
CATGCCCAGC AGGGTCCGCC AGCGTCGCTT GCTAGAGAAT CATTTGACGC CGAGCGGATT
AGACATTGAG CCTCCCGAAA AGCCACCTAC AACCTTGCAC ACGCTTTGTG GGTCCCCTGT
TGTGACGTTG GCAGAATTGC ACGCTTGCTT AGAAGAGAAT TCCGGAGCAG CAGCAATCTT
GGATGTCAAT GGGCGATTGC CACTACATAT TCTAGGCGAC AATGATGAAT TGGTAGGCAC
TCTCCATGGA CGACAGATTG GTACAGCCTT CGGCTACGCT TTGATGAAAG CGTATCCAGA
AGGAGTAACC ACCACCGATA AGGTAGGGCA CATGCCATTC GTAAAGCTTG TGGATGACTG
GGTTTTGTCG ATTTACGAGA CCTACCGGAA GAGCAAGAGA TCAGACTCGA TACCCAACAC
GGCAAGAGGT ATAGTAGATC GCGTTAAAGG CTTGACTCCT CGAAATATGA TACACAAGGG
CACCGATTCA ACCCAGACGC AAGCGCAGTC GAAATCGAAG TCTGATCTTC CTAGCTCCAG
CAGTGCATTT CCTTCGGAAA TCAAGTCAGT GCATACTGAT CGCTCGGACT TGTTTTCCTT
GTCCTGTCGA GCATTTCCGG CGGTTGAATT GTGGGACGAG GTCGAGTGGG CATTTGAGAT
GCTCTCATTG GCTATGGATG AGCTTGGAGG CAAGAGCGGT GGCCTTCACC AAGAAAACAA
ACGCGCAACG GTGCACCATT CGATTAAAGA CCGAAGTGGC CGGCATTTGC TATCGACACA
TGTTGTTACT ATCATTCCGA CACTTTTGAA GACTGTGCTT TTACTAGAAG ATGACGGGAA
AAATACTCGC AAGCGATTGC TTGGAATGTC CATTTTTCGG AGGATGCTAA TATGTCCCGA
ATCCGTGGGA CCATGGTTGA CACAAATGTT ACGCAGAACG GGTGTCCCTG CAAGGCGCGC
TGTCGACTAC CTTGTTTTGA TGTCTGAATC GACCGTTGAA GACTACGTTG GAGGCTTCCG
TACTATCCAA TCTGGTGATC AAGACAACTT TGTTGACGAG AAGGGTTTAG CTTTCGGCAA
AGTCGATCAG CTCGAAGGGC TCATTGCCTC ACTTGTGACA ATGGATGCTA GAGAAACGGA
AAGGGCTGCT TCCACATCGG TGGTGTGGCA TGTGATGAGC AAAAATCTTG GACGCCCTTT
TGTTGTTGCG CTAGTGCTGA TTGATTTTTG TCTACACTTG GTTCTTCTTA TGGCTTTTAG
TAAGTTTTTG AGACAAATTA GTTTTGCAGA AAGAATACGC TAACTATATG CTGCACGCTC
CATCTCTTAC AGGAAACGAT GTGGAGTTTC AAGGAAGAGG AGAATCAACA GCTGTTGGTA
GGTGAATTTC CCCCAATTGC TGGTACAAAA AAATAGGATC TAACCCGAAG TTCTTTCTCA
CAGGAAACGT ACCTACACAA ATAGTGATTT TTATTTGTGT AAATTACTTC CTCCGCAAGG
CATGTGAAGC ACTGGCACTT TTGAATATTT CGACACAAGT GTTTCGCACA TACTTCTCCA
ACGTCTGGGT ATTCTTTGAT ATCAGTGCAA TTGTTCTAAC TCTAATTGCT ATCATTTGGA
ATGACCGGAA TCCTGGATCG TATCGTCAAG GACTAAATGC CTTTATTCTC GCGCTTCTCT
GGGTCAAAAT TCTCGGAATT TTAAAAGTCC TTAATCGCCA AATGTCTACG TTCATTCTCG
CTCTGATACA GATTCTGAAA GACATTCGTT ACTTCATGGT CGTTTTGATT GTTATTCTTT
TTATGTTTGG GGACATGATG CACATCGCTC TCAGTACTAA GGACAACGGT CAATTTTGTA
TTGTGAATGA AGAAGCGGGC ACATTAAGTG GACCCGCAGA AGATTTTTGC TCCTCAGAGC
AGTTCGTATA CTATCTCCGC ATGTACGGTT TGCTACTTGG CGAATTCGAA TTAGACGACT
ACAAGGAGAC AAACGCAATG ATTATCATCT TTGTTGCTTT CACGCTGCTC GGCGTTGTTG
TTCTCCTGAA TGTACTGATT GCAGTAATTT CGGATTCATA CGAAAAGGCC AAAATCAGCA
GCATGCTGCT GTTTGGCAGA GCTCGAGTGC AATTTGTGGC GCAAACATCG GCTTTAGAGT
CCTTCCTGCG ACCTGGGGCA GCACCATTGG TGGTCACAGG GTTTGGAGAA CGTTTTGAAA
CATTTGTAAA ATCAGGAAGC AGATTCGGAC GGTGGTTGAT CCTCTTGGCA ATTATCGGTA
CCGCTATGAA CTCGGAAATA TACCTCGTCA CTCGAGCTAT TGTAGTGGTC AGGGGCAACG
GTTTTAGCTT TGTTACTTTG TTCACATGTG AGTCCACATA AGGCTGTACC TGCTTTGTCG
AGTCTGTCTC ATTTCAATTT CACTGGCTTG TTTTTCAGTG GCATTGCTCT GTGTTGTTTT
GACACTAGCC CTTTGTGTTG TAATAGTGTT CACTTTCGAT AAAGCTTTGC GAAAAAGGTT
ACCTGCAGGG ATCACCCGAC AAACTGACAA GGTTGACACA TGGTCAACAT ACCTTATTGG
CTTAGTCGGA GGACGGCTTT TTGGTCTGTA TGACAAAACG AAGTCAGACA CTGACCATAA
CAATGAAGAG GCAGAAGAAT GGACAGGACG CATGACTTAT CTGGAGCATG CAATTGAGAA
GCAAATTAAA TCTGCTTCTG ATAACCTCAA AGATGAAATT AGAGGCGTCG AAAAGCGAAT
CTACGAGCGC AAATTTGCTG CCGAGGCAGG ACCTGCCGCG TAGAGAAATA ACGAGATTTA
GGTAGTGAAA AAGTGGTCGT CCACTTCTCT TTCCGCC
 
Protein sequence
MQVYEFEVGP GARIVIHSEI GGVLASSPPF YDQETDMITI DLANFTGRLM MHSRERRPSP 
SILHEKAEIS SGLLPNRSQT PPSVSSDIRA AQQDVIVPPG AKYGDDAHLA HDEISTPPQR
QILPSGARNW KDAHLSQDRN STQRQVSEPA STVPSTSSCS KDIFPIASSS ATPYSPVSER
ERAVSPIEAR DSVTSLESEE DKVTDQNMFG NVENDGQGNN SMDGREVQLP KKVSEPGFVA
PIESDEGKQR TSSNVFQHVK DSFLDKAKKI NFVGGAVKDT VSDDANCIGT EESTSILDGE
LPALKRLGAN WGHMLMVASM RNRMPSRVRQ RRLLENHLTP SGLDIEPPEK PPTTLHTLCG
SPVVTLAELH ACLEENSGAA AILDVNGRLP LHILGDNDEL VGTLHGRQIG TAFGYALMKA
YPEGVTTTDK VGHMPFVKLV DDWVLSIYET YRKSKRSDSI PNTARGIVDR VKGLTPRNMI
HKGTDSTQTQ AQSKSKSDLP SSSSAFPSEI KSVHTDRSDL FSLSCRAFPA VELWDEVEWA
FEMLSLAMDE LGGKSGGLHQ ENKRATVHHS IKDRSGRHLL STHVVTIIPT LLKTVLLLED
DGKNTRKRLL GMSIFRRMLI CPESVGPWLT QMLRRTGVPA RRAVDYLVLM SESTVEDYVG
GFRTIQSGDQ DNFVDEKGLA FGKVDQLEGL IASLVTMDAR ETERAASTSV VWHVMSKNLG
RPFVVALVLI DFCLHLVLLM AFRNDVEFQG RGESTAVVIF ICVNYFLRKA CEALALLNIS
TQVFRTYFSN VWVFFDISAI VLTLIAIIWN DRNPGSYRQG LNAFILALLW VKILGILKVL
NRQMSTFILA LIQILKDIRY FMVVLIVILF MFGDMMHIAL STKDNGQFCI VNEEAGTLSG
PAEDFCSSEQ FVYYLRMYGL LLGEFELDDY KETNAMIIIF VAFTLLGVVV LLNVLIAVIS
DSYEKAKISS MLLFGRARVQ FVAQTSALES FLRPGAAPLV VTGFGERFET FVKSGSRFGR
WLILLAIIGT AMNSEIYLVT RAIVVVRGNG FSFVTLFTLA LLCVVLTLAL CVVIVFTFDK
ALRKRLPAGI TRQTDKVDTW STYLIGLVGG RLFGLYDKTK SDTDHNNEEA EEWTGRMTYL
EHAIEKQIKS ASDNLKDEIR GVEKRIYERK FAAEAGPAA