Gene PHATRDRAFT_50515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50515 
Symbol 
ID7199239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp282089 
End bp285425 
Gene Length3337 bp 
Protein Length1010 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185410 
Protein GI219130517 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGAAC AGTCTCGGGT ACCGACGAGC TCCGAGAGAC GCAGGAAAAG GGCTTCGTCA 
AAGAGTCTCC GTTCCAATCG GAGACAGGCT TTCCCGGATC GCGTACGCTC CGAGAGGAGC
TGCCGCGATC TCTACAACAG CGACTGCGAC TGCCTTTTCC GCGCTAACGT GAGATCCGTC
CGACGTAACG TGCGCTCGCC AAGAGTAATC GCCGGAAAAC AACGACAAAT ACGGAGGACA
CTGTTGTTGG TGTATTGTCT CTGGCTCTCT GCCGACTGTC GTGCGTTTGT TGCTGGTCCA
CCAGGAGCAA ATCATCCTCA TCCACATCAG CGTTTACATC CGCATCCGTA TTTTCAGCAA
ATCCAGTATC TTTGCTCACG TTCAGAGACG ACGAGACGCG ATGCTAAAAG TAACAACCGT
CCCTTTACAG ATACGGACTC GTCGGCAAAC CCCGACAATT CGAGAAGACC GCGTCCTCAA
TACTCCAACG CTCAACACCG CCCCAAACGG AGGCAAAACA TCAAGTCGCA TCATCGGTAC
AACAACCAGG CGACGGTCAT TTTCAATCAA CAGCTGCTCG ATATATGTCG TAAGAAGAAG
GAAGCTCCCG CCAGTCGGAT TAAGGTCGCT GTACATGCGC AACAACTGTT GGAGGAACAA
ATGAATGCAG GCCTTCACTA CGATGTGGTT AGTTTTAATA TTGTTATGCA AGCCTGGGCC
CGACAGCACT CCTGGACGGC CGCGCAAAAC GCCGAAAATC TCTTGTCCAG GCTACTTTGC
CATTCTACGC TCCAGGCGGA TTCTTATTCG TACGCCGCCG TGCTACACGC CTACGCCAAG
TCCGGTGGTC AAGTTCCTGC CGCCCAACGG GCTCAGGCCT TACTTTCCCA ACTGTTGACC
CAGAGCACTG CCGGTGCGGT CACTTTGACC ACGGACATTT GTCACAACGC CGTGATGGAC
GCGTGGGCAG TCTCCGGACA TCCCCAAGCC GGCGAAAAGG CCGTTCGTCT GCTACAGCAA
CTCGAACAGC ATCCCGTGAT CCAACCCACT CGCATATCCT ACAATGCCTG TATTAAAGCC
TACGCCAAGA GCGGTCAAGC ACCACAGGCA CAAGCATTAC TGGAACGGAT GCGGACCCTA
TCGGCGCTTC GAGATCCACC GAACCGCTAT ACTCATTTGG CACCCGACAA GGTTTCCTTG
ACAACCTGCA TTGATGCCTG GGCTAAAGCT CCACCCAGCG CTACCGCGGC AGCTAGTGCT
GAAGCGTTGC TCTCCGATAT GGAAGAATGC TACCAACGCA CTGGCGATAC GTCGATCCGT
CCCGACATCG TGTCCTACAC TTCCGTCCTA GCGGCGGCCG CACGCAACGG GATGCCAAGC
CACATGGCAC TGGAATTACT CTCACGCATG GAGCGCTGCG CTCGCGAGAG GCCCAACGCG
GCCTTTTTAA ACACTTGGAT TCACTTGTTG GCCAAAACAA ACACCGGCGT CGAATCCCTA
CAGGCCGCCG AAGACATTTT AGCATATATG AAACGAGCCT ATCGAAACGG GAATGATCTC
GTCCAACCTT GTAAGATCAC GTACACGGCG GTCATCGCCG TATTGGCACA AACAGGGACG
ATTCCGGCGG CCCAACGTGC GTCCGAATTG CTGGATGAAT TACAAGGCTT ATGGGAGGAG
GCGCATTTCG ATAAGGCGTA CTTGCCAACC GCCAAGACTT TTGCAAGCGT ATTGCGCGCA
TGGGCAACCA GCAACGCACC AGACTCGTGG ACTAGAGCGA AATCCTTATT GGATCGGATG
GATCACTTGT ATGCAGCTAC CAATTCAGAC GAGCTGAAAC CTAGCTCGAT TGTTTTCGCT
CAGTTATTTC AGATCTTGTC GCGAAGTCGA GACTCCCAAA CTGCAGCCAG ACAGGCACGG
GACCTGTTGC AGCGCATGAG TGAATTGCAA AAATCCGGTA ACCATCAGGA CGTACAACCA
GATGCTTCTA CAATGGCGTA TTTTTTGAAC ACATTAACTA AGGCGGGCGT AGACAATGTC
GTCGAATTGG CAACCGTAGT GCTCAATGAA GTTGAAGATG GTTACGCGGC AGGGATGGGC
CATTTGAAAC CAACGAGTCT ACTTTATTCG GCAGTATTGC AGGCGTACGC GAAGAGCGCT
TCGAAGGAGG GTGCCGAGCT AGCCGAGGCG TTGTTGCACC GAACTCGAGA ACTGTACCGG
CAGGGAACAT TGTACGCGAA GCCCAACGTG TTGTTCTACA ATGCAGTGAT TGATGCGCAT
GCACGATCCA ATGGTGGGAG TGCTGCAGCA GAACGCGCCG AACTTTTGCT TGACGAAATG
GAAACGCGGT CCCGAGCAGG AGACTTGTCG CTCCGGCCAA CTACGCGAAG TTTCAATGCT
GCTATTTTCG CTTGGAAAAA GAGCAGTGCC GCTGAAGCAC CGCAGCGTGC CGAAGCTTTA
CTCAAACGCA TGAATAAGAG ATATGAAGCT GGCGATGAAC GCTGTCGGCC GGATCGTATC
ACATTAAATT CGATTATTGG TGTATGGGCC AAAAGCAGAC AAGAAGGAGC GGCAAAACAG
GCGGAAGAAT ATTTAGGATT TATGGAGAGT TTGTACGAGG GCGGCGATGA AACGTTTAAA
GCGGACTTGT ACAGTTTCAA CTCTTGCATC GATGCGTTTG CCCGACAAGG TGATGTGAAA
AGAGCTACTG CGTTGTTTGA TCGTATAAAG AGTAGATATG AAGACGGAGA CACGTCGTTG
AAACCGAATA CCATTACCCT TACTTCTCTG AGAAATGCAT GGAGCAACAG TCAAGATAGT
GAAGACAAAC AAAGGGAGCT CAATAGGATC GACAAACTAC TACTGGCACA GCAAGGAGAC
GGTTGGCGAA GGAGATCACA AGCTGTCAAC AAGGCTTCCG CAGTGGTCGA CTTGGATTCC
TCCGCTAGAG CCGTGAGATC CAACGAACTG GGCTTGGAAG CACTGTCGAT GTTCGTATCA
CTACGCCGTA AGCACACCGG CAATCCTTCG TAGAAAGACA ATGTCTTTCT GAAATTGAAA
TTCAATTCTT CACTGAGAAC GTATAGGGAG TTTCTTCCTC TCCCGCACAA ATTTTGCTCG
GTGTTGCGTC GGCCAACTGT CGGGTGGCTA CTAAACTTTC GGAATCAAAA AGAATCGCAT
CCTACTCACA TCTTTGATGC CTTGACTGAT GAGTAATCTC ACCAAAACAG CGCACCCTTC
CAATACATCT TTTTCAATTA TATCGACGGC GTAGATTTTT TGATTGTCTT ACTAGTGTAA
AAGTAAACTT GTTGATTATA AATACTGTCG AAACGAC
 
Protein sequence
MEEQSRVPTS SERRRKRASS KSLRSNRRQA FPDRVRSERS CRDLYNSDCD CLFRANVRSV 
RRNVRSPRVI AGKQRQIRRT LLLVYCLWLS ADCRAFVAGP PGANHPHPHQ RLHPHPYFQQ
IQYLCSRSET TRRDAKSNNR PFTDTDSSAN PDNSRRPRPQ YSNAQHRPKR RQNIKSHHRY
NNQATVIFNQ QLLDICRKKK EAPASRIKVA VHAQQLLEEQ MNAGLHYDVV SFNIVMQAWA
RQHSWTAAQN AENLLSRLLC HSTLQADSYS YAAVLHAYAK SGGQVPAAQR AQALLSQLLT
QSTAGAVTLT TDICHNAVMD AWAVSGHPQA GEKAVRLLQQ LEQHPVIQPT RISYNACIKA
YAKSGQAPQA QALLERMRTL SALRDPPNRY THLAPDKVSL TTCIDAWAKA PPSATAAASA
EALLSDMEEC YQRTGDTSIR PDIVSYTSVL AAAARNGMPS HMALELLSRM ERCARERPNA
AFLNTWIHLL AKTNTGVESL QAAEDILAYM KRAYRNGNDL VQPCKITYTA VIAVLAQTGT
IPAAQRASEL LDELQGLWEE AHFDKAYLPT AKTFASVLRA WATSNAPDSW TRAKSLLDRM
DHLYAATNSD ELKPSSIVFA QLFQILSRSR DSQTAARQAR DLLQRMSELQ KSGNHQDVQP
DASTMAYFLN TLTKAGVDNV VELATVVLNE VEDGYAAGMG HLKPTSLLYS AVLQAYAKSA
SKEGAELAEA LLHRTRELYR QGTLYAKPNV LFYNAVIDAH ARSNGGSAAA ERAELLLDEM
ETRSRAGDLS LRPTTRSFNA AIFAWKKSSA AEAPQRAEAL LKRMNKRYEA GDERCRPDRI
TLNSIIGVWA KSRQEGAAKQ AEEYLGFMES LYEGGDETFK ADLYSFNSCI DAFARQGDVK
RATALFDRIK SRYEDGDTSL KPNTITLTSL RNAWSNSQDS EDKQRELNRI DKLLLAQQGD
GWRRRSQAVN KASAVVDLDS SARAVRSNEL GLEALSMFVS LRRKHTGNPS