Gene PHATRDRAFT_46518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46518 
Symbol 
ID7201673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp583907 
End bp587152 
Gene Length3246 bp 
Protein Length1081 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180861 
Protein GI219120236 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.42154 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTGG GTGAAAGCCT TAAACTTTGT GTTGCTGATA CTATGAAAAC TGACTATCGA 
CAAACTCTCG ATCTCGAGGC GAGCGGAAAT CAGGTTGCCT TAGCCGGGAA GTCTATCGGT
GCCAGTTTCG AGACGTTCAT CATGGATGAT ATAAATGGAA ATTTACCCAG TTATGATTGC
CGGCTCCTTA ACACTAATCG TGTCTACGAC AACGACGGTG AGGATCTACT GGGTCAAGGT
TTTTTCATCC TCGAGGTCAT TGTGGAACAA AGGGTATATG TAACGGCTAC CGACACCTTG
ACAATAGAGC AAGTAACAAG TCTTGTCGGA TTGGGTTTCA ATCGAGGTGA TGGCGGTCGA
GTCAAGTTTC GGTCTTTGCT ACAACAGTAC GAAGCGTTCG AAAGCATCTT GAATGTTGAA
GTATTAGAAA GTGTATCGTC ACCACAGGCC GCCCCTTCGT CCATACCGAG TACTTTTCCA
TCCGGCAAGC TCTCCTTTGC TCCATCTCCA GCACCAACTA TAACGCCATC AGCTGTACCT
ACGGATCAAC CGTCGCATCG TCCCTCGATA TTCTCGACTG CTATGACTAC GAACTCACCA
TCTTTCCCGC CATTCAATGT ACCAATGTCA TCAACACCTT TGAAAGCTCC AACACTACTG
CCTGCGGCTT TGCCCACAAC TTTGGCCCCA AGTTCTCCGA GCTCGTCCAA GTCCAACAAT
GATGGTTTAC CTTTAATTAT CGGCACAACT ATTGGGGGGG TGTTGACTAT GCTGGTGTGC
TGTTTTTTCG TCTTTTGTGT ATGGTTTCCC TATTGGCGTG AGGGGAACAC CGGCGACGGC
AATGGGCCGA ATCGGCAATT TGAAACGAGC AGTCAAATGT CATCGGGACG TGACACTATT
GTCCCGGGTG TGGTTCAGTT GGATGATGCG TCCTTGGCTA ATACTACTTT GGGAGATGAG
ACAACAGATG GTGGCTTTCG CAACAATTCT GCCGAGAGCA AGAGACCGCA ATATTCGTTT
CTCAAACCAG CCCCAATCGC ATCAATGGAC AGTTTTGACG AGAGCTCTCT GTACACTTCA
CCCGGACCAC CCGCTAGCAA CGCAAACCAA AGCAGTTTAC GCATAAGTTC CATAGCCATG
GCTGCGGTAA AGCCCCCGGT TATGTCCAAG ATTGACACAT CCATGGAACT GAACCCCAGT
CTGAGCCTTG CCCTTCACTT TGAAGACGAC ATCGTCTTTC CCTTGTCTGA GAGTAAAACG
GATTGGAGCA CCGAAAAGCG AGCTACAGTG CACAAGGATT CCATGCATTC GGTGCTCACG
GGCGGCGGAG TCGTGGATCT AGATGAGGTT TATTTCTTTG ATGACGACGA GTCGAAGGAG
GAAGGCTTTG AGCGAGTGCC CGGTATACCT GGCAAACTGG CAATTACGTC TTCTAAGAGT
GACGAAGAAA AAGCTCGCAA AGTGTCTCAA TCAATGGAAG AAGGTACTAG AGGTTTCGAT
CCATTTGACG AAGAGAAGTC TGGGTCGTCG TCGTCGTTCA CTTTTGACAA CGGAGTGGAG
ACGGCTGAGA TCTTCAAAGA AGACTCCTCG TCCGATACGG AAGAAGACGA GAGTTTGCCC
GTGACTAACC CACACCAGTC CAAACGAGTG ATGCCCCTGC TGTCAGGGAG TACTCCCGAG
GCCAACGATA CCCAAAGAGG CATTCAGCCT TGTGATGTGC ATAGACGAAA GTTGATCAAC
AACATTGGAG AAGGCGCTCT GGATACAATA GAACTAAACA GGAATGGGCA CGATAGCAAA
GGCGACGATA CCCAAAGAGG CATTCAGCCT TGTGATGTGC ATAGACGAAA GTTGATCAAC
AACATTGGAG AAGGCGCTCT GGATACAATA GAACTAAACA GGAATGGGCA CGATAGCAAA
GGCGACTTAA AAGAGAAGCA GAGAGAAGCA CGCCTCTCAA GGTCAGCAAG AAAGAACAAT
TCGTTGCTCC GGAATGTTCT AGAGGATGCC CGTCGTTTAG CTGAGGCTGC AACTTCTAAC
AATCGTTCCA GAGCTTCTCG AAAGACGGCA CCACCGCGAA TTGTCGACAA AATCAAACGC
AACTCGCACG ATAGCCAGCC TTTTGATTTA CTAGCTGATA CATTAGACGT GAAGGAGTCT
CTTTCTCTTG CTTCGGCAAA ACGCCCCGCG CATTCCACGA AAGGGAAGGT TGTCTCAAAG
ATGACTGCTG CGCGGTCGAG AGGAAGCGGA GATTTGACAA GCGACAATGA TCATGCTAGA
ATTAAGGCCT TTGCAAGTAC ACATCTCTTC CGCAGTCGTC TTCTGGGCAA AAGAGAGGCA
GGGACTGGTC AGCATAGTTT GCCTTCATCA ACCACTTCGA ATTCTTCATT GCGGCTAACT
GTAAATGTTG AAGCTGAAAT CACTGATGCG ACGTCTGCTT TCGATGCCAG CTCGGTCCTA
ACCTCTGCAA ATCCAAGAGA TTCTCCACAA GGCAAGTATG CTGGTCAATG CCCGGAAAAC
CAGAGTGTAT TGTCACCCCC CAACAACATA GAAATTTTAT ACCCAAACGG AAAAATAACC
GTACAGGACG ATTTGTCATG CACACGGGAA GGCGCTCCAA AACCTTTGTT AACTGATGCG
GTGGATAAAG CTCCTGAAGC TTGTTCTACA ACTGATATTA AGTCGACAAG TGTATGGTCG
ATCGCGCAAT CATCTCAACC ACAACGGTCT CGAAATTCAC GGTGTGGGTC AATATCAAGT
ACGGGACGAC AGCTAGAAAG ATCGCCGGTA TCTACCCAGT CACGGCGGCA GAATCGGCGC
CTTTTGCGTG ATGAGATTGG GAGCTCCAGT CGCACCTTTT CTTCCGCCCC AGAGAGGTCC
CAAGGAGAAG AAAAGAGTTT GGGTGACGAC ACCCTGCCTC TGGCATTTGA ACAGGATTTG
GAAAGGCTTA AGCTGCAGCT CGTAGATATC GTGCGCACCG ATGCATTCAA GGTTGGTCCC
TCTTCAATAA CAGCGTCTAA GACAAATCGA TCCATTGCCT TCGTCAGGAA GAATAAGAAA
GACCAAATTG TTGTCATCGT CCCCCCAGGG AAGGTTGGAG TGGTTCTCGC AAATCGGTAC
GATGGAAAGG GAACGATGGT GTCGGAAGTT CGGCCTTCTT CAGCTGTTCA TGGGGCCATT
TTCCCCGGAG ATGAAATTGG TACGTTAGTG ACTGTGCTTA CGATTGTTGT CGCCAACATT
GCTTGA
 
Protein sequence
MQLGESLKLC VADTMKTDYR QTLDLEASGN QVALAGKSIG ASFETFIMDD INGNLPSYDC 
RLLNTNRVYD NDGEDLLGQG FFILEVIVEQ RVYVTATDTL TIEQVTSLVG LGFNRGDGGR
VKFRSLLQQY EAFESILNVE VLESVSSPQA APSSIPSTFP SGKLSFAPSP APTITPSAVP
TDQPSHRPSI FSTAMTTNSP SFPPFNVPMS STPLKAPTLL PAALPTTLAP SSPSSSKSNN
DGLPLIIGTT IGGVLTMLVC CFFVFCVWFP YWREGNTGDG NGPNRQFETS SQMSSGRDTI
VPGVVQLDDA SLANTTLGDE TTDGGFRNNS AESKRPQYSF LKPAPIASMD SFDESSLYTS
PGPPASNANQ SSLRISSIAM AAVKPPVMSK IDTSMELNPS LSLALHFEDD IVFPLSESKT
DWSTEKRATV HKDSMHSVLT GGGVVDLDEV YFFDDDESKE EGFERVPGIP GKLAITSSKS
DEEKARKVSQ SMEEGTRGFD PFDEEKSGSS SSFTFDNGVE TAEIFKEDSS SDTEEDESLP
VTNPHQSKRV MPLLSGSTPE ANDTQRGIQP CDVHRRKLIN NIGEGALDTI ELNRNGHDSK
GDDTQRGIQP CDVHRRKLIN NIGEGALDTI ELNRNGHDSK GDLKEKQREA RLSRSARKNN
SLLRNVLEDA RRLAEAATSN NRSRASRKTA PPRIVDKIKR NSHDSQPFDL LADTLDVKES
LSLASAKRPA HSTKGKVVSK MTAARSRGSG DLTSDNDHAR IKAFASTHLF RSRLLGKREA
GTGQHSLPSS TTSNSSLRLT VNVEAEITDA TSAFDASSVL TSANPRDSPQ GKYAGQCPEN
QSVLSPPNNI EILYPNGKIT VQDDLSCTRE GAPKPLLTDA VDKAPEACST TDIKSTSVWS
IAQSSQPQRS RNSRCGSISS TGRQLERSPV STQSRRQNRR LLRDEIGSSS RTFSSAPERS
QGEEKSLGDD TLPLAFEQDL ERLKLQLVDI VRTDAFKVGP SSITASKTNR SIAFVRKNKK
DQIVVIVPPG KVGVVLANRY DGKGTMVSEV RPSSAVHGAI FPGDEIGTLV TVLTIVVANI
A