Gene PHATRDRAFT_42940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42940 
Symbol 
ID7196773 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1578932 
End bp1583368 
Gene Length4437 bp 
Protein Length1337 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177324 
Protein GI219111145 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.544378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAAGTCGGC TGATTAAAGT GAATGGGCTG CTCTGTTTCT GAAAGCTAGA TCGTACAAGT 
AGTGCTACTC ATATAGCGAG GTAGTATTGC GTGTTTCATT ATCTTGCCTG ACTAGAAAAC
ATGCCACCGG AGGCAATCCC AGCGCCTGTC GCTTCGCAAA ATTGGAAAAA CATGAACGAA
TCCGACAAAG AACAGATTGA ATGGGGGTAC GTTTGGATGG AACACTGCCG CTTATGCTTC
GTAGATGGAG AAAGTGGAAA TACTGGCGGA AATGTGGCGC TCAAAGAGTG GATTGACGTT
GATGTACTGT ATGATGATGA CTATGATGAA ATGAAATTGT CACACGCCAA TCGTACAGGG
CGTCAGCGAC CTCTTGAAGA TCTTCCAGAA GTCAAAGGTG ACATTGACGA TTCACTGTCA
GCAAAATTTT GTTCCAGATG CAATTTGGTT CTGTATCAGT CGGGACCCAT GGGGAGCAAT
TTGAAGGTAC GACTGAATGA AATACCTCCA TGTCAAGGTA CCACTAAGGA CTTCCTGGGC
ACGGACGAGT GCTCTTGTAT AGTGTCAAGG TCTGACGCGT GCTTGATAAT GGGTGTAATG
AGAACATGTG CTAGTGTAGC GCGTGTCCAG CATTGCTCAA TATTGCGGGA AAGTGATAGT
CAGGAACCGA CTTTTACCTT GGTCATATCG GTAGCCTTTC CGCATTTACT ACATGGACCA
GGTGGCCGTC GAAACATTAT GGCTCTGAAA GATTACAGAC GGGCTTACAA ACCCCTTCAC
ACGACAACAC AACTGATACT ATCGATACTC CGCTCCGACT GGAAAAACTT GATATCGATG
ATGGAAAAAA TGCGACTCGC CCCCACAATC AAAGTAGAAA ATCGTCGAGT ACCACGGTTC
TTTCCCAACA GGCTTGGTCT GGACGAGTTA TACGAACGAA TTCAGGGTAC TGCCGCAACC
CACATGATGC TCGAAACTCG TCGAGAAGAT CAGCTGCGAA ATCAACGTAA CGACTGTTTG
ATAGTCCAGT TGCCAAAGGA TGTATTCACT GATTACCTTG CCCCGTTTTT GAAGCCCCGT
TCGTTAGATG CCCTTCGTTG CTCTTGCTCA TACCTACACA ACACATTACG AGCGGTCGTC
CCAGGTCTAA AATTACGATT ATACAGCCAC CAAGTGTCTT CTCTCATTTG GATGCGAAAT
CGTGAGACCA ATTTACTTTC GGAGAAGGAT TGCCTTAGTA CCACACCCTT GCCGGACTCT
CCTGATAGAG ACTTGCACCG AATGGTCACT GGTGGATACA CGACATTGTT ACGATCACGT
TGCCCTTTGA GTGATTTAAG CATCCGTATT GATCAAAAGA CAGGCTTCGA AATTGTCCCG
AAGGACCTTA CCACTTTACC CAGAACTGTG GCGAGAGGTG GTCTCTTATG TGACGACCCA
GGACTTGGGA AAACCGTTAC AGTTTTATCA CTGATTATGC AAACAGCGGG ACTCTCGACG
AAAGCGAATG ACACTGTAGA TACATCCGTT AGTAAGCAGT TGATGCCGAT GGAGGAAGCA
ATATTCCGAA CGTATTGGAG CGAACAAATG ACCCCTGAAT TTCGTCGTCC ACTCTTGCAC
CGCTTTCTGA ATGCTTTCAT ACGGCAAAGC CCCGGAGGTT TCATTGGTCA AGCTGATCAA
GTCAAGAGAA ACATCGATCT AGACAAGTAC GATTTGAATT TTTCATTGTT TGAAAGAGAT
ATGACGTAAG CTTTGTCGGA CCGCTTTTGA ATTCGAAAGC TGTGAGCTGG ATCTTACTGC
TTGTTTCCTC TACTTTCTCA GGGAAGCCAT TTTCCCACAA ACTTGGGAAG GTAAACATGA
AGATAACGAT AGCGATACTC ATGTATACCG AAACGCGGCA CGCCAGCTTA AAAGCGAGTT
TCTGGCTAAT GTTGAGGAAT TTAAGCAAAC TCAACTTCAG TCAGCGCGCA AATCCTTCTC
TTCAGTTTCG GCAAAGCCAA ATTCACGAGT CGCGGCTTTG CTGGATCGGT CGGAACAAAT
GAAATTTGTG GATTCGCTGG TCTCGTCCTC AGCAACACTG TTGGTAGTCC CGGCAGTGTT
GTTGGACCAT TGGCAGGTAA GGTTGTTTGG GCACAGCATT TCTTTTTCAC AAGGCTGAAG
TCTCATGTTT CATGTTTTGA CTCCCGTTAC ATCAGGCTCA AATCGATTTG AACTTGGACC
TTAGCTATTG CACTGATAAG ATACCATTAA TTTTTGAGTT TGGGAGAAAG CACAAAGGTT
TAACAATGGA AGCCGTATGC GCCATTTGCA AAGACAATGG AAGTCACTTT CCCATGGTTT
TTATAGATAG AGGTGGTACA AAGAAGTTGC CGGCGCCGGA GTTTCTTGCA ATGTTTCAAA
TTGTGATAAC AACGACACAG CGCTTTTCGC AGGAATGGAG GAATGGTTCT TTTCAAGCGG
AACTCAAGAG CAGCGGTTGC AAGGAAGTAT CAAAGTTGTA CCTTGATTCA GCTTTTGATC
GATCTGAGAG TGCGTGTCCG CTACTAAAGA TCCACTGGCT TCGAATGATC GTTGACGAAG
GACATTCAAT GGCGAAGAAC CAGAACTCGA CGATTCAGTT TGCATCGTGG ATTTCTGCTG
AAAGAAGATG GGCGATGACC GGGACTCCAA CAAAACAATC CGCGACCCAG ATTCAGCAGA
TCTACGCTAT GCTCCGCTTC CTTGGTCATG GTTTTTTCAC TCCTAGACTC GATGGAAACG
CGGTTTGGAC GTCAAACGTT GCCCGGTGTT GGAAAGAGGG ATCGTTCGCA GCGTTTTTCC
GACTCAGATC GTTGTTGGGT TTGCTTATGA AACGTCATAC AAAACGTGAT ATTGCAGAGC
TGGAGTTGCC GTGTTGCTCA GCAGAGGTGA TTCCGATGAG TTTTGTTGAA GTAACGACTT
ACAATACGTT GGTATGTGGT GTCCAATCAA ACATTTTGTT AACGTCAATG AGCGGGAAGA
CGTCTGGACT GCAGGATTCT CTTCTTCACC GCTCTCAGGT TCAGCATGCT CGTGCTGCAC
TTAGTAACCT ACGTCGTGTT TGTGTCGGCT ACTCCAGAGT ATTACCGACG CTTGAGACAA
GGTTTTACAT TGAGACTATG GTCCTGCTGA AAGAGCATGG AAGGGATGAC AGACAAATTC
AAAATGTGAA GGAATACCTC CACCGCGCCG AAGCTCAAGA GCTCTCAGAA TGCGATTGCT
GCCAGATCAA GCTCAGCACG CTTCTCCTCT TTCCATGCTG TGGCGGATTC TTGTGCCCAG
AATGCATGGA CGAGAAATCA AATATCTGCG TCCTTTGTGA CCAAGATTTC GACGTGGACG
AATTTCAACG ATTGCAGCCA GGTTTCTCGC TAAAGTGGCT CGAAACGATG ATTGAAAGTG
AACAACGCAA ACCGAAACCA TCTATTTCCA ATGCCAACGT GCAAGTCGAT CCTCCAGGTG
GCGCGGATCC AATCGTGCGT GCCGAGATTG AACTCCCAAA CGGAGTTTTG GTTCGGCCGA
ATATGGAGCT CCGTAGAACT CGAAGATTAG GGGATGGTCA CGAATGTCAA TATGATCGGT
ATACTGTCCA TGGAAAATGC ATTCTGTGCT TGTCCGAACA CAGCTTCTGT AACCTATTTA
ATGACAATGC ACAATGTGCG ATCTGCTTCC GGGCAGCTGA GGAGTGCTCG GAGGAGGAAT
CAAAATCCTT TTATCTTGTG AAGAAGCTCT CTGAATTACA TCAACAATTA CGCAACAACG
AGCAGCGACG TCCTCTGAAA ATTATCGTCT TTTCGCAATT TCGCCAAGCT TTGAACATGG
CTGGGAACCG TCTTTTGCGT AAGTTTGGAA CTGCATGCAT TGCTGAATAT TGGGGCAGCT
TTCGCACGAC TGAACTGCGG AAATTTACGT ACGACCGAGA TTGTTTCTGC ATGCTTCTAG
GAAGAGACGG CAGCGAAGGA TTGGACTTGA GCTTTGTCAC GCATATATTC TTTCTCGAGG
AAATCATGGA CCAGTCGCTT CGTGACCAAG CAATTGCTCG AGCCTGGCGG ATGGGCGCAA
AAGGACGTGT GCGGGTGGTA ACTTTGACCG CGGCAAAAAC AGTAGAGGAG ACCATGCAAG
AGATTGAGTC AGCAGCTCAA TACCGTTTCC AGTACTCGCA TCAAACTGTC ACACGCCCTA
TTGTTGCAGC AGCCGAGTCA AACAATTTAG AGGAGTACGC CACGGCAAAG ACACATGCAT
TGCTTCGTTC CTTACGACTC ATTACTGACT ACCACCATTT TTCGGCGGAG CCGCGAACAT
CGACCGCAGA AAAAGCCACC TGTTTGAATA ACAAGCTACC CGGGATTTCG AAAGAAAGCG
ACAAAACTGT TGACAACCCT CCAGTTAAGA GACGAAAAGT AACTTTCACG TAGCGAT
 
Protein sequence
MPPEAIPAPV ASQNWKNMNE SDKEQIEWGY VWMEHCRLCF VDGESGNTGG NVALKEWIDV 
DVLYDDDYDE MKLSHANRTG RQRPLEDLPE VKGDIDDSLS AKFCSRCNLV LYQSGPMGSN
LKHCSILRES DSQEPTFTLV ISVAFPHLLH GPGGRRNIMA LKDYRRAYKP LHTTTQLILS
ILRSDWKNLI SMMEKMRLAP TIKVENRRVP RFFPNRLGLD ELYERIQGTA ATHMMLETRR
EDQLRNQRND CLIVQLPKDV FTDYLAPFLK PRSLDALRCS CSYLHNTLRA VVPGLKLRLY
SHQVSSLIWM RNRETNLLSE KDCLSTTPLP DSPDRDLHRM VTGGYTTLLR SRCPLSDLSI
RIDQKTGFEI VPKDLTTLPR TVARGGLLCD DPGLGKTVTV LSLIMQTAGL STKANDTVDT
SVSKQLMPME EAIFRTYWSE QMTPEFRRPL LHRFLNAFIR QSPGGFIGQA DQVKRNIDLD
KYDLNFSLFE RDMTEAIFPQ TWEGKHEDND SDTHVYRNAA RQLKSEFLAN VEEFKQTQLQ
SARKSFSSVS AKPNSRVAAL LDRSEQMKFV DSLVSSSATL LVVPAVLLDH WQAQIDLNLD
LSYCTDKIPL IFEFGRKHKG LTMEAVCAIC KDNGSHFPMV FIDRGGTKKL PAPEFLAMFQ
IVITTTQRFS QEWRNGSFQA ELKSSGCKEV SKLYLDSAFD RSESACPLLK IHWLRMIVDE
GHSMAKNQNS TIQFASWISA ERRWAMTGTP TKQSATQIQQ IYAMLRFLGH GFFTPRLDGN
AVWTSNVARC WKEGSFAAFF RLRSLLGLLM KRHTKRDIAE LELPCCSAEV IPMSFVEVTT
YNTLVCGVQS NILLTSMSGK TSGLQDSLLH RSQVQHARAA LSNLRRVCVG YSRVLPTLET
RFYIETMVLL KEHGRDDRQI QNVKEYLHRA EAQELSECDC CQIKLSTLLL FPCCGGFLCP
ECMDEKSNIC VLCDQDFDVD EFQRLQPGFS LKWLETMIES EQRKPKPSIS NANVQVDPPG
GADPIVRAEI ELPNGVLVRP NMELRRTRRL GDGHECQYDR YTVHGKCILC LSEHSFCNLF
NDNAQCAICF RAAEECSEEE SKSFYLVKKL SELHQQLRNN EQRRPLKIIV FSQFRQALNM
AGNRLLRKFG TACIAEYWGS FRTTELRKFT YDRDCFCMLL GRDGSEGLDL SFVTHIFFLE
EIMDQSLRDQ AIARAWRMGA KGRVRVVTLT AAKTVEETMQ EIESAAQYRF QYSHQTVTRP
IVAAAESNNL EEYATAKTHA LLRSLRLITD YHHFSAEPRT STAEKATCLN NKLPGISKES
DKTVDNPPVK RRKVTFT