Gene PHATRDRAFT_41388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41388 
Symbol 
ID7199299 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp16152 
End bp19796 
Gene Length3645 bp 
Protein Length1032 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185365 
Protein GI219130423 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000381991 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGATG CTGATGCTCC CAACGCTCCC GAAGCTGGTC TTGCTCCGGC TGTTGCTGCC 
AATGCCCCGG GTATTGTTGG TGACGCCCAA GCTGCCGCTT ACGCTGCGTT TGCCGCTGCT
GAAGCTCAAA TGGCAAACGC TCTCGCTGTT CTCGCGGGGG AAGCACCCCC TGATGCTCAA
CCCTTTCCTC CACCTCCGGG AGTGGCCGAC ATTCCAGTTG GCCTCCCTCC GGCTGTGGTG
GATGCTCCTC TCATTGATCC GTATGACAGT CTTCTCATGA GGGCTGGCAT GAACTACGGA
ACTCGCAAGG CTTTTTCTAA GGAGGGCTAT GGTTTGATGT CTGACTTGGT TACTCTTAAC
CAGAAGCAAC TTGAATCCCT CATTGATATG ATGAACAAGA AGCATTGCGG TAAATCTTTT
CAAGGCGCAA TTCCGTTGGG CCTGAACCTT GTGGCTGAAG ACCTTGAGAT TGATATTGGC
CACAAAACCA AGACCACTCT CAAGGTTATT CTCCATTGGG CGGACCTCCA GAAAAGTCTT
GGATTAGACG TGAATGCCGA GGATTATACC AATGTTGTTG GTCAGCTTGC CCGCGAACGT
ATGGATGAAG AGGAGAAGAT TCTTGAGGCG GCCAAGAAGC TTACGCCTTC TAAGCCAACA
ACCCTCAAGG ATATGACCAA GTGGCGTTCC TTCTTTGAAA ACTGGAACTC ATACATGAGT
CAGTGTCGCG GTGCTGCGGC TATCCCTCTT TCGTACATTT ACCGTACCAA CAAGCAGCCC
AAGACCGCTT TGGTCGGAAC CTATGTGAAT ATGGATGCCT ATTTGGTTGC CCAGACAGTC
CTGTCTGGTT CCAACTTTGA GATCAATAAC CAACGGGTTT TTGACAAATT CAAGGAAGCA
ATCATTACAA CCGGACCTGG ATGGTCTTTC ATCAAGATGT ACAACCAAAG CAAGGATGGT
TGTGCTGCCA TTTTGAAATT AAAGGAACAG GCGGAAGGAA CATTAAACGA GTCAGTTCGC
CGTGATGATG CCATCAAGAT CCTGTCAACT ACAACATACA ATGGTCCGAG TTGTAACTCG
AATATTGATA TGCTGTTGCA GAAATTTCAG TATGCCATAT CGGAATTGGT CGAAATTGAC
GGAGTTGCGT TGCCGGATGG GCAGCTTGTG ACTTATTTGG TCCAGGCATT GAAGGACCCA
AGTCTGAGTT ATGTTCGTGA CACAATTCGC ACCAATGCCA CTTATCGGAA CAGTTTTCCG
GAAGCGCAGC TTTTTGTGAA GACTTTTGTG TCTTTGTCCA CAAGCAAATC CGAAAACACG
CCTCGACAGG TCAATGATGT GCAAACATCA GGTAGTGGGG CCTCCGGTGG GAGTAAGAAA
GGAGGTACCG GGAAAGGAGC CAGCAAGCCG ACTCCCTTCA AGGGTGCAGT CACGGCTCGC
AGTTATACTC CGGGAGAATG GAAAAGTTTG TCCAAGGACC AACAGGAAAA AGTGCGATCG
CTGCGTAATA AAAAGAAGCA AGGAGGGAAA CCCGAGGAAT CAGAGAGGAG TGTTGACAGT
GTAGCACGGG GTGAGCCTGT GGACACTAAG GAAGTCCATA CCAGCAGTGA AATGGAACCG
ACTTCAGATG CGGCTGGCCT GCAATTTGGC CGTGGTGCGT ATAAGAAATC GGTCGGATTC
ACTGCGGACA CCGCTTCTCC TTCTGAAAAC GGAACGAAGA AGCAGAAAAC GCATCATGAT
GCGTGAAACG CGGCACCCGA TGCCAGTGTT TCGGGGACTA AGCAATGCAT TTTACCAGAT
CGAGTGATAT TGAGCCTCAC CTCTACACGC AGCATTTGTG ATCTCAACGC ATGCACTCAT
CTTGGTGAGG GCCGCTGCAA GTTGGATTCA CATGCAGACA CATGCGTGGC TGGGGCAAAC
ACTGTCTTGA TTGGTGAATC GCAGAAGTCC GTAACTATGC GGCCTTTCTC TGGTGAATAT
TCTGCGCTGA AGAATATCCC CATTGGAACG GTTGCCACAG CTTACACAGT ACCAGAAGAC
GGGAGAGTGG TGCTTCTTAT TATTAATCAG GCCCTATTCT TTGGGGACAG ATTGAAAAAC
ACCCTATTGA CCCCCAACCA GATGCGAGAC TTTGGCATTG AAGTTGACAA TGCCCCTCGG
CAGTATGTCG CCAACTCCAA GCACTCTTTG TATGTTCCTG ACTCTCCACT TTGGATTCTG
CTGCAACTGC GCGGAATATT CTCGTTTTTG GAGTCGCGGA AGCCCACGCA ACAGGAACTC
GACGAGTGTG AGCATATCAT ACTCACCTCT GATGTGCCGT GGGAGCCTTG CTCCACTGAC
TTTGCACGTC GGGAAGAACA GGCCGTTAAG CGTGACCGGA GTGTATCATT GGTTGACACA
AGGGGACTTT CCACTGGCCA TGCAACCTTC TCAGCACACC AATATGGTAT CCGTACTATT
GCGGCCTTGC AGAGAGTACT TGAGACTTTT CGTTCCTTGA CAGAGGTTGA ATTGTGCGAG
ACAATCTAAC GGACCGCCTT ATTGCCTGTG TTAATGTTGC GTCGGATGAC TACTGTGGAG
ATGGGTTGGA TGGCAGGGCT GACTTGGATG TGTACCCATT CCGACAACTT CACCAGTGTT
GTCTCAAATA TGACATCAAG CGAAAGATGG TCAGCGTTGA CGCTTGAGGT TTTGTCGAGG
CGTTGGAATA TTGGCCTGGA ATCGGCCAAG CGGACTCTAC AAGCGATGAC GCAGAAAGGT
GTGCGGACTG TGATGCACCC CTTGACCCGA CGGTATCGTA CTCGCCAATC GCATTTACGA
TTTCCTACCA TTTGGACCAA GGTTTACACC GACACCATGT TTTTGTCTGT GGTCTCCATC
CGCCAGTACA AGTGTGCCCA GGTGTTTACA ACAAACACGG CCTATTTGCG CATTTACCCT
CTGCAAACCA AGGAGCAAGC TCCTGATGCA CTAGATACAT GACATTGGGG TAATGAGTGA
TCTAGTTTAT GACGGATCTA AAGAGCAGGG AGGTGGCAAA CATTGGAGAG AGATTGAGCA
GCGTCACCAC ATACATTGCC ATGTAACGGA GCCACACAGC CAGTGGCAGA ATCGATTCGT
GAAATTAAGA AGGCTGTTCG GCACCGACTG CAGGTCTCTC GGGCACCAAG GCGACTTTGG
TGTTTTTGTT GTGAATGGGT TTCGGCAATC CGTCAACTGA CCGCCCACAA CATTCCTGCA
TTAAACGGTT GTGTTGCTAC AGAGCTTTTG GAAGGGGACA CCCCTGATAT TTCTGAATAT
GCGCAATTTG ACTGGTATGA GCCTGTCTGG TTCATTGACC CAACTTCTGC TTTCCCTGAA
ATGAAGAAGA AATTGGGCCA ATGGGTCGGA GTTGCATCAG ATGTGGGACA GGCAATGACT
TTTTGGATCC TTCCAAAGTC ATGCATCCCA ATGGCACGGT CCTCTGTTGC TCGCGTCCTC
CCAGACGTAG GCGCTACTGA TGAATTTAAG GCTGACCTTG CTGAACTCAA TCTAGCCATT
GAAAAGAGAA TTGGAAACAG CAAAACTGCA GAAGAAGATC AGGTCATTGA CGGTCAACTC
GCAAATCTAG TTTCGGGTCC GACTGATGAT CTGTTTGAAG GGTAA
 
Protein sequence
MADADAPNAP EAGLAPAVAA NAPGIVGDAQ AAAYAAFAAA EAQMANALAV LAGEAPPDAQ 
PFPPPPGVAD IPVGLPPAVV DAPLIDPYDS LLMRAGMNYG TRKAFSKEGY GLMSDLVTLN
QKQLESLIDM MNKKHCGKSF QGAIPLGLNL VAEDLEIDIG HKTKTTLKVI LHWADLQKSL
GLDVNAEDYT NVVGQLARER MDEEEKILEA AKKLTPSKPT TLKDMTKWRS FFENWNSYMS
QCRGAAAIPL SYIYRTNKQP KTALVGTYVN MDAYLVAQTV LSGSNFEINN QRVFDKFKEA
IITTGPGWSF IKMYNQSKDG CAAILKLKEQ AEGTLNESVR RDDAIKILST TTYNGPSCNS
NIDMLLQKFQ YAISELVEID GVALPDGQLV TYLVQALKDP SLSYVRDTIR TNATYRNSFP
EAQLFVKTFV SLSTSKSENT PRQVNDVQTS GSGASGGSKK GGTGKGASKP TPFKGAVTAR
SYTPGEWKSL SKDQQEKVRS LRNKKKQGGK PEESERSVDS VARGEPVDTK EVHTSSEMEP
TSDAAGLQFG RDTCVAGANT VLIGESQKSV TMRPFSGEYS ALKNIPIGTV ATAYTVPEDG
RVVLLIINQA LFFGDRLKNT LLTPNQMRDF GIEVDNAPRQ YVANSKHSLY VPDSPLWILL
QLRGIFSFLE SRKPTQQELD ECEHIILTSD VPWEPCSTDF ARREEQAVKR DRSVSLVDTR
GLSTGHATFS AHQYGIRTIA ALQRMGWMAG LTWMCTHSDN FTSVVSNMTS SERWSALTLE
VLSRRWNIGL ESAKRTLQAM TQKGVRTVMH PLTRRYRTRQ SHLRFPTIWT KVYTDTMFLS
VVSIRQYKCA QPVAESIREI KKAVRHRLQV SRAPRRLWCF CCEWVSAIRQ LTAHNIPALN
GCVATELLEG DTPDISEYAQ FDWYEPVWFI DPTSAFPEMK KKLGQWVGVA SDVGQAMTFW
ILPKSCIPMA RSSVARVLPD VGATDEFKAD LAELNLAIEK RIGNSKTAEE DQVIDGQLAN
LVSGPTDDLF EG