Gene PHATRDRAFT_42841 details


Gene Information

Locus tag: PHATRDRAFT_42841
Symbol:
ID: 7196438
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1286304
End bp: 1289597
Gene Length: 3294 bp
Protein Length: 915 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002177259
Protein GI: 219111015
COG category:
COG ID:
TIGRFAM ID:
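
The listed gene length follows from the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption, though it matches the values above. The minimal Python sketch below checks the arithmetic; the function name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates and protein length taken from the record above.
gene_length = feature_length(1286304, 1289597)
print(gene_length)  # 3294, matching the listed Gene Length

# 915 codons plus one stop codon account for 2748 bp of the span.
# Assumption: the span contains only coding exons and introns, so the
# remainder would be intronic, in line with "Is gene spliced: Yes".
coding_bp = (915 + 1) * 3
print(gene_length - coding_bp)  # 546
```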


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCGAT TCAGTTTAAC AAATGTAAAC TCCGATGAAG AAGAAAGTAA GTCTAGGGAT 
TTCGAGTCTA CGGTACTAGC TTTAAATGGT GTGGACGTCG AACTCGAGTA CAGCAGCGAC
GACGACGATG AAGAAGACGA GCCTCCGCAT CCACTGGGGT TTCGGTCGAG CACTCGGATT
TCTCTCCATA GTCCGACAGA AACAATGGAG CAATACAAAC GTAGGTGGGG TTACTTCTAC
CCGATAGACG ATGCCCATGA CGAGAACTTC TTTGACACCC CAATAGATTC CAAAATCCAA
TCCTTTCCAC CGTTATCGAC CTATATGAAC CCCAACACAG TATCCACGAG AGCCCTTATT
GGCTACGCGA ATGAAAGGAA ACCAACAGAT GACTGTGTGG ATCAAGTAAT TCAGCTGCTG
CAGGCCGCCG CTTTGGTTGA CGAACGAGAC TTGACGACAC CGCTTTTGCT GGCTACCAAT
ACACATCTCG ATGCCACCAA TCGGTTAACG CAACAGAAAT TGATCGAAAT TCAGCATGAA
ATTGACGTAG AACGCAGGCG GATGGAGCGT GACCATTTGG AAGCAGCCGA AGCCCTTCAA
TTGATTCTGC ATCGAAACCA AGAAACGGCT GAATTGATAA GTGCGGAACA ACGTCGGCTC
GACAGCGTCG CACAAGAAGG AGAGGATATC CGAGCTCGAG CGGACAAAGA AAAACAAACA
GCGTTGAAAG ATGAAGAGCG ACAGAAAGAA AAGGACGCTC AGGAAAATGC CAAGCGAGAG
AATCTAGATT CTCAGAGGGC TGCGGAAAAA GAAGAGGCTG TAAGATCCTC GAAGTATGAA
TTTATCGCCA AAGCCAAGAA GCTTGTTGCG CAGCTTGTGC TAATTCGGGC TTCAGTAGAA
TCATTCGAAA AATCAAAAGC TGTTGGAAAG CGTCGATTAC AAATGAAGAA AATAGTCAAC
GGCAAGGTCA ACACTCTTTC TGAAAACACG CAAAAGATAA GAGAGGTCGC AAATAATGTA
TCACAAGCTA TTGAAAAAGC GCGCGACGAA GACAAACAAG CCAAGGAGCA GGGCGAGGAG
GGGAACAAAG GCTTCCTCCC AGAAATGGCT AGAGGAAAAC GATACTTTGT TGATTTGCTC
TCAAGCAAAG TCATTGTCCG GGTCCAAGCC GAAGGCTTCA ACGGGTGAGT TTAGCATCTA
TCAATGGACA CTTATTTGTG TGCATGCATA CACTCACATA TTTTTACTAG TCAACGCGGT
GATGGATTCC CTCTTGCAAA TATGTTGGCT CAGGTTTCCA CTGATCACAA AGAACTCGGT
CCAAATCTTG CTGCCCATAT ATACACCGTC TGCCCCACAG CTATACCGAG CTTGCCGGAT
CCTGCACCAG ATGCAAGCGA GGATGATCTG ATGCGAAGCC TTGGCATGTT GCAACACGCG
GATGGCAACT TTGAAAGCTT TGAGCGCTTT TTGGGCAGAA CTGAGGTAGG TGCTGCTGCG
TCAACAACAA TTTCTTCAAT TTTTGTCGAA GAAGTGATCG AGATGCTGTA TGCACATGCC
TAACATATTT CTCATAACAG GGCATAATTT CAATGGTGGC AAACATCATG TCTTCAAGTC
CTGCAAATCA TACGCTGCTT GGCGGCCATG AAGGGGCAGT CAAGTGGATG ACGCGATTTC
TTTCCTTGCT ACCAAGCAGT ACCGACACAG CCCTCCCATT GATCGTTGCA CCTGTTCTGG
ATGCATTTCT TACAGGTGCA GGTCACATGC TCGCAAATAT CCACGCTGAA GAATTCAAGC
TCCTATTGAA AGCTATCGAC GAGAACGTCT TGCCTAGATT GGATGACGGG CCCACTGGGA
AGCCTTCCGC TATGCGGTTA GAGAAGACTA TGAGTGGAGG ATACGAAAAG TTTCAAAGTA
CACTTCCATC TCGTGCTTTG GCGGAGTTTT ACAATGGATC AAGCTTCTTG CGAAGTCACG
GTAGCACATC AACACCCTCA CCTTTCGGTC ATTCGGTGTT TGCCGGAAAT GCGGGGACTA
CGGGCGTCCC CCCTTTCGGC CAAAGCTCAA CGAATGTAAG TTCTGGCCCT AGTTTTGGGG
GACCTAGCAT TACTGCGAAG AAACAATCAC CTTTTGGTCA GAGCTTGGTA GAAGCGGGAA
CAAGCTCGGA TGCATTTGGA AAGCCACCTT TGAATGCATC CCCTTTTACT GGTAGTGGGA
CAACATCTCA TACTACATCA GGTATCTCAA AAATGGACCA GTCTGATTCA TTCCAGCCAA
AACAGACGCA AGCATTGCCG TTTGGCGTAG CTTCAAACAC TTCAACTTTT GGACAGGTTG
CTTCTTCGTC GCCAGCTCTA CTTCAAAAGT CATCCCCCTT TGGAAACACT TTTCCAAGTC
CGTCCCCGTT TGGAACTGCT GCCTCCGTCC CCTTTGGAAA GCCGAACGCA GTCCCTTTAT
CCGTGAGTGC ACCCAATCCA AGTACTTCTC CGTTTGGAAA TTCTTTACAA ACCGTTGTGC
CCTTTGGTGG CCAAAACATG AGCCTTCCTT CGTCTGGTTC TTCTATTCCA AACACTTTTC
CATTCGGAAA TCCCGCACAA GCCGCTTCGC CGTTTGGAAA GCCAACCCCA ATGCTTTCGT
CATATGGTAT TTCCCATCCA AATCCTTCGC CATTTAAAAA TCCATCAAGT GGCTCGTCTC
CATTTGGCTC GGTTGTTGCT AGCTCATCTC CGTTTGGAGG GACCAGCAAG GTAATTTCCA
CATTCAATAG TGGCTCGTCT TCTCCGTTTG GCAGCCAAAC GGGTTCTTCG TCTCCTTTTC
CATCAGTTGG AGATTTTCCA CCAAATTCTA ATGGAAGACC CAGCCACACA ACCAAAACGC
AGCCTTGTAA ATTTTTTGCC GAAGGTCGCT GCCGCTTCGG AGACAATTGC AGGTTCTCGC
ACGAACCTCC TAACAGTACC AGAATTCCAT CATCTAGTCC CTTTATGAAC ACAACATTTC
GTCAATGACC AACATCCGCA CACAAATCTG GGTTGATAGC AATGCTTTAT CCTCGGTCTA
TCGATAAGAT TTGGATCGCA AGTGCCGTGG GGGAGCCCCA GTTCGCTTTG GGCAGTCTCT
GATGAAGGGC TTGGAGACAG TAAGTTTTCA TTAGATGGTA TGAGACATGA GAAGCTGTTC
GAATTGTGAA CACGTTTAAA ATCGTCTAGC ACTTCGAATT AAAAGACAAG GCTGTTCTTA
CTAATGGTAG CATCTTCGAG CTAAAAGATA CAATTAAGAT TATCGACTAT CTAG
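
The gene sequence above is printed in blocks of ten bases separated by spaces. As a rough sketch of how a GC percentage such as the one listed under Gene Information could be recomputed from this layout, the Python function below strips the whitespace and counts G and C bases. The name `gc_percent` is hypothetical, and the database may compute or round its figure differently.

```python
def gc_percent(blocked_sequence: str) -> float:
    """GC percentage of a nucleotide sequence printed in spaced blocks."""
    # Remove the spaces and line breaks used for display, normalise case.
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input in the same blocked layout (not taken from the record).
print(gc_percent("GGCC AATT"))  # 50.0
```

Pasting the full gene sequence block into `gc_percent` as a single string gives a value that can be compared with the listed GC content.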
 
Protein sequence
MARFSLTNVN SDEEETLNGV DVELEYSSDD DDEEDEPPHP LGFRSSTRIS LHSPTETMEQ 
YKRRWGYFYP IDDAHDENFF DTPIDSKIQS FPPLSTYMNP NTVSTRALIG YANERKPTDD
CVDQVIQLLQ AAALVDERDL TTPLLLATNT HLDATNRLTQ QKLIEIQHEI DVERRRMERD
HLEAAEALQL ILHRNQETAE LISAEQRRLD SVAQEGEDIR ARADKEKQTA LKDEERQKEK
DAQENAKREN LDSQRAAEKE EAVRSSKYEF IAKAKKLVAQ LVLIRASVES FEKSKAVGKR
RLQMKKIVNG KVNTLSENTQ KIREVANNVS QAIEKARDED KQAKEQGEEG NKGFLPEMAR
GKRYFVDLLS SKVIVRVQAE GFNGQRGDGF PLANMLAQVS TDHKELGPNL AAHIYTVCPT
AIPSLPDPAP DASEDDLMRS LGMLQHADGN FESFERFLGR TEGIISMVAN IMSSSPANHT
LLGGHEGAVK WMTRFLSLLP SSTDTALPLI VAPVLDAFLT GAGHMLANIH AEEFKLLLKA
IDENVLPRLD DGPTGKPSAM RLEKTMSGGY EKFQSTLPSR ALAEFYNGSS FLRSHGSTST
PSPFGHSVFA GNAGTTGVPP FGQSSTNSLV EAGTSSDAFG KPPLNASPFT GSGTTSHTTS
GISKMDQSDS FQPKQTQALP FGVASNTSTF GQVASSSPAL LQKSSPFGNT FPSPSPFGTA
ASVPFGKPNA VPLSVSAPNP STSPFGNSLQ TVVPFGGQNM SLPSSGSSIP NTFPFGNPAQ
AASPFGKPTP MLSSYGISHP NPSPFKNPSS GSSPFGSVVA SSSPFGGTSK VISTFNSGSS
SPFGSQTGSS SPFPSVGDFP PNSNGRPSHT TKTQPCKFFA EGRCRFGDNC RFSHEPPNST
RIPSSSPFMN TTFRQ