Gene PHATRDRAFT_42556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42556 
Symbol 
ID7196260 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp409983 
End bp412960 
Gene Length2978 bp 
Protein Length964 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177084 
Protein GI219110665 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.162025 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACTTAC CGGAGCGCAA CAAAGAAGAG ACGGAGCGGA CGCCGCTCGT GTCGAGTCGG 
AACACAGTGG CCAGCGGTAC TACTCCCCGA TCGAACGTTC CCCAACACGG CCGATCGCGG
TCCAACTCAT TTCGCGAACA TCACGGTGTC ATGCCTCCGA TTGCATCGAA TTACCCGTTG
CCACAGAACA AACACCGAGT AACGGCTTCC CTGAATGCGG AAGGATTCGC CCCGAGAGCG
GTACCGTCTT ATACGCCCGG AGTTCCCGTC ACCAACGCCA GTTCTTCCGC CTATACGCTG
GGACGTCAAC CCAGTCTTCC ACTACCCCCG CGCCAACGAC GTCAACCCAG CCAGGATTCT
ACCAATGGAC CCAAAAAGGG AGGTTTCTTT TACTACATTG TCTACGCCAT GGTCAACGTC
ATTATCAGTG TCCCGGGTTT GTATGGCTAC GCCGCCGTCA TTTTCAATCA CGAAGCCTAT
AATGATCATA TGAATGCTCT CAGCAAATTG GTAATATTTT CTTCGCTCAT GCATCAGCTG
GGATTTTGCT TGTTCTCGTC CCTGCCGTTT GCGATCGGTA CCGTACAAGA CGCCGGGCTC
ATCTTTCTCA GTGCCATGAG CAACATCGTG GCCAAAGAAA TTCTGGCAAA CGGGGGAACA
GTGGAGGAAG TTGTGTCCAC GACCTTGGTT ATTCTGGCAA TGAGCACAGC ATGTCTAGGA
CTCGCTCTGG TGGCCATGGG CAAATTCCGA TTGGCCGATA TTGTTTCCTA CTTGCCCATG
CCCGTCGTAG GGGGCTATTT GGCCTTTATA GGCTATTTTT GCCTCCAAGC CGGGGTTGCC
TTGTGTATTT CGGAACCCAT GGTCGGGCTG GCGGATTGGC GATATTTATT GGAGTCGCAA
AATGTGCTTC TGGCTGCACC TGGGCTGGGT GCGGGGCTAG CGCTGACGTA CATTTCTCGC
AAGGCCGAAA GCGATGCCGC CTTGCCAGCG GCAATGATTG TCATTCCGGT TTTGTTTTAC
GCTCTCATCT TTGGGACGGG TATGGGCATT GAAGGTGCTC GCGAAGATGG GTGGGTTGGC
CAAACAGCAC CACCCGTGCC AGTGCAGGAT CTTTTTCATT TGGTGGACTT TAAGCTTGTT
CATTGGACCC TGATCTCCAA GTGTTTGGCA ACATGGTTGG GCATGGTTTT TGTCGTTTCA
TTTGCGTCTT GCCTAGATGT GGCGGCTATT AGCATGGATA TGGGAGAAGC TTTGGATACA
AATCGAGAAC TGGCCACGGT TGGGATTTGC AACGTCATGT CAGGCTTGAC GTTTGGCTTC
ACGGGCTCGT ACATATTTTC ACAAACAATC TTTACCTATC GGACTGGCGT ACACTCGAAA
TGGATCGGCG TCATTATCAT GATTGTTTTT CTGGCCGTTG TTCTCTCAAC CGTGAACATG
TTACAGGTGG CACCTTTATT CTTCCTTGGA TCCACACTCA TTTTCATTGG ATACGATCTG
CTTTACGAGT GGCTCTTTGA AATTCGACAC AAGATCTTTT TGAGCGAATA CGTTGTCTTA
TGGCTGACTT TTTTAGCAAT TCAGGTCGTC GGTATCAATG CCGGTATTGT TTTTGGTGTA
GTTGTGGCCA TGGTCGATCA TGTCATGACG ACGGCACGCG TTTCTGCTCT CAATCGAGTA
CCGAAGCGAT CGCGTGCTGT TTGGTCTCCA GAACATTGGA AGATTTTACA GACGCACGGA
TACCATTCAC AACATCCCAA AATTGTAACC TATGAGATTA TCGGCTCGGT CTTTTTTGGA
ACAGGCCAGC AGCTGTTATC GACTATTTCG GAAGAGATTG GCATAGATGC AACATTGGAA
GAGGTCACTG AAGAAGCTGC TATTATGAGC CCTCATCGAG CTGGATATCT GATGACAAAG
TCTCCAGGAT CTGGTGGAGC TACAAAGAAA CCAAAAAGTC CCCTTATGAG ACCTCGTCCG
CATTTCTTAG TTCTCGACCT TGCTCAAATG CCGAACCTCG ACGCCTCCGC CGCTCGAGGA
TGCTTTCTTC AGTTGGCAAA GATGTGCTCG AAACGACACA TTCTTGTCTG TGCTGCTGGG
CTGTGCCCAC GTGTGGACTG GATGCTCCGG GCTCACGATG TCGCATACGA TGAAATTGAA
GGAGAAAGAA TCAAACAAGA CATGGAAGGT GGTATTTTGC CAACTGGAAC GTGCGACAAG
ATACTGCCGT TCCTAACTAT TTATGAAGCT CTTGAGTTTT GTGAAAGTCA GTTGATCCAG
CAGTTGGATC GCCTGAATCG GTCGCCATCC TTTATCGGCC TCAAGGACAT TGCTCCATCG
ACAGTGCGCA GAAAAGGGAA AGCAACTCTT GCAGAGGTCT TCTCTTTCAT TTTGGGCTTG
AGGGAAGAGG ACAAAAAGCT TCTTGATAGT CTTTCTGACG AGACTTACCA TCAGGAGATG
GAATACAATG CAGGGGACTG TATGTGAGTA TGATTTCAGT TGACTGCACA ATTCTTACAT
GGTCCCTTGC TTTCTCACTC ACTCTCTCTC CATCTATTTA CCTTAGTTTC CCCAAAGACA
CTCACTCTGA CTCATTTGGA GTTGTGCTCA AAGGTGCTGT CGCAAACGTT CGAGAAGAAC
TCAGCTCGCA TTTGACAACT CACATCGTAT CTGGAGCAGG AAAAGTGTCC TTGACAGGTA
CCGGAAGGAG TACTTCAAAT CTCATGGATC AAGGTGATAT TGGACATGTT CGTTCCTTCT
TGTCGGTCGG AGGAATTTTT GGGTTCGTTG ACTTCCTTTT GGAGCATCAC CGAAGTTTTC
GTAGCGTCGC ATCTCGCGAC AAGACGGTTG TTGCGAAGAT AACACGAGCA GGGCTGGATC
GACTGCAAGA AGAGCACCCT GAGGTTGTGC GAATTGTACA GAGCGTCCTG CTCCAGGCTA
GCGCCATGGA GCTTTCGAAC TGCACGTGCA GTGACTAA
 
Protein sequence
MYLPERNKEE TERTPLVSSR NTVASGTTPR SNVPQHGRSR SNSFREHHGV MPPIASNYPL 
PQNKHRVTAS LNAEGFAPRA VPSYTPGVPV TNASSSAYTL GRQPSLPLPP RQRRQPSQDS
TNGPKKGGFF YYIVYAMVNV IISVPGLYGY AAVIFNHEAY NDHMNALSKL VIFSSLMHQL
GFCLFSSLPF AIGTVQDAGL IFLSAMSNIV AKEILANGGT VEEVVSTTLV ILAMSTACLG
LALVAMGKFR LADIVSYLPM PVVGGYLAFI GYFCLQAGVA LCISEPMVGL ADWRYLLESQ
NVLLAAPGLG AGLALTYISR KAESDAALPA AMIVIPVLFY ALIFGTGMGI EGAREDGWVG
QTAPPVPVQD LFHLVDFKLV HWTLISKCLA TWLGMVFVVS FASCLDVAAI SMDMGEALDT
NRELATVGIC NVMSGLTFGF TGSYIFSQTI FTYRTGVHSK WIGVIIMIVF LAVVLSTVNM
LQVAPLFFLG STLIFIGYDL LYEWLFEIRH KIFLSEYVVL WLTFLAIQVV GINAGIVFGV
VVAMVDHVMT TARVSALNRV PKRSRAVWSP EHWKILQTHG YHSQHPKIVT YEIIGSVFFG
TGQQLLSTIS EEIGIDATLE EVTEEAAIMS PHRAGYLMTK SPGSGGATKK PKSPLMRPRP
HFLVLDLAQM PNLDASAARG CFLQLAKMCS KRHILVCAAG LCPRVDWMLR AHDVAYDEIE
GERIKQDMEG GILPTGTCDK ILPFLTIYEA LEFCESQLIQ QLDRLNRSPS FIGLKDIAPS
TVRRKGKATL AEVFSFILGL REEDKKLLDS LSDETYHQEM EYNAGDCIFP KDTHSDSFGV
VLKGAVANVR EELSSHLTTH IVSGAGKVSL TGTGRSTSNL MDQGDIGHVR SFLSVGGIFG
FVDFLLEHHR SFRSVASRDK TVVAKITRAG LDRLQEEHPE VVRIVQSVLL QASAMELSNC
TCSD