Gene PHATRDRAFT_40731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40731 
Symbol 
ID7198529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp257176 
End bp258930 
Gene Length1755 bp 
Protein Length584 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184761 
Protein GI219129154 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.247124 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCCCA GTACTCGCAT CGATAGCGAG CACTCGACAC GAGACCGAGA TCCGTTAGCT 
TCGCCGCTAC CGTCATCTTC AGATGATCAC GGAAGTGCTT TGGACACGGA CGCTTCGCAA
GTCTACAGTC TCTGGCTCGA TGGGGCCGAA GAGCATCTCA GAGCACCAAA TGGCAGCAAC
ATGTCCGAAT CATTTGAGTC GAATCTGGCA ATAATTCAAC AAGCCAGCGG GGCATCCTTC
ATGGGATCCG TCGCGAACTT GTGCAGCGCT ACGCTCGGCG CGGGAGTCTT GGCCTTGCCG
TACGCTTTCT ATCAGGCAGG AATCGTGTTG GGTCTTTCTT TACTGTTGAC GTCGGCTGTT
GCCACAGCTG TTTCCATCAA GCTTTTAGTC CAAGCTTCGG AACACTACCA ACTTTTCACC
TATGAATTAT TGGTCGAAGC CTTGTTTGGC AAGCATTGGC GAGTTTGCGT TGAGGTGTCG
ATTGTTGTTT TTTGCGGGGG ATGTGCAGTA GCATATGTGA TTGCTGTCGG CGATATCCTC
GAACGATCCA ATCTCCTTTG GTATAACAGT CGAGCACTAT CCATGACTGC CGTATGGATG
ACTGCAATGC TGCCTTTGAG TTTGCTGCGA CGTATGCAGT CCCTTCAATT TGCTAGTGGG
GTAGGAATTG CTTCCATCGG GACTCTTGTC TTTGCAGCTT TCATACATTT GCTGGAAGGT
AAAGGAGCCT CCACAAATGC GACAAATTAT ACACTGGCTG AATTTACCTT GCACCGAGCC
TCCAATACGA TGAGTATGCA CAATGATTTT GGCGACTTTT TATGGCCCGC CCATGGGTCC
GTTTCAGTCT TGACAGCCTG CCCTATTGTA CTATTTGCGT TTAGTTGCCA AGTCAACGTC
TGTGCAATAT ACCAGGAGCT CGCTATTCCG CACATCCCCG ACACCAATCG ACACACTTTG
CGACAGGACC GTATGCGCCT CGTCACATTG ACGGCAGTTG CTATTTGCGC GACACTTTAT
TGCAGTATCT CGATTGTGGC ATTGGCCGAC TTTGGCAAGG ATGTGACTCC CAACATTCTT
TCCAGCTACG AAATGCATGG CATTATGCAA GCAGCGGCAG CCTTCATGGG AGTCGCGGTC
ACGTTTGCGT TTCCCCTTAA TGTCTTTCCA GCACGGGTTA CGCTCCAGGA CATTTTCTTT
CCGAAAGTCT TATTGCACCC GCCTGTGAGA AACGAAACCT TGACAGCGGC ATTATTATTG
GACCAAGATG AGGTCACTGA ACCTCGCCTT CCTATGAGTG CCGCTAGAGA TGTTCTCGTT
GATGAAAGCG ACGAGAGAAC GCCACTACAA CCGCAGATAA ATTTTGCAAA TGGCGAAGGA
GGGCTCAGGA ATGAAGATGG TGCTCAAGTT GACGCGATCG AGACGCCGTT GGATGAAGGC
ATAGCGTCTC GACCGGCCGG AATTGAATCT GAATGGAACA TGCGACAGCA CGTTGGATTG
ACGATAGGGA TTGCTGGCTC GGCATTGTGC CTAGCCCTTG TGGTGCCCGA CATTTCCGTC
GTCTTTGGAG TTTTGGGAGG TACGGCTACC AGCATGCTTG GGTTTTGCGT ACCCGGCGCT
CTGGGTGTGC GGCTGGGTCG GGACCTGGAC GATTGGTCCT TGTCAGTGCC TTCGTGGGTA
CTGTTGATTG GAGGGGCTGT GTTTGGAACG GTGACGACAG CTGTAACGGT TTGGGACACT
TTAGAAGCTC TGTAG
 
Protein sequence
MSPSTRIDSE HSTRDRDPLA SPLPSSSDDH GSALDTDASQ VYSLWLDGAE EHLRAPNGSN 
MSESFESNLA IIQQASGASF MGSVANLCSA TLGAGVLALP YAFYQAGIVL GLSLLLTSAV
ATAVSIKLLV QASEHYQLFT YELLVEALFG KHWRVCVEVS IVVFCGGCAV AYVIAVGDIL
ERSNLLWYNS RALSMTAVWM TAMLPLSLLR RMQSLQFASG VGIASIGTLV FAAFIHLLEG
KGASTNATNY TLAEFTLHRA SNTMSMHNDF GDFLWPAHGS VSVLTACPIV LFAFSCQVNV
CAIYQELAIP HIPDTNRHTL RQDRMRLVTL TAVAICATLY CSISIVALAD FGKDVTPNIL
SSYEMHGIMQ AAAAFMGVAV TFAFPLNVFP ARVTLQDIFF PKVLLHPPVR NETLTAALLL
DQDEVTEPRL PMSAARDVLV DESDERTPLQ PQINFANGEG GLRNEDGAQV DAIETPLDEG
IASRPAGIES EWNMRQHVGL TIGIAGSALC LALVVPDISV VFGVLGGTAT SMLGFCVPGA
LGVRLGRDLD DWSLSVPSWV LLIGGAVFGT VTTAVTVWDT LEAL