Gene PHATRDRAFT_42677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42677 
Symbol 
ID7196330 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp763427 
End bp765232 
Gene Length1806 bp 
Protein Length521 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177156 
Protein GI219110809 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0989203 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAACGTACTC GCATCAGTGA ACGAGCACTT CGCTTTATAC CATTGCATCC TACAAGAGCT 
ACGATAGCAC TGTTTTAAAG AGTTGCTTCC AATTAATATG CTTGTGGGAG CTCTGGATGT
CGTTAAATCA TACTTGCCGG ACTGGCCGGA ATGGTAAATA TAACTGTTGC TTCACTGAAA
GGTGCCTGTG TCGCCGGTGT CTTTAATAGC CTGTAATTTT CTTGCAATAG GGCGTTGGAT
ATTCTTTTAC TGATGACTGG CTTACTGATT GCTACCGGCA TCATATCGAC GCTCTACGTT
CCCGACGAAG AGAAAGAAGA CAAGGAATAT GTGCAAAAAC ACCCGGCAGA TGCTGCACAG
GTCTCGAACC GGAAGCCTGC AGCTCACTGG TGTAGATTTA CCGTTGTCGA GCTAAAGCAA
GAGCTGCGCG AGCGTGGACT TCGTGTCAGC GGTCTCAAGC ATGAGCTTGT CGATCGCCTA
GCCGAATTTG AAGCAATGTC CCCGGCGCGA CAAAAAGGAC AGGTCGAAGA GACAAAACTG
CATCATCACG AGCTGCTGAC CGAGTTTCAC GGCTTTCGCA CCATGTACGT GACGGTGTAT
GCTGTCATAA TGCTGGCAGA TTGGATGCAG GGAACGCACA TGTACACGCT ATACATGTCT
TATGGAGTCA ACGTTTCTGC TCTATTTTTG ACTGGGTTTT TGAGCGGAGG TATTTTTGCG
CCCTTTCTTG GTTCTTTCGT AGACAAGTTT GGTCGCAAAC GATCTTGTAT TGTCTACTGT
GTTTTGGAAA TCCTTATAAA TGTCATGGAG GGTTTCGACA ATTTTACAAT TCTTCTGGTG
GGGCGTGTTA TGGGGGGTGT CAGCACGAAC CTCTTGTTCT CGGCCTTTGA AAGTTGGATG
ACAACGGAGC ACAGAAAGCG GGGATACCCC GACGAGTGGC TTTCGCGAAC CTACTCTCAG
TGCTCAATTG TTAATGGGAG CACTGCTGTT ATGGCTGGCA TTGTCGCTCA GGTATTGGAG
GATTTTCTCG GACAAATTGG ACCCTTCCAC GGTGCTGTGG GCCTAACCAC TTTGGCTCTT
TTGCTAATTC TGGGTTGGGA GGAAAATTAT GGCGAGGAAC AAAGAGGAGA TCACGAAAAA
TCGAGTTTGA CACACCAATT TATTGAGGGT TGGAAAACAA CGATTTCTAA TTCGAATGTC
TGGCGCATTG GCTTGACACA GGCGCTCTCC GAGGGAGCCA TGTATACCTT CGTTTTCATG
TGGGTTCCGA CTCTTTTGTC GTTAGATCCA CCTGGCGGTG TACCGACAGG GTGTGTCTTT
TCGGCTCTAA TGATGTCGAT AACAATTGGC GGCCTTTTAT TTCCTCTGCT GCAGGCCGGA
ATCAACGCGT TTGTCCCCAA AGACAGTTCG TCGGAATTGT GCGCATCCTT CGTGTACCTT
CTTGCTAGTG CTAGTATGGC AATTCCGGTT CTGTGCCTGT CCGCCATTGA AACACCCGGA
GGCCTAAATT GCCAGCAAAT GGTCATTGGT AGCTTTCTGA TCGTCGAGTT TTGCGTTGGG
CTGTTCATGC CTGTGGCTGG AACTCTTCGA TCGAAGTATG TTCCAGATGC CCTGCAAGGT
GCCATTCTCA ATATTTTCCG TCTTCCTTTG AACGCTGTTG TTGTTTCGGG CACTTACGCC
ACAAATGTTT TAGAAGCAAG TATTGTCTTC AAGCTGGTCA GCGCCTGCTT CTTTGCGGCT
GCTATTATAC AGGCTACGAT GATCACATCA ATACCAAAGC CCCTGAGCAA ATCAAAGACA
GAATAG
 
Protein sequence
MLVGALDVVK SYLPDWPEWA LDILLLMTGL LIATGIISTL YVPDEEKEDK EYVQKHPADA 
AQVSNRKPAA HWCRFTVVEL KQELRERGLR VSGLKHELVD RLAEFEAMSP ARQKGQVEET
KLHHHELLTE FHGFRTMYVT VYAVIMLADW MQGTHMYTLY MSYGVNVSAL FLTGFLSGGI
FAPFLGSFVD KFGRKRSCIV YCVLEILINV MEGFDNFTIL LVGRVMGGVS TNLLFSAFES
WMTTEHRKRG YPDEWLSRTY SQCSIVNGST AVMAGIVAQV LEDFLGQIGP FHGAVGLTTL
ALLLILGWEE NYGEEQRGDH EKSSLTHQFI EGWKTTISNS NVWRIGLTQA LSEGAMYTFV
FMWVPTLLSL DPPGGVPTGC VFSALMMSIT IGGLLFPLLQ AGINAFVPKD SSSELCASFV
YLLASASMAI PVLCLSAIET PGGLNCQQMV IGSFLIVEFC VGLFMPVAGT LRSKYVPDAL
QGAILNIFRL PLNAVVVSGT YATNATMITS IPKPLSKSKT E