Gene PHATRDRAFT_20066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_20066 
Symbol 
ID7200391 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp738996 
End bp740518 
Gene Length1523 bp 
Protein Length347 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179705 
Protein GI219117835 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGTATGCGAC ACCCGTTTCG CGGCTTGCGG GAAAGGTTTG AAACTTGGGC CCTCGACCAT 
CGGAAAAGCT ACCACACGGA GATCGAGAAG CAGAAGCGGT TTGAAATTTG GGCCGAAAAT
CATCGCCGCA CATTAGAAAA AAACGAACGC CATGGCCCGT GCCGTTTAAC TGAACAACCA
GTCTTTGGTT CCAATCGCTT TCAAGACTTG ACGGACGAGG AGTTTCAATC CAGCTACCTC
ACCGGATACT CCAGCTCGAA TGCTAGGCGT CTTTCGTTTT CCAAAGATAG TGGAGTGCTT
GACCCATCAA AAAATATGAA ACGCCATCCC GAAGTACACC GACGTGTTGC TATGCAGCGC
GAAGAAACAG GTAATATGGC ACGGGGATCC AAGTGGCAGA ATTGCGAATG GTGGAACGTG
TCATGTGGTC TACGCTTTAT CTTTGAGAAC TACTTTTACG GATTCGGTCA CACAATGGAA
CCTAAATACG ACGAAAGCTC GTACCCGAAA GGTAATTCAA ATGAGATCAC CCTGTGCTTT
TCCAATTCTC GTTATCTAAT ATACCACGTC TCCTTCTGAA GCTGTGGATT GGCGCGGTAT
TGGTGCGGTC ACTAGTGTGC ATTCGCAAGG AGACTGTGGT GCCTGCTGGG CCATTACGGC
TGTAGAAACA GTAGAGTCAG CTGTCTTTTT GGCTACTGGA ACTCTATATG ATTTATCGGA
AGCAGAAGTC ACCCTTTGTC AAGAGAATTG TGACATGTGC TACGGTGGAT GGCCGCAAGA
CGCATTTGAC TATATCATGG ATCATGATGG CCTGCCGTTG GAAAGTGATT TGTCGTACAA
CGGAAGCCTC CTGCTTAAAC TATCACAAGC CAAGGTTGGT GAGAGCGACG AAATGAAGTA
AGTTGGTAAA GACGAGGGCA TAAACAGCGA CTGGTATTCG TTCTTACCTA CTGCTTATTT
CTTTGTGCAG CGAAAGTTCA GTTGAATCAT ACATGAACGA TTGGTGCCCA GCTAACGGGT
CCAATACATT GAAGCGGTAC GGACAGATTG AAGGCTATGG TTATGCCACA TCTCGATGTG
TTTGTTACAC GGATGGATCT GGTTGCGACT GCGATTCACA GGATGAAGGT GTGGCGGTCA
GAAATCTCGC TACCTATGGT CCAGCAGTCG TCTGCGTAGA TGCTTCGACT TGGAAAGATT
ACAGCGGTGG CATCATAACT TCCGAATCTG GTTGCAGTCA AAAGTTTCTA GACGTAAATC
ATTGCGTTCA AGCTGTCGGC TATGCCTATA CCTCTTCCGG AGGGGGCAGC GAAAGCGAGA
ATGGAAGCCA CGATAGTGGT AGTCAAGACG ACTCCGGATC CCGTCAAGGC TACTGGATTG
TTCGGAATCA ATGGAGCTCG TACTGGGGTA TGTCGGGCTA TGCATGGGTT GCCATGGGAG
AAAACACATG TGGGATCCTA AACGATTTCG TTCAAGCTTA CGCATAAAAG CCTAACTTTG
ATTAACATTC GAAAGCAGAT TGC
 
Protein sequence
MRHPFRGLRE RFETWALDHR KSYHTEIEKQ KRFEIWAENH RRTLEKNERH GPCRLTEQPV 
FGSNRFQDLT DEEFQSSYLT GYSSSNARRL SFSKDSGVLD PSKNMKRHPE VHRPVDWRGI
GAVTSVHSQG DCGACWAITA VETVESAVFL ATGTLYDLSE AEVTLCQENC DMCYGGWPQD
AFDYIMDHDG LPLESDLSYN GSLLLKLSQA KRYGQIEGYG YATSRCDEGV AVRNLATYGP
AVVCVDASTW KDYSGGIITS ESGCSQKFLD VNHCVQAVGY AYTSSGGGSE SENGSHDSGS
QDDSGSRQGY WIVRNQWSSY WGMSGYAWVA MGENTCGILN DFVQAYA