Gene PHATRDRAFT_19122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_19122 
Symbol 
ID7198098 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp1007017 
End bp1008212 
Gene Length1196 bp 
Protein Length322 aa 
Translation table 
GC content45% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178344 
Protein GI219115097 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.398188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCGTGCAAGA CTCTCCATTT GCGTAAATTA GATCACAAGA CTACATTTTG GATTAACAGC 
TGCAATTTTT TAGCATGCCG AAGGCATTGA AAAAGCGGGG TCAGACCCAT ATTGCATCTG
CTGCGTCGAA ACCGTACCAG AAAGCTTCAC CAGCATCGAC CAACTCGGCT AGCACCAATC
TTATTTCACC GAACACTTCT CTGGGGCAGC ATTTTCTCAA GAACCCTGCT GTAATAAGTT
CGATTATAGA CAAGGCTGGA TTAAAGGCAA CGGATGTTGT GCTTGAGATC GGACCAGGAA
CAGGAAACAT GACTGTTCCG ATGTTGCAAC GGGCCAAGAA GGTTGTCGCC CTGGAGTTCG
ATTCTAGAAT GGTTCGCGAG GTTCTGAAAA GAACCGAGGG CACGGATTTG GCACATAAAC
TTCAGGTTAT ACAGGGTGAC GCTATGAAGA CAGCTTGGCC ATTTTTTGAT TGTATGATTG
CAAACTTGCC CTATCAGATT AGTTCTCAAG TCGTCTTTAA ACTTCTTTCT CATCGGCCCA
TGTTTCGTTG TGCCGTTCTC ATGTTTCAAG AGGAGTTCGC TTTGCGTCTC TCTGCTCGCC
CCGGGGAGGC ACTGTACTGC CGCCTTTCTG TGAATACGCA GCTACTGGCA AAGGTAGATC
AGCTATTGAA AGTTGGGTAA GACTCTTCTG GATTAGTGTT TGTAGATTGG AGCAGTTCTC
TAACGTCATG ACCACATGCA CTTTATCAGG AAACAAAATT TTCGCCCGCC ACCGAAGGTA
GAGTCCCGTG TTGTGCGTAT TGAGCTAAAA AATCCTCCTC CTCCTGTGAA TTTTACTGAA
TGGGACGGAA TGGTACGTTC GAATGTATTC AAGCAGTGGC CAACTGGTCT AGCAGATTGA
AATTCCTAAT TGCGTTGGAC TTGCTTCTAC AGATTCGATT GTTATTTAAT CGAAAAAACA
AAACGCTTCG CTCGGTTCTA AATACAAAGT CAGTTATGAA ATTGCTGGAG GATAATCGGA
GAACAGTTCA GTCACTTCAT CCCGAAAAGA TGGTCGACGG TCGACCCGCT CAAGTTATCG
TGGAAGAGAT TTTGGAAAGA GATTCATGGA AAGGACAGCG TGCAGCAAAA CTTGATCTAG
ACGACTTCTT ACAGCTGCTT GCTGAGTTCA ACGAGGCTGG AATTCATTTT AATTGA
 
Protein sequence
MPKALKKRGQ THIASAASKP YQKASPASTN SASTNLISPN TSLGQHFLKN PAVISSIIDK 
AGLKATDVVL EIGPGTGNMT VPMLQRAKKV VALEFDSRMV REVLKRTEGT DLAHKLQVIQ
GDAMKTAWPF FDCMIANLPY QISSQVVFKL LSHRPMFRCA VLMFQEEFAL RLSARPGEAL
YCRLSVNTQL LAKVDQLLKV GKQNFRPPPK VESRVVRIEL KNPPPPVNFT EWDGMIRLLF
NRKNKTLRSV LNTKSVMKLL EDNRRTVQSL HPEKMVDGRP AQVIVEEILE RDSWKGQRAA
KLDLDDFLQL LAEFNEAGIH FN