Gene PHATR_33458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_33458 
Symbol 
ID7204033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp964109 
End bp965395 
Gene Length1287 bp 
Protein Length428 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186162 
Protein GI219113157 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0063707 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAAG TCGTAGAAAA CGAGGATTTG TTTACCGACA TTTATTCAGA TAACTCGACC 
TGCGCTTCGT CATATTCTAC TTTGAAGAAC AGCAAAGATG GATGGCGAAT TATAGATTGG
AACAAGGAAG CCGTGGGCGT ACTTTCTGAT GAAAGCAAGT GGACTCAACC ATCATCTGTC
GATGTTACGG TGATCCAAGA CAGGCTTGTT TACGTGAAAA GAGACGACCA CCTTCGCCTT
CAAGGTTCCC AGATTGCCGG CAACAAAGCA CGTAAGATGC TCGCTTTGAA CAATCTGAAG
GATTTTCCGT TGTGCGTAGT AAGCTATGGT GGACCACAGA GTAATGCAAT GGTTGCCTTG
GCTGCGGTTG TCAATTTCCA GAATATAAAG CAGGGTATAG ACGATCACCA CGATCCGCAT
CGGTGTCGGT TCATATATTA CACAAAGAAA CTACCCAAGT TTCTCCGAAA CCAGCCAACC
GGAAATCTTT TCAGGGCAAA GATGTTGGGA ATTGAGATGA TAGAGCTACC GCCTGAAGAG
TACAGAACTT TATTTGGTGG CAAGTGGGGC GCGAATACGC ACGCTCCGCA GGGTTTAACT
CCTCCTGTCC CTGGTGACTC ACTTTGGATC CCACAAGGGG GATCTTCTGG AATGGCTCAT
GCCGGCACAA GGTTGCTGGC TCAAGAAATA TGTGAATTTT GGTCTCTGAA GGGAAATGGG
CGTCCACTTT CTGTTGCTAT TCCCGGGGGA ACATGTTCAA CCGCTGTTTT GGTTCACACC
GCAATTGAGA GCTTACAGTC CAAGCTTTCA AATGACAAAC AAATGGACAT TAAAGTTGTT
GTAATCCCAT GTATTGGTGA TGACACCTAT GCTAGAAGGC AAATGATGGC ACTGAATACA
CAGCTAGGCA ATGCCTCCAA TGATCTTCCC ACAATATTGA AGCCCTCGCC TTTTGACTTG
GCCACCCAAC ATAATCACAA ACATTCTGAC AAATATTTCA CATTTGGTGA ACCAGAAAAG
GATATTCTTG AAACATTTGT CTATATAAAG GAGAAGTGTG ATATAACTTT GGACTTGCTG
TATGGAGCGC CGGCATGGGC GGTCTTGCTC AGGCACTGGA AAGGAAAACA GACTTCCCCA
TCAGTGTTTG ACGCAAATGC GCCATTTGCA GATCGCTCAG TCATGTATGT GCATAGTGGT
GGCATTGAAG GCGTTAACAC TCAACTATTA CGCTACCGGT ACAAAGGCCT GCTGAAAACC
AAAGATGTTC AACTTCCAAA CCACTGA
 
Protein sequence
MTQVVENEDL FTDIYSDNST CASSYSTLKN SKDGWRIIDW NKEAVGVLSD ESKWTQPSSV 
DVTVIQDRLV YVKRDDHLRL QGSQIAGNKA RKMLALNNLK DFPLCVVSYG GPQSNAMVAL
AAVVNFQNIK QGIDDHHDPH RCRFIYYTKK LPKFLRNQPT GNLFRAKMLG IEMIELPPEE
YRTLFGGKWG ANTHAPQGLT PPVPGDSLWI PQGGSSGMAH AGTRLLAQEI CEFWSLKGNG
RPLSVAIPGG TCSTAVLVHT AIESLQSKLS NDKQMDIKVV VIPCIGDDTY ARRQMMALNT
QLGNASNDLP TILKPSPFDL ATQHNHKHSD KYFTFGEPEK DILETFVYIK EKCDITLDLL
YGAPAWAVLL RHWKGKQTSP SVFDANAPFA DRSVMYVHSG GIEGVNTQLL RYRYKGLLKT
KDVQLPNH