Gene PHATR_33584 details

Gene Information

Locus tag: PHATR_33584
Symbol:
ID: 7204119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 1305462
End bp: 1307855
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002186502
Protein GI: 219113837
COG category:
COG ID:
TIGRFAM ID:
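
The coordinates and lengths in the table above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 1305462
END_BP = 1307855

def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1

def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2394
print(protein_length(length_bp))  # 797
```

Both results match the Gene Length and Protein Length fields, as expected for a gene listed as not spliced.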


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAGTA GTATCGCGCC CTCGCAGACA AGGAAAAGGC AGTTTCCGTA TTTGCGTCCC 
GACAAGGTCG ACACATCAGC CGAGATCTTG TCATCGTTTT CCGATATCCG CGTCAAAACG
TTCGAAACCA TCTACGCCGA CGCCGATGAC GAGCACGCGG CATGGCGTCC CGACAAGTCT
CCGAAAGGAT TTGTGGACGA ACCTATTCGC GACGTAGTAG ATTTGATCAA TACGCACCCG
TCATTTGCGA CAGTGTCTTC TTGTTCCGGA CGAATTGCTC TTTTTGATTC GTCGCTGCAA
CAAACAAACG ATGATGGTCT TGTAGAGAGT GGAAAAGGAA TTGGAGGGTG GCTAATGGTT
TGTCACGAAG AAGCGGAACC CGTCTGTTTG CTCGATATTT TCCACGCAAG CGCTGATACA
GGTCATCTCC ATAATGAAGC GGAAATGCCA CTTAGTTTTA AATTTGAGCC AATGCTGCTG
CACGTTGCAG CTGCCAGTCT GTCTCGTGGG CAGCAACTCT TGCAATTAGC TTTGCAATTG
GGATTCCGGG AATCTGGTTT GGTAGTGACA GATCTTCGTG TTACAGTGGC TATTCGAACT
TATAGCTTGG CCTTGACGGT GCCTCTCGCT CGGCATGGGG CGTTTCGTCC TCCCGATGAG
TATCTTCGAG CCCTTGCTGT AGAAGCCAAC AGACGGATGC GAGTCAACAC AGAAAAGATT
CAAAGGCTGT TACATTCCTT GACGGAGCAC TTTTTCCGTC CCGTTCCCTT GTCTTGTCGA
ATACGAGTCC AAGCGCTTCC CCAATTGGGA TTGCGGTCAC ATTCCGCAGT TGCCGTGACG
AGACCTCGTA CAACAGACAG CATCGACATT ATTGTATTCG GGGGATATGG GAGAGGTCCA
AGGCTAGCAA AGAACACTGG AAGAAGTTTA CAGGGATCAC AGAGATCCAG CCATATTTAT
TGTTTGACGA GAGCGAACGG GGTCTGGGAG GATGGCTGGC ATGAGATTCC ACAAGGCCAT
CCAAGTGATT TAGGAGATAC GTGCATTTCT CCGTTTCGAT TCACTTGCCG CCCAGTAGCT
TTGACGACCC GTGAGGGTAG TGCAACGTGC GTTCTACCTG GCGGAATATC TATTGTGGCT
ATATTTGGAG GTCGAACAAA CCCGGCTAAT CCACTTGGAG ACTTACTTCT CTACGACCAC
GAACACCACC CCGGAATTCT ATGGGAACCC AACGACATCC GCGGATGTTT ACCAGAACCT
TCTTGGGGGC ACACACTAAC TGCCATGCCT TTCGGGAGTA GCTCTAATCG TCTGGCGGTT
CTTTGTGGTG GAAGAAACGA AAGGGAATGT TTGGGTTCGA TTTATATCTT GTCGGCGGTG
AGAGATAACG AGCAAGCCGC ACACTTGATC TGGGAAGAGG TCGTCACGTC ACCTCCACTG
CAAGGCGTCT TCCTCCATTC CGCTGTTGCT ACGAGCCACG ACTCGCTTCT ACTGTTTGGT
GGATTGAATA AGCCTTTGGA CATTTTGGAG GCTTTCGACT ATATGACATG TGCGTGTGCG
CATAGCGTTG ACCTTTGTAG TGGAAAACTA ACTCCAATCG ACAGTAAAAG CTGTCCCTGT
CTTTTTGGGC ATACGGTGGT ACCTTTGGTC TCGAGCGAAA ACCAGGGATT TCAAAGTCAC
TTCCTCTTGA CGGGTGGTCT ACAGAAAACG TCGCAAGGAG GGAATTTTGC CACATCAGCT
CCATTTCGAT GTGTTTCCGT GTCAAAAAAT GGTCCGGATC TTTCCTTTGA GCAGCATGGT
ACTATAATTG AAGAAAGCGA CGAAAAATTT GATTTTGGGT CCTTGCTGGA TCACAGCTGC
ATACCTATTG ACAATTGCTC GCGTGAGCCG CACAAATTTA TTTCAGTAGG TGGTGGGGTT
GCAGGATTTG CCTTTGAGCA GTGCTTTGCC GAGTCGTTTA CCTTTGAGGT TCAGCTTGTG
CCAAGTGCCA ACAGTGTGGG AGACGACATA GTCGCTGGAA AAGCAGACGC GACCTTACGA
AAATCGAACT CGACCGCGGA CAATTCGCGT GTGGCCACGC TGCATGCTTC GGAGTCGGCA
CTGATCGATG TGCTGTACGT GGACAGACGA AACGCGAAGA AAGCGAAAAC AATGTTGGAA
GAGGCGTCAT GGCTTGACAA GAGACATCGC ATGTTTCCAG CTGACAGTCA TGCACCTATC
CTAGATGTCG AGAAATGTAT TGCACTCCCT GTTTTGGAGT CATGTTTATT TGCATTGGAC
GCGATGGAAA CTGGATCCAT CAATCTCGGG AAAATTATCA TAGGTAGAGG AAAGCAGTCG
ATGCCACTCA GTACGGCTGC ATATGCGAAC CAAGCGAAAA AGATAAATGC GTAA
 
Protein sequence
MSSSIAPSQT RKRQFPYLRP DKVDTSAEIL SSFSDIRVKT FETIYADADD EHAAWRPDKS 
PKGFVDEPIR DVVDLINTHP SFATVSSCSG RIALFDSSLQ QTNDDGLVES GKGIGGWLMV
CHEEAEPVCL LDIFHASADT GHLHNEAEMP LSFKFEPMLL HVAAASLSRG QQLLQLALQL
GFRESGLVVT DLRVTVAIRT YSLALTVPLA RHGAFRPPDE YLRALAVEAN RRMRVNTEKI
QRLLHSLTEH FFRPVPLSCR IRVQALPQLG LRSHSAVAVT RPRTTDSIDI IVFGGYGRGP
RLAKNTGRSL QGSQRSSHIY CLTRANGVWE DGWHEIPQGH PSDLGDTCIS PFRFTCRPVA
LTTREGSATC VLPGGISIVA IFGGRTNPAN PLGDLLLYDH EHHPGILWEP NDIRGCLPEP
SWGHTLTAMP FGSSSNRLAV LCGGRNEREC LGSIYILSAV RDNEQAAHLI WEEVVTSPPL
QGVFLHSAVA TSHDSLLLFG GLNKPLDILE AFDYMTCACA HSVDLCSGKL TPIDSKSCPC
LFGHTVVPLV SSENQGFQSH FLLTGGLQKT SQGGNFATSA PFRCVSVSKN GPDLSFEQHG
TIIEESDEKF DFGSLLDHSC IPIDNCSREP HKFISVGGGV AGFAFEQCFA ESFTFEVQLV
PSANSVGDDI VAGKADATLR KSNSTADNSR VATLHASESA LIDVLYVDRR NAKKAKTMLE
EASWLDKRHR MFPADSHAPI LDVEKCIALP VLESCLFALD AMETGSINLG KIIIGRGKQS
MPLSTAAYAN QAKKINA
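
The gene and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field is empty in this record; the helper names `clean` and `translate` are hypothetical. Only the first 60 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1). This is an assumption: the record
# leaves "Translation table" blank. Codons are enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence as listed in the record.
first_line = "ATGAGTAGTA GTATCGCGCC CTCGCAGACA AGGAAAAGGC AGTTTCCGTA TTTGCGTCCC"
print(translate(first_line))  # MSSSIAPSQTRKRQFPYLRP
```

The output matches the first 20 residues of the listed protein sequence. If the genetic-code assumption holds, passing the full gene sequence to `translate` should reproduce the entire 797-residue protein.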