Gene PHATRDRAFT_33955 details


Gene Information

Locus tag: PHATRDRAFT_33955
Symbol:
ID: 7197770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 742584
End bp: 744050
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002178294
Protein GI: 219114997
COG category:
COG ID:
TIGRFAM ID:
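
The coordinate and length fields in this record agree with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 742584, End bp 744050.
length_bp = gene_length(742584, 744050)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1467 488
```

Under these assumptions the result matches the listed Gene Length (1467 bp) and Protein Length (488 aa).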


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAGGC GGATAATTTT TGTCGTCCTG TGGTGGTCAC AACAGTCGTT TGATGTTTCG 
GCTATCGAAG ACTTTTTTCG GATCCAAAGT CGGCGCCCTC GCACCCTTTT GGCAGCTCAA
ACGAATATTG CTGCCTCAAT AGCTGGACTT CGGGGAGGCA ACAACGAGAT TCCCAACGTA
GAATACGATG AGATTACGGC AATCAAAAAA TCTCGCGAGC AGTCCGCCGA AACCAAGGCA
ACATCAATCA ATATCCCAGA TGCCATTGCC CGGGATCCTG CTACAGAGTC CCCGCAACCG
ACATCCAGTG ACGAAGCTAG TGGAAAGGCA GCCGTCTCGA AGCAAACGAC ATCAAACAAA
TCAACAACAT TTGTTCCCAA GAGACACACG CTTCCCTCGT TACCCTCTTA TTGGAAACGA
CACGGAAAGA TTTGGAAGGA TCAAGTTGCG TCGATGCAAC TTAGCGCCAA GCCACTTCAG
CTTTCCATGC AACAAAGCTG GCAGGTTGCC GAACGCCACT CTACCCAGTT CGTATCCACC
ATGGCAGCTT CCATTATTGC AGTTTTTATA AAAAAGCAGT GCGATATATC GTTCGGCCGT
CTCTACGCGC TTGCCCTACT CGGCTCATCG GTGGGCTTCT ATCTCTTTCT CTATTTTATT
TCGGTGGGGT ACGCCTTGGG AGTCGCGTTG CCCGTAACGG TAGCCTTATT TTGTTACAAA
CGCCACACAG TCGTGAATCT TTCCACCACT TTGCACTCTC TCTTTGTCAG TTTCTGGGGC
CTCCGTCTGC TCGTCTTTTT GCTGTGGCGT GAATACATCA ATTGGCCAGC ACTACATCGT
AAGGTTGTGC AAGTCAACGA ATCTCAATCC CCATCAACGA TTGAAAAAGC TATGGGATGG
CTTCTTTACT CGCTCCTGTA CATATGCATG CTTTCTCCTT GTTGGTTCCG ACTGCAGGAG
AATCGAATGA ACGGGACTTG GTCCAACATA CTTCTCGCCG TACAACTCAG TGGGCTAGTA
TTGGAATCCG TGGCTGATAT ACAAAAGAGC TTCTTCAAGG TTTCGGCACC GTCAAACAGG
TACGAATGGT GTCACCAAGG TCTGTGGAAG TGGTCGACGC ACCCCAACTA CTTGGGAGAG
TGGTTGTTTT GGTTAGGTAC TTACCTAGGC GGATGGTCGA CCAAAACAAG TTTCGTACAG
TGGCTCGTCA TGTCGACCGG CTTCGCCTTT CTCACCTGGG TTCTACGTGG AGCCACAATG
TCTTTGGAAC AAAAGTATGG CGACAAGTAC GGAAAAAATC CCGCATACAT AGGATTTACA
GAATCTCACA CCTTTTGGGG TCCAGCGTTT TGGACAAGAT CTTTCCAGCC CACTGCTGCG
GACACAGACC CTGTGGTTCA AGTGGTATTG GAGGAAGAAA TGCCCGACAA TGAAGAAGAA
ACGATACTCA AAAAAGAGCA ACCATAA
 
Protein sequence
MKRRIIFVVL WWSQQSFDVS AIEDFFRIQS RRPRTLLAAQ TNIAASIAGL RGGNNEIPNV 
EYDEITAIKK SREQSAETKA TSINIPDAIA RDPATESPQP TSSDEASGKA AVSKQTTSNK
STTFVPKRHT LPSLPSYWKR HGKIWKDQVA SMQLSAKPLQ LSMQQSWQVA ERHSTQFVST
MAASIIAVFI KKQCDISFGR LYALALLGSS VGFYLFLYFI SVGYALGVAL PVTVALFCYK
RHTVVNLSTT LHSLFVSFWG LRLLVFLLWR EYINWPALHR KVVQVNESQS PSTIEKAMGW
LLYSLLYICM LSPCWFRLQE NRMNGTWSNI LLAVQLSGLV LESVADIQKS FFKVSAPSNR
YEWCHQGLWK WSTHPNYLGE WLFWLGTYLG GWSTKTSFVQ WLVMSTGFAF LTWVLRGATM
SLEQKYGDKY GKNPAYIGFT ESHTFWGPAF WTRSFQPTAA DTDPVVQVVL EEEMPDNEEE
TILKKEQP
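
The sequences in this record are printed in space-separated blocks of ten characters, so they need cleaning before further use. The sketch below joins the blocks and computes a GC percentage of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (empty input gives 0.0)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence as printed in the record (six blocks of ten).
first_line = "ATGAAGAGGC GGATAATTTT TGTCGTCCTG TGGTGGTCAC AACAGTCGTT TGATGTTTCG"
dna = clean_sequence(first_line)
print(len(dna), round(gc_percent(dna), 1))
```

Applying `clean_sequence` to the full gene sequence should give a string of 1467 bases, which can then be passed to `gc_percent` for comparison with the listed 49%.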