Gene PHATRDRAFT_41156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41156 
Symbol 
ID7199099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp94150 
End bp95301 
Gene Length1152 bp 
Protein Length383 aa 
Translation table 
GC content44% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185204 
Protein GI219130085 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCTCTT CTGCAGGAAA CCCACTCAAG TCTGATGATG ATCTTAAGCT GATGCTAGAG 
AAGCTCGAAT TCATCAAGAC TGCAAAAATC TTCAGGCGTT GGAGGGGATC ATTTCTCAAC
CGCTTCGAAA TATTCTTAGA AGAAGACAAT ATAGTGAACG CCCAAGGGGC ATACGAACAG
TTTTCTCATA GCATGGAGGC ATTCGCAAAT CAGGTAAAGA AAGTTGAAGC GTACATAGAA
AGCGGAGACC TTACAGCGGA CCGTTATTTT TCTTCCGTCA AGGCAAGAAA CTGCCTGAAT
GAAATGTCTA AAATGATGGC TTCGGTTATA GAAGAAAAAG ATGCCTTGAT TCCTTCTACA
GTCTCAATGG AAAATGACAG CGGCTACAAC AAGTTTCATA TGGGAGCTGT TCTCATTCGT
GACAATTTCG AAGAATATGG CCGACTGACA TATTACGGAG AAAGCTTGCA GCACATGAGA
AAGGTAACCC TGGCCGAGAT TATTGACAAG CAAATTTTAG AGCAGATGGA CAACTATGGA
GCGAAGCTCA AAAAATTTTG CGATGTAATG GCAGATCTTG GCCTTTACGA AGTCATGCTA
AAATGCCGTG AGTTTGCTTG TGTCGAGGAT AACAAGGATG ATCTTATATT CCTCGACCTG
AAAACTGGTG GAATCGGCGA ATTGGATCGA GCCGCCTGTC TCGGAAAACG TGTAATCACG
TCTACTCACA AAGATCAGGA AGGTAACGAG ATTTTCGAAG AATCCGTCTT AGACGACGAT
GGCAAAGCAA AGCTTCTCAA AATGATACGC CAGAATCCGA GGCTAGGACT AGGTTTTGGA
AACAGCTTGA ACTCCTTCCA GGAAGAGAGT CTCGCTACTG AAAATGCAGA AGTTATAAGA
AGTATGTGGG GTGTGACATT GCGAAAGACA CCGAGAAACA AGAAAGGAGA GGAGTTCATC
TTTCTCTGTC AGAAAACCGG TGTTTTCGGA GAACTTTCAC GCAAAACGTG TTTAGAGGTG
GCGATCATTA CCGAAGTGAA GGACGAAAAT GGAGAAGCCA AAGTTTGCGA GTCACAGCTT
GAGTTTGACG AAAGAGCGTC GCTCTTAGAG CAGATCCGAT CTCTTCTTGA TTTGGGAGTG
CTGGAACAGT GA
 
Protein sequence
MVSSAGNPLK SDDDLKLMLE KLEFIKTAKI FRRWRGSFLN RFEIFLEEDN IVNAQGAYEQ 
FSHSMEAFAN QVKKVEAYIE SGDLTADRYF SSVKARNCLN EMSKMMASVI EEKDALIPST
VSMENDSGYN KFHMGAVLIR DNFEEYGRLT YYGESLQHMR KVTLAEIIDK QILEQMDNYG
AKLKKFCDVM ADLGLYEVML KCREFACVED NKDDLIFLDL KTGGIGELDR AACLGKRVIT
STHKDQEGNE IFEESVLDDD GKAKLLKMIR QNPRLGLGFG NSLNSFQEES LATENAEVIR
SMWGVTLRKT PRNKKGEEFI FLCQKTGVFG ELSRKTCLEV AIITEVKDEN GEAKVCESQL
EFDERASLLE QIRSLLDLGV LEQ