Gene PHATR_33082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_33082 
Symbol 
ID7204080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp3397 
End bp4950 
Gene Length1554 bp 
Protein Length517 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186257 
Protein GI219113347 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTGTC AAAAACTCAA GGGTGTCACC GAACTTCGCA AGGGATGCTC TTCGGAGTCA 
TCGCTCAAGG CGGCATCTCG GCACTTCGAT CTCTTCCTTG AAGAGGTGAT TTCGTCCGGC
ACCATTGGCG GAAAAGAGAA GCAGCAGCTC TTTCCAATAA GAAGCATCAA TAACAGCACA
GATAGCAACA ACATTGATAA CAGCGAAAGA TTGGATGGTG ATGCGCAGAC GCAGTACTCG
TTCAAAACCA TCGCCGCGGA AAAGATCAAC GACAAACTGC TGAATCTCTT TGCTGGATAT
TTGACAAGGG CTGAAAAACT CCGTGGGAAT AAGAACGAAA CGCAGGACAA TTTCTCGGAT
GGCAACGGAA TTTCGTACAA CACGGCAGAA CGATACCTGA GCTCCATTAA GAACGAGATT
CTTCGTCGTT GCCTTGATTT GGGGCTAAAG AGATCTTTCG ACGATGCGCA ACAAACGCGA
ATTCGCCAGT CCATGACAAG ACGTTTTGTT GAAAGAGCCG TCCGGAACAA GACGCCTCTG
GCCAGATCTC ATGTCACAGC TGCTCGGAAC GACTTTCTTG TAATTGCGTT TTTATGTATC
TGGGACGGTT CTTTTCCGAT GGCGGATATG TTATTTTATC TTTTGACGCT CCGATACTTA
GCCGGCCGGG GCCAGGAAGT GGCCATGATA TCACGGTCTA GAGTTTCTCT TGGTGAGCCA
TCAGAATGGG CCGATAGTGG TGACAAGACC TTTGTTGTGA GGCTGTGGAG GTCGAAAGTC
AGCCACGAGC AGGATCTTTC TATTGTTCCT CACCAAAGTG AAATGTTGCT TGATTGGGTG
TTTGCATTTG CCTATAGTGC TGTGATGAAT ACCAACCCAA ACGACTCGTT GTTCCCGACC
TTTGCCGAAA AAGTGGAGTT GCGCAATTTA TCAGCTGGAA ACATCAATGA TGAAGAAATT
GGAAATGAGA CTACCCAAGA TAGTACTTTG AAAGCAAAGG TAGATTCCAG CAAAAAAGTC
ACCAAGTACT TTCAAGCACT TTTGGAGCGA CTAATTAAGA CAAGCGAGGA GCTTGTAGGT
CCGAACGAAA TGGCTAGAGC TGCCGGTTTG TATCAGGATG ATGAAGATTT TAGCGAGGAA
TTTGATGGTA GCAGCTGCGT CGGTACAACT AGTGTCAACA ATGAGCCAGG AATATTCAAT
CTGACAGGAG AAGAAGAAAC GGTTGGATTG GATCCCCCAG CCAATGCTGC CTATTACTAT
AGCGGTTTGC TTCATAAGCA CGGCATTTCA GCCGGACTCT CAACACATTC GGCTAAGCGC
TCTGCAGTCG AAATGGCAAA TGAAAGTGCT CTATTGCTCA CAACATGGGT ATGCTTCCGG
GCAGGATGGC TGATGAAAGC AGTGCATACT ATTTTTGATT ACCTATCATT TAATCCGAAA
AATGATCGAC AAGTTTCGAG GGTGTTCAGC GAATGGAATA CGCCATCTTT TCGTGGTGAG
ATACTAGGTG GGCGTCCTCC AAGACTCCAT CCCATTCGAC TTCAGGGATC CTAA
 
Protein sequence
MNCQKLKGVT ELRKGCSSES SLKAASRHFD LFLEEVISSG TIGGKEKQQL FPIRSINNST 
DSNNIDNSER LDGDAQTQYS FKTIAAEKIN DKLLNLFAGY LTRAEKLRGN KNETQDNFSD
GNGISYNTAE RYLSSIKNEI LRRCLDLGLK RSFDDAQQTR IRQSMTRRFV ERAVRNKTPL
ARSHVTAARN DFLVIAFLCI WDGSFPMADM LFYLLTLRYL AGRGQEVAMI SRSRVSLGEP
SEWADSGDKT FVVRLWRSKV SHEQDLSIVP HQSEMLLDWV FAFAYSAVMN TNPNDSLFPT
FAEKVELRNL SAGNINDEEI GNETTQDSTL KAKVDSSKKV TKYFQALLER LIKTSEELVG
PNEMARAAGL YQDDEDFSEE FDGSSCVGTT SVNNEPGIFN LTGEEETVGL DPPANAAYYY
SGLLHKHGIS AGLSTHSAKR SAVEMANESA LLLTTWVCFR AGWLMKAVHT IFDYLSFNPK
NDRQVSRVFS EWNTPSFRGE ILGGRPPRLH PIRLQGS