Gene PHATRDRAFT_40219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40219 
Symbol 
ID7195841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp430573 
End bp433401 
Gene Length2829 bp 
Protein Length942 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184251 
Protein GI219128082 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTGTC AAAAACTCAA GGGTGTCACC GAACTTCGCA AGGGATGCTC TTCGGAGTCA 
TCGCTCAAGG CGGCATCTCG GCACTTCGAT CTCTTCCTTG AAGAGGTAAT TTTGTCCGGC
ACCATTGGCG GAAAAGAGAA GCAGCAGCTC TTTCCAATAA GAAGCGTCAA TAACAGCACA
GATAGCAACA ACATTGATAA CAGCAAAAGA TTGGATGGTG ATGCGCAGAC GCAGTACTCG
TTCAAAACCA TCGCCGTGGA AAAGATCAAC GACAAACTGC TGAATCTCTT TGCTGGATAT
TTGACAAGGG CCGAAAAACT CCGTGGGAAT AAGAACGAAA CGCAGGACAA TTTCTCGGAT
GGCAACGGAA TTTCGTACAA CACGGCAGAA CGATACCTGA GCTCCATTAA GAACGAGATT
CTTCGTCGTT GCCTTGATTT GGGGCTAAAG AGATCTTTCG ACGATGCGCA ACAAACGCGA
ATTCGCCAGT CCATGACAAG ACGTTTTGTT GAAAGAGCCG TCCGGAACAA GACGCCTCTG
GCCAGATCTC ATGTCACAGC TGCTCGGAAC GACTTTCTTG TAATTGCGTT GTTATGTATC
TGGGACGGTT CTTTTCCGAT GGCGGATATG TTATTTTATC TTTTGACGCT CCGATACTTA
GCCGGCCGGG GCCAGGAAGT GGCCATGATA TCACGGTCTA GAGTTTCTCT TGGAGAGCCA
TCAGAATGGG CCGATAGTGG TGACAAGACC TTTGTTGTGA GGCTGTGGAG GTCGAAAGTC
AGCCACGAGC AGGATCTTTC TATTGTTCCT CACCAAAGTG AAATGTTGCT TGATTGGGTG
TTTGCATTTG CCTATAGTGC TGTGATGAAT ACCAACCCAA ACGACTCGTT GTTCCCGACC
TTTGCCGAAA AAGTGGAGTT GCGCAATTTA TCAGCTGGAA ACATCAATGA TGAAGAAATT
GGAAATGAGA CTACCCAAGA TAGTACTTTG AAAGCAAAGG TAGATTCCAG CAAAAAAGTC
ACCAAGTACT TTCAAGCACT TTTGGAGCGA CTAATTAAGA CAAGCGAGGA GCTTGTAGGT
CCGAACGAAA TGGCTAGAGC TGCCGGTTTG TATCAGGATG ATGAAGATTT TAGCGAGGAA
TTTGATGGCA GCAGCTGCGT CGGTACAACT AGTGTCAACA ATGAGCCAGG AATATTCAAT
CTGACAGGAG AAGAAGAAAC GGTTGGATTG GATCCCCCAG CCAATGCTGC CTATTACTAT
AGCGGTTTGC TTCATAAACA CGGCATTTCA GCCGGACTCT CAACACATTC GGCTAAGCGC
TCTGCAGTCG AAATGGCAAA TGAAAGTGCT CTATTGCTCA CAACATGGGT ATGCTTCCGG
GCAGGATGGC TGATGAAAGC AGTGCATACT ATTTTTGATT ACCTATCATT TAATCCGAAA
AATGATCGAC AAGTTTCGAG GGTGTTCAGC GAATGGAATA CGCCATCTTT TCGTGGTGAG
ATACTAGGTG GGCGTCCTCC AAGACTCCAT CCCATTCGAC TTCAGGGATT TAAAGAAGCA
GAGAAGGTTC GTTGTTTTGT TGCTGCACTA TTTGTGAACT ACGAAGACAA AGGCATAGAC
TCCAATATAT TTTGCACCAT TGACGATCGG AAAAATCGAC TTCGTGAGTT ACGAAATCTG
TATGATCTCT TGACGGCAAC TATTCTCCGT TATCTGCCCA AGTTTGTCAA AGTCCTTCAA
GAACATCCAA ATCCGTCTCA CCCATTCCGA CAAGGTCAAC GCACTTGTAT CGAAAAGCAC
CCGTTTCTGT TACGAATCCT TAAGGCAAGT CAAACAGCAG GCATCTTGTG GGATGAGTTG
AAAGCATGGA GTACTTTGGT TCAGAAAGAC TTTCTCGAGA GAAATTTTGA GTCAGCAAGC
TGGTCTGAGA TGCTAGAGGT TGTCGGGGAG GAGCAATTTT CTGCGGATCC TAGGACACTG
GGAGGCTACA TGGAAACAAC GAATCGGACT ATGAATACTA TTCAACTGGG ACAGAGCAGG
ATGCTGGAGA CTACAAGCAT TCATGATGCC CGACTTAAGG AGCTAACTAC CACTGTCAGC
ATACAAGGGC AAGCAATTGA GGAATGCAGG CAAGCGCTTG CAGAGATAAA GACATTGCTA
AAAGAGGCTC TGGGGAAACC AGATATCCTT ATCCCTGCCA TCAATCAGGG TCAGGAACAA
CTACCCATCA ATCCTTCTGC AGCAGAAAGA ATGCTAGAAA TAGACCAGCA GGTACTCCCA
GTGTATTCTC TTCCCGAGAA GCTCAAGGAT ATGGACCTCA CGGATTTATT TCAGCAATGG
CATTTTCAAC AGTGGCATTT GCAGATGTAT AGCGGTGGAA CCAAACAGAG GGGCATTTCG
AGTCAAATTC GCTTTGGAAT GGAGTACTTT TCTCTGTTCC TTCCGGCACA GGTACCTCCG
TTGCCTACAG GTACAGTCAA TCCAATGGAA TTACTTGCGG AACCGTGGCG GCGGCAGATC
CGTTTCCTTG CTAACAAAGC CTTTGAAGCA ATGGCTGACT TCTTTGAGAC CAAGGGTTTG
AAGGTACCAA AGTCTTTGAC GCCGTTCAAG AAAGCAATGT TTCTAATTGA TGCCAGCGAG
TGGCCAAAAG GGCCACAGGA TCCGTCACCA TTCCCCATTC TTAGTAAAAA AGGCAAGAAG
GAGGAACTTC GCAACTACGA TTGTCTGCAA AAGTCACAGG AAAAAGAAAA GTTAAAGGTC
CAGAACCGCC TTCAGAAGAG GGGCTCCGCG ACCAAACAAA CACTTCCAGT TGTTGTTGAC
ACTACTTAA
 
Protein sequence
MNCQKLKGVT ELRKGCSSES SLKAASRHFD LFLEEVILSG TIGGKEKQQL FPIRSVNNST 
DSNNIDNSKR LDGDAQTQYS FKTIAVEKIN DKLLNLFAGY LTRAEKLRGN KNETQDNFSD
GNGISYNTAE RYLSSIKNEI LRRCLDLGLK RSFDDAQQTR IRQSMTRRFV ERAVRNKTPL
ARSHVTAARN DFLVIALLCI WDGSFPMADM LFYLLTLRYL AGRGQEVAMI SRSRVSLGEP
SEWADSGDKT FVVRLWRSKV SHEQDLSIVP HQSEMLLDWV FAFAYSAVMN TNPNDSLFPT
FAEKVELRNL SAGNINDEEI GNETTQDSTL KAKVDSSKKV TKYFQALLER LIKTSEELVG
PNEMARAAGL YQDDEDFSEE FDGSSCVGTT SVNNEPGIFN LTGEEETVGL DPPANAAYYY
SGLLHKHGIS AGLSTHSAKR SAVEMANESA LLLTTWVCFR AGWLMKAVHT IFDYLSFNPK
NDRQVSRVFS EWNTPSFRGE ILGGRPPRLH PIRLQGFKEA EKVRCFVAAL FVNYEDKGID
SNIFCTIDDR KNRLRELRNL YDLLTATILR YLPKFVKVLQ EHPNPSHPFR QGQRTCIEKH
PFLLRILKAS QTAGILWDEL KAWSTLVQKD FLERNFESAS WSEMLEVVGE EQFSADPRTL
GGYMETTNRT MNTIQLGQSR MLETTSIHDA RLKELTTTVS IQGQAIEECR QALAEIKTLL
KEALGKPDIL IPAINQGQEQ LPINPSAAER MLEIDQQVLP VYSLPEKLKD MDLTDLFQQW
HFQQWHLQMY SGGTKQRGIS SQIRFGMEYF SLFLPAQVPP LPTGTVNPME LLAEPWRRQI
RFLANKAFEA MADFFETKGL KVPKSLTPFK KAMFLIDASE WPKGPQDPSP FPILSKKGKK
EELRNYDCLQ KSQEKEKLKV QNRLQKRGSA TKQTLPVVVD TT