Gene PHATRDRAFT_40692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40692 
Symbol 
ID7198591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp160868 
End bp162088 
Gene Length1221 bp 
Protein Length406 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184660 
Protein GI219128943 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.388438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTTT CTATTTTTGC GTTGCTCGGC GCCATTTCGG CCTCGTCCCT TCTGCCTGAA 
CTGACGGGGG CTCTTCCAAA CGGCGATACC ACTTACGAAC AGCAAATCGG ACGCCTGGAC
AATGCGGCTC GCTCATCCGG CCGTATGTTG ATGAAGAACC TGAGGATGGA TTTAAGCGAC
AGGCAGGTTA GTACAAAGTC GCCTGCCCCA CTTGTGACTT CTCCTCCAAC AGCCCAGCCG
ACATCATCTA TTAAGCAAGC TTCCTCCAAA GTCCCGTCCG ACGGTTCCAG TAGCCCTAGT
TCGAAATCGT CTCCAACCTC ACCGGAGACA GACATGCCAG CATCTAATGC TTCCAAGAAA
AGCAGCAAAT CCAGATCTCC GACATCGCCG AATCCGAAGA GCGCGAAAGC TACGGTATCT
CCCGATCAAA CTCCGTCTGA CTTCCCCTCC GAAGTTCCTA CTGGTCCCAG TGCGAAGTCC
TCTGTGGCAC CGGCAGCCAC CAAGAAGAGC GGCAAGGGAG GGTCCCCGAC ATTGCCAAAT
CCGAAGAGCG CGAAAGCTAC GGTATCTCCT GATCAGACGC CGTCCGACTT CCCCTCCGAG
GTTCCTAGTA GTCCCAGCGC AAAGTCTTCC GATATATCAA AGGTGAGCGC TTCCAAGAAG
AGCGGCAAAG GCGGGTCCAC GACATCGCCG AATCCGAAGA GCGCAAAAGC TACGGTATCT
CCGGATCAGG CTCCGTCCGA TGTCCCCTCT CCGGTTCCGA GTAGTCCGAG TGTGGAACCT
TCTTCTGACG CACCGGTGGC CGATACTTCC ATCACTTCCA AGAACGGTAG CAAGACCGGG
ACTCCGACCG TATCGAAAAA TCCAAATCAA GCTCCGACCG CAGTCCCTAC GAGTCCCGCT
ACTATTCCGC CGTTTACCTT CTCGTTACCA CCAGTGTTGA CTTTGCCGCC AGTCAGCGAA
TTGCCAAAGA TCCCTACGCA GAAAGCTGAT CAATTACCAC TTCCGAAACT ACCCAGAACA
AAAAAGGGAA CCAAGAAAAG CTTGACTAAA AGCACCGCTG ATTTGCCTCC AGTGATGGCT
CTACCGCCAG TGATCGATTC GCCGAACATC CCCAGCAAGA ATAGTAACGT GATAACCGTT
CCGAAAGTAC CCGAAACGAA AAAAGCCACG ACCAAAGGCT TGACCAAGAC CACCAACTCG
TTGGAAATTC CCGCCTTCTA A
 
Protein sequence
MRFSIFALLG AISASSLLPE LTGALPNGDT TYEQQIGRLD NAARSSGRML MKNLRMDLSD 
RQVSTKSPAP LVTSPPTAQP TSSIKQASSK VPSDGSSSPS SKSSPTSPET DMPASNASKK
SSKSRSPTSP NPKSAKATVS PDQTPSDFPS EVPTGPSAKS SVAPAATKKS GKGGSPTLPN
PKSAKATVSP DQTPSDFPSE VPSSPSAKSS DISKVSASKK SGKGGSTTSP NPKSAKATVS
PDQAPSDVPS PVPSSPSVEP SSDAPVADTS ITSKNGSKTG TPTVSKNPNQ APTAVPTSPA
TIPPFTFSLP PVLTLPPVSE LPKIPTQKAD QLPLPKLPRT KKGTKKSLTK STADLPPVMA
LPPVIDSPNI PSKNSNVITV PKVPETKKAT TKGLTKTTNS LEIPAF