Gene PHATRDRAFT_36079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36079 
Symbol 
ID7201258 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp498878 
End bp500707 
Gene Length1830 bp 
Protein Length565 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180650 
Protein GI219119795 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTCGA CAAGGTCGAA GAGAGCCAGT GCACGCATTC GAAAGAGACC CATCAGCTTT 
GGCAACCCGG CTCTTGGAAC CTATCATGGC GATGTATACA CGGTCGAAGG TGACCACGAT
AACATTGGCA CCCAGCTTCT AGCGGAAGGC GTAGTGACCG CCAACAATTT AGTTGCCATG
GTCGATGTGC CACCAGAGCA GGTACCCGAA GGCGTATTGA ATTTGGCGCG TTCCCATCGA
CCGTTGATTG ACCATATGCG CGTAGTTATT GCGGAAGGAA ATGGCCAGGT CGAGGCCGTA
CATCGGTCTA CAGAGACCGA TGTGGTTAGC GAAGAGGCAA GAGGGAAACC GCTCGCTTCC
TACACCTTCG CTGGCGTCAA TGCCGCTTCC ATTTTAGCTT CTGACGTTGA AGTGTCGGAA
GCTGCGATTG ATCCAAAAGC GCACATGGAG CAATTACTGG AAGACGATGG AGCCGAGCAC
CATCCATGTC GAACATACCT TGTCTTGCTA GAAGCATGCT CGCCCGAAGC CGCAAAGGAC
CTTGTGCAAG ACTTGCGTGG AATGCCTTAT ACATTTCTAG ACGAGACACA AACTTGCAGC
GTTTTCCACG TTGTGGCACT AGAAGGGGCC GACGGAGTTT CCCTCATGTC CCCTTTCTTC
GCGCCGTCCA CAAAAGCGAC AGACAATGGG CACTTGGAAA TTTCATCGTC TCATTCAAGT
GACTCCCGAA TGGAGGCAAG AAATCGCTGC GGGAGCGTTG ATTGTAAATC TGGAGAATCT
GAGCACCCTT CAGGGCAGCA CCAACGCTCG GAAGATTACA ATTGCGCAGT CTGCCTAGAG
CATATGGACA TGACCTATCC CAGATCTGGC GAACGGACCT CAATTTTGAC AACAGTCTGT
AATCATTCGT TTCATATGGA TTGCTTGCTG CAATGGCAGG ACTCTCCCTG TCCGGTTTGT
CGCTTTGATC ATTCTGGTTT GAACGAGGCG TTGTCACAGT GCCACCTATG CGGAAGTACC
GCCCACAACT ACGTTTGTTT GATATGTGGT ATTGTGTCGT GCAGCGGAGG GCCCCGCTCC
TCTAGTGCTG CTGCTGGTAG GTTAGGCCCA CATGACAGCT TTTCACAGTG TCGATCCGAA
GACACGCCCA TATTGCCGTG TTACCAGAGA CAAGCGTTGT CGCATGCACG GCAGCATTAC
GACGAAACAC TACATGCATA TGCATTAGAT ACGGAGACGC AGCATGTATG GGACTTTGCC
GGTCAAGGGT ACGTGCATCG CCTCTTACAA AACAAAGAGG ACGGGAAACT AGTAGAAGTA
CACGATCCCT ATAACACCAC TTCCCAAGAA CGTTCGCTAA GTCCTGGTTT GAGCGAATCG
CAAGAAGGAG AAGTTGTGCA TCGCAAGCTA GAGGGGTTTG CTAGTCAATA CTATACATTG
CTGAAATCGC AATTAGAGCA GCAACGTATT TTTTATGAAG GTCGATTGGA AGAGATTCGA
CGCGATTACG ACGTGGCGAA GCCTCTTAAA AAGTCGACCG ACCTGATTGC TGCTCTAAAA
CAAGAGCGCA ATCAACTTTC GCAGCGACTA GTTACGCTAG AGACGCGTCG ACGAAAAGTG
CTGGAAGACG TTTCCTTTCT CGTCAGTATG AATGAGAGCC TGGTAGCCAA CAAGGAACCA
CTCAGGCGAC AGATCGAAGA AGCTCAACAA CAAAGCTTAA ATGCTCGTCG TACCTTCGAA
GAACTTTTAC AGCCATTGCA GGATAAGGTC ACGGCTCTGA TGTTACAGCT GGAGGATGAG
GAAAGCGATA AAAAGCCAGC AGCTCTATGA
 
Protein sequence
MSSTRSKRAS ARIRKRPISF GNPALGTYHG DVYTVEGDHD NIGTQLLAEG VVTANNLVAM 
VDVPPEQVPE GVLNLARSHR PLIDHMRVVI AEGNGQVEAV HRSTETDVVS EEARGKPLAS
YTFAGVNAAS ILASDVEVSE AAIDPKAHME QLLEDDGAEH HPCRTYLVLL EACSPEAAKD
LVQDLRGMPY TFLDETQTCS VFHVVALEGA DGVSLMSPFF APSTKATDNG HLEISSSHSS
DSRMEARNRC GSVDCKSGES EHPSGQHQRS EDYNCAVCLE HMDMTYPRSG ERTSILTTVC
NHSFHMDCLL QWQDSPCPVC RFDHSGLNEA LSQCHLCGST AHNYVCLICG IVSCSGGPRS
SSAAADTETQ HVWDFAGQGY VHRLLQNKED GKLVEVHDPY NTTSQERSLS PGLSESQEGE
VVHRKLEGFA SQYYTLLKSQ LEQQRIFYEG RLEEIRRDYD VAKPLKKSTD LIAALKQERN
QLSQRLVTLE TRRRKVLEDV SFLVSMNESL VANKEPLRRQ IEEAQQQSLN ARRTFEELLQ
PLQDKVTALM LQLEDEESDK KPAAL