Gene PHATRDRAFT_47107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47107 
Symbol 
ID7202182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp458505 
End bp459564 
Gene Length1060 bp 
Protein Length342 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181210 
Protein GI219121723 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0432891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGTTTTC GGTCGGCGGT TCTGTTTCTA TTGGCTCTTC TTTCCTCCTG TGACGCCCAA 
GATCCACTTC ACCTGTCGGC AAGGCCGTGG CGAAAGTATC CAACGAGCCC CGATAGAAGC
CACTTTTCCC GTTCGCGTCC AATTCCACAT GCGTACGGGG ACCTAAGTGC ACACTTTCTG
GACAACGAGG AAATCAAAAC TGACCGAAGA GCACGGCGAT GGCAAGTCAA CCTGAAAACC
AGAAATATTG GCCGGCTATC TTGGTCCAGC CGAATCATAT GGACAAACAT TGCGACCTTT
GCTGCCCAGG CTTGGAAGCC TTCGTTTACT CAATGGGGTA TAAAAGTATC CGAGAAGATT
TTGCGTGGCG AAGAACTGTA CAGACTTATT ACTCCAGTGT TCCTACATGG CGGCTTCGGT
CATATTTTTA CAAATATGAT TTCGCTGAGC AGAGTCGGAC CAGATGTGGA GCGATTGTTT
GGATCAGGAC GATTTCTGAC AACGTACATG GTTTCTGGAA TGACAGGCAA TCTTCTTTCT
GCATATATGT CTCCCAACCC TGGTTTAGGC GCTAGCGGAG CCGTTTTTGG GGTCGTCGGC
GCGTACTATG TTTTTTTGAC CCGCAATGAG TGGTTACTTG GACCAGCGGG ACAAAGCGTC
ACATCTAGTA TTACACAAAC GATGCTGTTT AATATTTTCC TGGGTGCATT GAATCCAGTT
ATTGATAATT GGGCTCATCT GGGCGGCGCT CTTGGTGGTG CGGCAATGGG CTACTACTTT
GGACCGCGAC TTTACCTAGT AGAACTTCCA GAAGGTGGAC GTATAGTGAT GGATCGCCCA
ATCGCTCGCC TTCCTAGAAA CATAGAATCG ATTCCCGGGA ATCTGGCGGG GCAAATCAAA
CGAATAACAC GACGGATGCA GGTTGAAAGA TACAAGACAG AGATGCCGAC AAGGCCTTGG
CAACAACGAC AACAACACAT GCGACAAACG GCACCAAACC GTTCAATCAA ACCTGGTCCA
GTGGATTAAG CACAGAATGT AGTCGCATCG TTTACGTGCT
 
Protein sequence
MCFRSAVLFL LALLSSCDAQ DPLHLSARPW RKYPTSPDRS HFSRSRPIPH AYGDLSAHFL 
DNEEIKTDRR ARRWQVNLKT RNIGRLSWSS RIIWTNIATF AAQAWKPSFT QWGIKVSEKI
LRGEELYRLI TPVFLHGGFG HIFTNMISLS RVGPDVERLF GSGRFLTTYM VSGMTGNLLS
AYMSPNPGLG ASGAVFGVVG AYYVFLTRNE WLLGPAGQSV TSSITQTMLF NIFLGALNPV
IDNWAHLGGA LGGAAMGYYF GPRLYLVELP EGGRIVMDRP IARLPRNIES IPGNLAGQIK
RITRRMQVER YKTEMPTRPW QQRQQHMRQT APNRSIKPGP VD