Gene PHATRDRAFT_35589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35589 
Symbol 
ID7200926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp297420 
End bp298727 
Gene Length1308 bp 
Protein Length435 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180211 
Protein GI219118889 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.114952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTGT TGTGGTCTGG TTCACTCGCT AGTTTGCTCA TGGCGTCCAT TGTCAGTGCT 
CTATTCAACC GGAGTAGCAG TACCTGGCGA GCCGTTACGG AAATGGGATC CACACTCTCC
AATCTCGGCA CATTTTCGTC ATCTTCCATC CCCGGGCCAA GAAAGCGAAC GATCAGGATT
CTCCCCAATA CCGTGCAACA AGTTCATGTA GTAATATTGG TACACGGTTG GATGGGAAAC
CCATCGGAAC TTGCATATCT TCAATCGACA ATGGAGCGAC AAGCGTCCAC AATAGAAGCA
GACGACCCAG CCATAATATT TTATGTACAC AGCGCCGAAG CGAACGATGG GCGAACAAGT
GACGGAATTG AAGCTGGTGG AAAACGACTG GCAGGCGAAG TGAATAAAAT ACTCTGCGAC
GCAATGGAGA GTGATGCATC ACGACGCGAC GTATCGCTCT CGTTTGTCGG GAACTCCTTG
GGTGGTTTAT ACGCACGTTA TGCGCTGAGC CAGATTGATG CTTTGCAACA GTGTAGCCTT
TCCAACGATA AAATCTCCCA AAAGAGTTCC AGAGTCATTC CCAGAGTCTT CTGCACCACA
GCCACACCGC ATTTGGGAGT CAGCAGATAC ACTTACCTGC CACTACCACG CGCGGCGGAG
TACATTGTAG CCAAAGTGCT GAAACCCACT GGTTTGGATC TTTTTCGCTA CACGGAAGTA
ATTCAAAATT TGGCGACCCA AAAAAAATTT CTGGATCCCC TCCGATCGTT TGCCAAACGA
ATCGCCTATG CCAATGCGTA TTCAACCGAC TTCCAAGTGC CTACTGCCAC AGCTGGATTT
TTGGCTGATA CTGACTCAAC TCATCGAAGG GTAGCTTTTC AAGAAAACTC CTCTTTCGTT
GAGTTGATCG TCGAGACGCC AAAGTATGTG GATGATAAAT TCGATAGCGG GGGTTCGGAT
GAGTCTCCGG CCACTTGCGA AGACCTCTCG CGTCGTTTGG ATGCTTTGGG CTGGACTAAA
GTATTTTGCG ATGTGAGAGG GAGTCTCCCA TCGGTGCCGT TGCCTTTTCA CACTAAAGAT
GCCTGGAGCT CGGATAGTGC GCACCGATCA AAAACGTATA CGTCCCGGGA ACTATTGGCA
TCTTTGGCGG GTCTGGATTG GGGACGATGG CATGCTCCTT TCGGTCACAC TGTGCTCGTA
GCGAATTCCA AGAACGACGT ATATTCCAAG TTGAATGCAG CGGGACAACC CATTATGGAT
CAGCTCGCAT CTGATTTAAT TCAAGATATC TTACGCGAAG AGTTATAG
 
Protein sequence
MNVLWSGSLA SLLMASIVSA LFNRSSSTWR AVTEMGSTLS NLGTFSSSSI PGPRKRTIRI 
LPNTVQQVHV VILVHGWMGN PSELAYLQST MERQASTIEA DDPAIIFYVH SAEANDGRTS
DGIEAGGKRL AGEVNKILCD AMESDASRRD VSLSFVGNSL GGLYARYALS QIDALQQCSL
SNDKISQKSS RVIPRVFCTT ATPHLGVSRY TYLPLPRAAE YIVAKVLKPT GLDLFRYTEV
IQNLATQKKF LDPLRSFAKR IAYANAYSTD FQVPTATAGF LADTDSTHRR VAFQENSSFV
ELIVETPKYV DDKFDSGGSD ESPATCEDLS RRLDALGWTK VFCDVRGSLP SVPLPFHTKD
AWSSDSAHRS KTYTSRELLA SLAGLDWGRW HAPFGHTVLV ANSKNDVYSK LNAAGQPIMD
QLASDLIQDI LREEL