Gene PHATRDRAFT_42647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42647 
Symbol 
ID7196000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp693275 
End bp694479 
Gene Length1205 bp 
Protein Length389 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176631 
Protein GI219109755 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAAAAAGTCG AAGCTCATCC CGAAAGGAAA CGAAGATGGT AGCCAAACAC AAGAGGATCA 
TACAAAGTCC GACGCATCAC GTTAATGGTG CAGAAATAGT CCTACCGCAA CATGGATATG
CTCCGGGAAA GTCTCCGCTG AATATGAAAT CCGACAGCAG TGGAAGCAGC GAAGACACGG
TTTCGACTAG GGAGACCTTT TGGATAGGGG CACTCCATGA CAGCGAGGAA ATGCAACTAC
CAACCTGGTC GGGGAATCCA CAAAGTCCTT TAGATGGAGG GCACTCATCA AGGAATACAC
TATTATTCAC AGCTCACAAG CGAAATTTCT CGGCGCCTTT GTTCCTTATT GCCTTCGTTC
TCGTGGGATT GGCTGCGATG GTTACTTCAA GAATTACAGT GAACGATGCT TCCGAGCAAG
TATCACTATT GACCACCAAT AGAGCGAAAA TGAACTTGCA ACTTCAAAAA TCACAAAAGG
ACATGCTCAG TCTGAAACGT AAAATCTCGG CAATGGATGC CATGATTCAA CAGCAGCAGG
GCATGGACAC TAACGCTTCC AGTTCAGGCG CTATTCAACA GCGTGCCTTA GAAGAAGTGA
ACAGTCTGCA AGAAAGCCTA ACGTTTTTAG GGAAACATTC TGAGGCATTA AAAAAACAGG
TGCAATCCAT GAGCCTTAAA TCCCTCGAAG ATTCATATGG ATCTTTGATA CAGCGTGTCG
AAGTTGAACT TCAATTTCCT GATCACAAGG TGGGGCCCCA CAAATTCGTC ATCGAACTCG
CACCTATAGA GGTTATGCCG CATTCTGTCG ACGTTTTTCT CCGAATGGTT TCGACTCACT
TACTTGATGG ATGCTCCTTT ATCCTAAACG CTTTGCATGT GGTAAAGGCC GCCCCGCTTC
CATATGACGG CAGTTCCGCT GCCGACAAGG CGAAGGCATT TACCGAACAC GGCTTGGAGA
GCGTAGCTTT CCGTGAATAC AACGCAGACT ACCCGCATAA ACAGTATACG GTGGGTTTTG
CCGCAGACGG CAGTCCGAGT TTTTACATCA ATACAGAAGA CAACAGTGAA ATTCACATCG
GAGATCCATG CTTCGGCAGG ATAGTTGAGG GTTTCGACAC TATCCGCAGA TTGGAAGCGA
GTCCTACCCG TAACGGTATC TGGTTTGAGA AAAGGATAGG CATCAAACGA GCTCGAATCT
TATAG
 
Protein sequence
MVAKHKRIIQ SPTHHVNGAE IVLPQHGYAP GKSPLNMKSD SSGSSEDTVS TRETFWIGAL 
HDSEEMQLPT WSGNPQSPLD GGHSSRNTLL FTAHKRNFSA PLFLIAFVLV GLAAMVTSRI
TVNDASEQVS LLTTNRAKMN LQLQKSQKDM LSLKRKISAM DAMIQQQQGM DTNASSSGAI
QQRALEEVNS LQESLTFLGK HSEALKKQVQ SMSLKSLEDS YGSLIQRVEV ELQFPDHKVG
PHKFVIELAP IEVMPHSVDV FLRMVSTHLL DGCSFILNAL HVVKAAPLPY DGSSAADKAK
AFTEHGLESV AFREYNADYP HKQYTVGFAA DGSPSFYINT EDNSEIHIGD PCFGRIVEGF
DTIRRLEASP TRNGIWFEKR IGIKRARIL