Gene PHATRDRAFT_38559 details


Gene Information

Locus tag: PHATRDRAFT_38559
Symbol:
ID: 7203306
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011684
Strand:
Start bp: 395130
End bp: 396356
Gene Length: 1227 bp
Protein Length: 386 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002182527
Protein GI: 219124473
COG category:
COG ID:
TIGRFAM ID:
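
The length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the coding region ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def noncoding_bp(gene_length_bp: int, protein_length_aa: int) -> int:
    """Bases in the gene span not accounted for by the codons plus one stop codon (assumed)."""
    return gene_length_bp - 3 * (protein_length_aa + 1)


# Values taken from the record: Start bp, End bp, Protein Length.
gene_length = span_length(395130, 396356)
extra = noncoding_bp(gene_length, 386)
print(gene_length, extra)
```

Under these assumptions the span works out to 1227 bp, matching the listed gene length, and leaves 66 bp beyond the 387 codons, which would be consistent with the gene being flagged as spliced.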


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATAA AGGTGGGTAT CTTGGGTCTT CCGAATGTTG GAAAGTCAAC CCTCTTTAAT 
GCACTTGTGC AGAAATCCAT TGCGCACGCC GCTAATTTGT AAGTCGTGGT ATGCATTGGA
TTGAGTTGCG TTCGAGGACA GCGTCTCACT GCGGTTACCA ACAGCCCATT TTGTACCATC
GATCCCAACG TGGCACCAAT CCCTGTCCCC GACCCATACT TGGAACGTCT CGGTCGTGTT
GCGCAAAGCA AGGCAGTTAA ACCAGCAACG ATGGAATGGG TCGATGTGGC TGGTCTCGCC
AAGGGTGCCC ATCGTGGAGA AGGTCTTGGC AACCGCTTTC TAGCATCATT GCGGGAATGT
GATGCGATTT GTCATATAAT TCGTGCGTTT GAAGACCCAA ATGTTGTGCA TGTGGACGGA
CAGGTCGACC CAGTTTCCGA TGCCAACGTG ATTGGACTGG AACTGATCCT GGCGGACCTG
GCTCACGTCG AGCGTCGTCT AGAGAAAACT ACGTGCAGTG GTGTCGAGCG AGCGACGCTA
GAGTCCATTG CCGAATCTCT CGAAAAGGGT ATTCCGGCAC GGGATCTACA ATTGTCGAAA
GAAGACTTGC TTTCCATCAA ATCTATGGGA TTGCTGACGC TCAAACCATT CTTATATGTG
TTCAATGTTG ACGAGGTTGA CTTTTGCTAC CGAGAACAAG CTCTCGAGAC GGCCAGAGAA
AGACTGCAGT TAATTCCGTA TTGTGATCTT GAAACGAAGG ATAACTTTAC GATCGTGAGC
GCAAAGATGG AAGCGAATTT AGGGGAAAAA CCCAGAGAGA CCCAATTGAG TTACCTCCAG
GATATGGGAA TGGAGTTTGA GAAGGATGAC CAGCTTGAAG GCATACGGAG TTACAATGTA
ATGCCCACCA TGGTTCAAAG ATTGCTAAAC CTGGGTTTAG TGTACACGGG ACCCGGCGTT
GCGTCCAGTA GGTCGCAAAC CACCAAAGCG TACATGATCG ACAGCGGCGG TGGTGCCAGT
ACCGGACGAG CAACCACCGC CCATGACTTT TCCGGTCGGT TGCATGGCGA CATTCGCAAG
GGATTCACCC GAGCCGAAAT CACCAAGGCG GAAGCGCTGT TAAAATACGA CTCCTACGTA
GCCGCCAAAG ATGCTGGCAT TGTCCGTACC GAAGGCCGTG ACTACATCCT CCAACCGGAC
GAAGTCGTCT ATATCAAATG GAAATAG
 
Protein sequence
MKIKVGILGL PNVGKSTLFN ALVQKSIAHA ANFPFCTIDP NVAPIPVPDP YLERLGRVAQ 
SKAVKPATME WVDVAGLAKG AHRGEGLGNR FLASLRECDA ICHIIRAFED PNVVHVDGQV
DPVSDANVIG LELILADLAH VERRLEKTTC SGVERATLES IAESLEKGIP ARDLQLSKED
LLSIKSMGLL TLKPFLYVFN VDEVDFCYRE QALETARERL QLIPYCDLET KDNFTIVSAK
MEANLGEKPR ETQLSYLQDM GMEFEKDDQL EGIRSYNVMP TMVQRLLNLG LVYTGPGVAS
SRSQTTKAYM IDSGGGASTG RATTAHDFSG RLHGDIRKGF TRAEITKAEA LLKYDSYVAA
KDAGIVRTEG RDYILQPDEV VYIKWK
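
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for turning such display text back into a contiguous string and computing a GC fraction follows. The helper names are hypothetical, and the example input is only the final line of the gene sequence, not the whole gene.

```python
def join_blocks(display_text: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a block-formatted sequence."""
    return "".join(display_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in the sequence; 0.0 for an empty sequence."""
    if not sequence:
        return 0.0
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# Final display line of the gene sequence, used here as a small example.
last_line = join_blocks("GAAGTCGTCT ATATCAAATG GAAATAG")
print(len(last_line), round(gc_fraction(last_line), 3))
```

Applying the same helpers to the full gene sequence text should allow the listed gene length and GC content to be checked against the sequence itself.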