Gene PHATR_10585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_10585 
Symbol 
ID7203933 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1338800 
End bp1340140 
Gene Length1341 bp 
Protein Length337 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186234 
Protein GI219113301 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ACCGTCAGAC TGTTCAGTCG CCATTTGCTG GACAACTCGG AGCAATTTCC CGACGTAGCA 
GAATACTTGC TGCAAGCACG TAGTGATTCG GTGCACTCCT TTATTCTTGA TTCCGAAATA
GTTGGTGTTT ACATGGACGC AACGAAAGGA AAATTTCGAA TGCTTCCTTT TCAAGATCTG
TCTACTCGGC GAGGAAGTCA AAGCGTTGAA AGAGATAGAG TTCAAGTTCG CATCTACGCC
TTTGATTTGT TGTACCTCAA TGGAGAATCA CTTTTAAAGG TTCCTTTTTG GAAGCGAAGA
AAGCTTTTAC AAGAGGTATT CAAAGAGACA CTTGGATTCG CTCACGCGCA ATCGGTTGGA
CTTGCCACTT TCGATGACGA ACTCTTGAGA GCTACTCTGG AACAAGCGGT CACGGATGGT
GCTGAAGGCC TCATGATTAA ACTCACAGGT GAAGGCTATC ACGCACCGAA CGCTGACGTA
TCCTGTCGCA CCTTTGGCTA CGAATCGGGG ACACGCAGTC AGCTCTGGCT GAAGTTGAAG
AGAGACTATG TTGTGGGCTA TGCCGATACA ATCGATGTCG TACCGATAGG AGCTTGGTAC
GGGAACGGTC GGAAAGCACA GAAGGGATTC CTAAGTCCAA TACTCTTTGC CGTGTACGAT
GAAGACGAAG GAGCCTTCCG CTCCATCTCC CGGTGTATGA GCTTTACAGA CGCCATGTAT
CAAGGTAAGG ACTTGCTGCC AAGAAATGAG ATGCCTGAAC GTTTGTACTT GTCTACTCAT
TTGCTTTGAC CCAGTAGCCA TTAAAAATTT TTATTTCCAT GGTAAGCCAT ATCCAGAAAA
AGTAGGTTCT GGAGATCCAA AAGTTGATGC AGCGACTGTC GTGCAAGAAT CAAATGCAAG
CCACGTGGTT GCCTCTGACG ATGAATCCTT GGAGGACATG GAAGACCAGA CTGCTGAGGT
AGCTGATGGA CTCTTGGAAG GACGTGTCAA CTGCTACACC ACTACTCCAC CGTCAACTTA
TATCATTACA AATGAGACGC CTAGTATTTG GTTCAAACCG ATGGAAGTTT TTGAAGTTTC
TTTTGCGGAC CTTTCACTTA GTCAGGCCCA TACAGCAGGG GCAGGTCTTC TAGATGACCC
TCAAGGACGA GGGGTCGCAA TGGTACGTTC TGGATTGTTT TTGTTGCAAT TTGAAAGGAT
CGGAACAGCC TCATATATCA CATGTCCTTG CAGAGATTCC CTCGTTTTAA ACGCCGTCGG
CCAGATAAAT CAGTTGAACA AGCCACAACA ACTGTTCAAA TTGCACAGCT TTTTGGCCAG
CAGTCAAAAA TGAAGAGGTA G
 
Protein sequence
TVRLFSRHLL DNSEQFPDVA EYLLQARSDS VHSFILDSEI VGVYMDATKG KFRMLPFQDL 
STRRGSQSVE RDRVQVRIYA FDLLYLNGES LLKVPFWKRR KLLQEVFKET LGFAHAQSVG
LATFDDELLR ATLEQAVTDG AEGLMIKLTG EGYHAPNADV SCRTFGYESG TRSQLWLKLK
RDYVVGYADT IDVVPIGAWY GNGRKAQKGF LSPILFAVYD EDEGAFRSIS RCMSFTDAMY
QGRVNCYTTT PPSTYIITNE TPSIWFKPME VFEVSFADLS LSQAHTAGAG LLDDPQGRGV
AMRFPRFKRR RPDKSVEQAT TTVQIAQLFG QQSKMKR