Gene PHATRDRAFT_40919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40919 
Symbol 
ID7198748 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp268093 
End bp269441 
Gene Length1349 bp 
Protein Length422 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184849 
Protein GI219129340 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAGTC GCGAGCCCAA CAACGATCCG GACAACCGTC TTGGAGAAAA ACCGATGGAA 
TCCCTCGAAA TTGAAGACAT ACATCAGTCG AGTCGAAGGG CCAACGCCTA CGACAGTGAT
TCGTATCATC ACGATGCTTA CACAGATTCC GAAGAAGAAT CTCCCATCGC GCCCTACGAG
TACCGTCATC GGAAAAGCGA ACATAACGAT ACGACTTTAA TGTTGGATCC CGACGATGCT
CGTCCGCTGG TGGGAAGCAG CATGATTGGA ATCAACAATA CGCGATTGAC AGCCTTGCAG
ACCGTGGTCA AAAGTCGTGC CTGCTGGCCC GTGTACATTG GGATAGCCCT CGTGGTCTCA
CTCTTGATTG CGGCCATTGC TTACGTACCC CCGGCTCGAC GCAAAGTCAC GCGACCCGAT
TTTATTTGTC CCACCGAACC GGCGTCGTCA CTGTTTTGGA CACACTTCAC GCACAAGGTA
CAAACGTGGG TGGGGCCAGA ACGGTGTCGG ACTGGACGAA ATAATGAGGT GTGCTCGTGC
GAGGACCCTA CCCAGCCGTC AATTCCGCAA AGTCCTGATT CTTGGCGGGA GGGATGGCAG
AGAGCAACGT TACGGAATGC GGCTCTCATT CAAGATCGGA AACACCTCGC GCTGGACGTT
GTCTTGCTGG GAGATTCCAT TACTGAACAC TGGTTGGGCA CAGGTTTCGC CGAACCAAAC
AACGACTATC AAGCAAATGT GCCGGTCTAT CAGTCACTCT TTTCCAAAGA ACACGGCGCC
GTGATTGAAG GCCTGGCTTT GGGTATTATT GGAGATCGTT GTCCAAACTT GTTGGCACGG
CTGCAGAACA ACGAAACGGT ACAGGGCTTG TCCGTCAAAG TTTTGTGGGT TTTGGTAAGT
TGTGTCGGGT TCAACGGTAT TTTTTTCGTA GCGACACAAT GCGCTGACGC GCTGAAATTC
CTATATTGTA ACAGATCGGG ACCAACGACT ACGCGAGCAG CTTTTGTCGT GTGGATTGTA
TCGTGGCGGG CAACTTGGCC ATTGTCCGAG AGTTGCGACT CCAAAAGCCC GAGGCCACCA
TTGTAATCAA CGGCCTACTG CCTCGCAGTA AATCGCGTAC CGACGTGGCT TTTGCCGATG
ACTTTGCTGA GATTAATCGG CGACTCTCAT GCATTGCCGA TACCTTGGAC GATGTTGTCT
TTTTCGACGC TGCGTATCTA TTCTTGACCG AAGATGGTGG CCTAAATCGA ACCATGCTGC
CGGACGGTTT GCATCCTGGT GAAGTAGGAT CACGTGTATG GGGGCAAGCA ATTGTGGATC
GAGTTTTGAA GATTGATGGT GGACTGTAG
 
Protein sequence
MRSREPNNDP DNRLGEKPME SLEIEDIHQS SRRANAYDSD SYHHDAYTDS EEESPIAPYE 
YRHRKSEHND TTLMLDPDDA RPLVGSSMIG INNTRLTALQ TVVKSRACWP VYIGIALVVS
LLIAAIAYVP PARRKVTRPD FICPTEPASS LFWTHFTHKV QTWVGPERCR TGRNNEVCSC
EDPTQPSIPQ SPDSWREGWQ RATLRNAALI QDRKHLALDV VLLGDSITEH WLGTGFAEPN
NDYQANVPVY QSLFSKEHGA VIEGLALGII GDRCPNLLAR LQNNETVQGL SVKVLWVLIG
TNDYASSFCR VDCIVAGNLA IVRELRLQKP EATIVINGLL PRSKSRTDVA FADDFAEINR
RLSCIADTLD DVVFFDAAYL FLTEDGGLNR TMLPDGLHPG EVGSRVWGQA IVDRVLKIDG
GL