Gene PHATRDRAFT_25000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_25000 
Symbol 
ID7196911 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2175413 
End bp2176563 
Gene Length1151 bp 
Protein Length221 aa 
Translation table 
GC content45% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176923 
Protein GI219110343 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.19447 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGTTCATTTT GACGAAATAG GGGCGAAGTT ACGTGCTCAA ATCAACTTTG CAAATGAGCG 
ATCTTACAGT TGCTGTCTCT GAGCCTCCTG CACCGCAACA CATGGAACCG ACGGATGCCA
ACAAAAAGCA AAAGATGTCC TATCAGATGG CAACAGACAG CGATTGGCCC GATGCATGGC
ATATGCCGGA TGCTGTCGAA GACCAAACCA AGCCCAACCG ATTGGAGCCC AATGTCCCGG
CAACAGTGGA CGAGCTGAAA AAGATAGGTC TCTGCTATTG GAAAATGGAC GCTGAGTCGT
ACGAGTATCC CGTCAAGGCT GTACCATGGG TAAGATGCAC CTTTGATAGA AATATTCTGT
GGATGGACGG CTGCTGAAAT TTCAATCAAT TTTGTCGACA TGAGACTGAC GCGATGTCCG
TTCATTTCTA AAAAAAAACA GAATCCGGAA AACGCCACTG ATCCCAAGCT AAAGGCTCTC
CGCGACGATC GTGGATATAG GTGAGATAGA GACTGTACGT TAGAAGAAAT TCCATCGACT
ATACACTAAA GTTTTGGTTG CGTTTCCCAA AGTTATGCGG ATATCATTAC GATCCATCCC
GATTACCTAC CGGTATGTTT CATGAGTCAA ACTGGGTCTG CGCACGGAAT CTCAGGATGG
ATGAGGTGAG ACTAACTCGT ATTTTATGAA TATAGGACTT TGAGAAGAAA ATTGCAAGCT
TCTTTGAAGA GCACATTCAT GATGCCGAGG AAATCCGATA CATTCTTGGT GGATCAGGTT
TTTTTGACGT CCGCAACTGT AAGTACCATG TCATCGACTG TCCAATACTA CGGCTCAAAT
GATCTCATTT TGTTCTTCTA CTCCTGCGAA TAGTGGAAGA CAAATGGATT CGGATTCATG
TCAAAAAGGG TGATCTCATG ACACTCCCAG AGGGCATTTA TCATCGTGAG TAAAAGGTCC
TAAACTTTCT GGTTGCTCGA CACCATTTTT GCATTCAACA ACACAGGGTG ACTCATTCCT
CTCGTGCGTT TTGAACAGGG TTCACATGCG ACGAAGAACG CATAATTCAT GCTATGCGTC
TTTTCATCGG AGAGCCCGTG TGGACACCAT TCAACCGTCC TCAGGAAGAC CATCCATCCC
GAAAAAAGTA C
 
Protein sequence
MSDLTVAVSE PPAPQHMEPT DANKKQKMSY QMATDSDWPD AWHMPDAVED QTKPNRLEPN 
VPATVDELKK IGLCYWKMDA ESYEYPVKAV PWNPENATDP KLKALRDDRG YSYADIITIH
PDYLPDFEKK IASFFEEHIH DAEEIRYILG GSGFFDVRNL EDKWIRIHVK KGDLMTLPEG
IYHRFTCDEE RIIHAMRLFI GEPVWTPFNR PQEDHPSRKK Y