Gene PHATRDRAFT_50498 details

Gene Information

Locus tag: PHATRDRAFT_50498
Symbol:
ID: 7199283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 223964
End bp: 226154
Gene Length: 2191 bp
Protein Length: 682 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002185402
Protein GI: 219130500
COG category:
COG ID:
TIGRFAM ID:
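
The listed gene length follows from the start and end coordinates if they are read as inclusive, 1-based positions. The record does not state its coordinate convention, so that reading is an assumption. The sketch below also assumes the 682 aa product is encoded by one contiguous reading frame plus a stop codon, which fits the "Is gene spliced: No" field. Variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 223964
end_bp = 226154
protein_length_aa = 682

# Assumption: inclusive, 1-based coordinates, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one contiguous frame, 3 nt per residue, plus one stop codon.
coding_length_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span that are not needed to encode the protein.
flanking_bp = gene_length_bp - coding_length_bp

print(gene_length_bp)    # 2191
print(coding_length_bp)  # 2049
print(flanking_bp)       # 142
```

Under these assumptions the 2191 bp span is 142 bp longer than the coding region needs, consistent with the listed gene sequence beginning 62 bases before the ATG that encodes the initial methionine.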


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAAAATCTTT TTCCTCGGAA ACAAGTCTAG TCTTTTTGTC GTGTCACGTG CTTGCTGTTG 
TCATGCCGCA GTTCCTTGCG GTTGTGAGTG GATTCTTACT AGGCTCTGCT TTAACGTACG
CATGGTTGAG CCGACAGGAT CGTTCCTCAT CGAAGACACT ACCAGAGGAT GAGCCATCGT
CCACGTCAAG GTCAATCGCG ACACCATCGC GTCAAGCTGG GCCGACGGCG CATCTCCGCA
CGGACGGCCT CTTGACAGAT TTGGTACGCG AATTGTGGAG TTACATTAAC GTGGCAGGTT
GCGACACTAT CCGTTCCACA GTGGAACCCA TGTTCGTTAC GTTGCCGGGT CCGTTGAAGA
CTCTGCGTTT CACCAAAATC GATCTGGGAT CGGTACCGAT TCGGATGGAC AATTTGGTCG
TTCACGAAGT CCACAACGAT TCCGTCACTG TGGCAATGGA CGTGGCCTGG GATGGGAATT
GTGATATGCA GCTCAAGGCG GATTACATTG GTTCATTTGG TGTTAAGGCA ATCAAGCTCA
AAGGACGCCT GTCGTTACTT CTCAAGCCTT GTGTAAACGC TTTGCCACCG TTCTCAGCCA
TTCAGTACGC CTTCGTTACA CCACCACAAG TTGAGATTGA CTTTACCGGT CTCGCGCAAG
TCGCGGATTT TGCGGTCCTC GACAAGAGAA TTCGAGCCAT TATTCAAGAT TCGTTCGCCT
GTGTCACATT ACCGTCCCGC ATGATGTACA AAACGGATCC GGCCTGTGAT TACTTACGTA
CGTACCAACC TCCCTTGGGG GTGGCGCGAA TTACCGTCGT CCGCGGTCGC GGCTTTCATG
TCGAAAAGAG ATCCTTGCGA GCGCACGACG TGCCCGACGT CTTTTGCCAG GTTTCAATCA
ACGCCTCCCA ACCTTTCACA ACCCGTACCG TCAAGGATAG CCTGGAGCCG GTATGGGAGG
AGAGCTGTGA TTTCATCGTC ATGGATTTGG ATCAGCATGT GATACTGAAC GCCTGGGATG
AAGACAACGG GGCACTCGAC GCCAATGACG ACCTCGGAAC AGCGAAAGTG TCCGTTGGGG
ACCTTTTACT GGCCGGCAAG ACGAAGGAGG TGGAACTCTT GGAGGGTAAT CTAAAGCGTA
CGGGCGCTTT TGTCACACTG CACTGCGAAC TATTACCGTG GACATCAGGG GAGTCGTTTG
AGGGCCTCCC TAAGGCCCCG ACCGAATCGA CAAACACTGC CAATTCGTTG GCAGGCTTGA
TGACCGTAGT CGTGGTCAAG GCGAGCAATT TACCATTGGC GAAAAAAGAA GATGCGGCAT
CGTTCGTCAA GGTCAAGTAT GGGGCAGCCT TTGAGTTTTT GACCAGCGTG GTTTTGCCTT
GTCCAGGGCT GGACGCATTG AATCCAGTTT TTGACGAGGT CACCTTCATT CCGCTCCCCC
AGGCTTTAGT GGATGATAAG AACGACGTAG TTCTTGAACT CTGGAATGGG CAAGCTATCC
TGGGCAGTGT CAATATCACG CATCAGTCTT TATTGAACAT GGATGACCAT GTACGGACAG
AAAACCTTCT CGTCGGTGAC AAGGGTGCAA AACTTTTCTT TCGGGTCTCC TTACAGGGAG
TTGCGGAAGC TCCCATTGGA ATACCGACGG TGTCTGCTAT TGAACCGATG GGAAAAGACG
ATTCTGAGGA CCCCGGCACG GCAGCTGCAC TTGTTGGCGG TACTTTGGGC AAAGTAAGTG
TGACGGCAGT CCGAGGATGG GGCTTTGTGG TGGAAAAGCG TCGGTTTAAA AAGAACGACG
TTCCCGACGT ATACTGCAAT ATACAGTTTG GCTCGAGTCC TACAGTGTGG CGGACGAAAA
CGGTGCGCAA TTCTACGACA CCCACTTGGA ACGAATCCTC CACGTATCCG CTGTCCGATC
ACAGCCAAAT TTTGCATTTG AACGTGTTCG ACGAAGACGG CGGTGCCCGG GATACGGACG
ATGCGTTGGG ATCGGCCCGG GTAGCTGTCG GTAAGATTTT ACTGGCGGGC GGTAGTATGG
ATGTGGAGCT GCTCCGTACA GGCAAGCCTA CCGGATGTTA CATCCAAGTC CGTTGTGCTC
TATTGGACTG AAAACAACAT TCACCATCCA TAAGTGAACA GTAGTGACTA CATCACGTAT
GCTATCAAGT ATCTTATATT TTGCATGCTT T
 
Protein sequence
MPQFLAVVSG FLLGSALTYA WLSRQDRSSS KTLPEDEPSS TSRSIATPSR QAGPTAHLRT 
DGLLTDLVRE LWSYINVAGC DTIRSTVEPM FVTLPGPLKT LRFTKIDLGS VPIRMDNLVV
HEVHNDSVTV AMDVAWDGNC DMQLKADYIG SFGVKAIKLK GRLSLLLKPC VNALPPFSAI
QYAFVTPPQV EIDFTGLAQV ADFAVLDKRI RAIIQDSFAC VTLPSRMMYK TDPACDYLRT
YQPPLGVARI TVVRGRGFHV EKRSLRAHDV PDVFCQVSIN ASQPFTTRTV KDSLEPVWEE
SCDFIVMDLD QHVILNAWDE DNGALDANDD LGTAKVSVGD LLLAGKTKEV ELLEGNLKRT
GAFVTLHCEL LPWTSGESFE GLPKAPTEST NTANSLAGLM TVVVVKASNL PLAKKEDAAS
FVKVKYGAAF EFLTSVVLPC PGLDALNPVF DEVTFIPLPQ ALVDDKNDVV LELWNGQAIL
GSVNITHQSL LNMDDHVRTE NLLVGDKGAK LFFRVSLQGV AEAPIGIPTV SAIEPMGKDD
SEDPGTAAAL VGGTLGKVSV TAVRGWGFVV EKRRFKKNDV PDVYCNIQFG SSPTVWRTKT
VRNSTTPTWN ESSTYPLSDH SQILHLNVFD EDGGARDTDD ALGSARVAVG KILLAGGSMD
VELLRTGKPT GCYIQVRCAL LD
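
Both sequences are printed in space-separated blocks of ten, so they need to be joined before any length, GC, or translation check. The sketch below is one way to do that with the Python standard library. The helper names (`clean_blocks`, `gc_percent`, `translate`) are hypothetical. Because the record leaves "Translation table" empty, the standard genetic code (NCBI table 1) is assumed here.

```python
# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Assumption: the record does not name a translation table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for base in seq if base in "GC") / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 until the first stop codon or the end of input.

    Raises ValueError on any base other than T, C, A, G.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        b1, b2, b3 = seq[i:i + 3]
        index = 16 * BASES.index(b1) + 4 * BASES.index(b2) + BASES.index(b3)
        residue = AMINO[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 100 bases of the gene sequence, copied from the record above.
fragment = clean_blocks(
    "CAAAATCTTT TTCCTCGGAA ACAAGTCTAG TCTTTTTGTC GTGTCACGTG CTTGCTGTTG "
    "TCATGCCGCA GTTCCTTGCG GTTGTGAGTG GATTCTTACT"
)
start = fragment.find("ATG")           # first ATG in the fragment
prefix = translate(fragment[start:start + 30])
print(start, prefix)                   # 62 MPQFLAVVSG
```

The ten residues obtained this way match the first block of the protein sequence. Running `gc_percent` on the full cleaned gene sequence would be the way to compare against the GC content field.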