Gene PHATRDRAFT_50067 details

Gene Information

Locus tag: PHATRDRAFT_50067
Symbol:
ID: 7198754
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 292775
End bp: 294253
Gene Length: 1479 bp
Protein Length: 409 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002184853
Protein GI: 219129349
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0352666
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTACAGGTCT GTTGGGGGAA GCCGATGAAC AACAACCGCA TGCATAACAC ATACGGGATA 
CGCTACGACT CCAGAGCATG ATTGTACACA CGTTGGTCAC CCGTGGATCA CTAGTAACGA
AAAAAAAACG CTATAGTGTC GTACTTGCCT TGTTGTATAC AAGTTGAACC AATTGTAGTT
GAGTTAGCCT TTGTTGGATT GACTGTGTGA TACGTGGAGG AGTATGAAGG ACCGCGACCG
ATACGACGAC GGCGAAGCCC TCAGCATCAA GAAGGAGGAG GGGTACACGG CTCTGTACGC
TGTCCTGAAT GTGTCACCGG ACGCTTCCCG AGCCGATATA CAAAAGGCCT TTAAGCGACT
GAGTCGAGTC TTTCATCCGG ACAAGCGGGT GCGTCTCGGG GTGTCGACCC AAAACAATGG
TACCAACAGT ATGGCGGAAG AAGCCTTCCA GACGATTCGT CAAGCGCACG ATGTTCTTTC
GGATCCGGTT TTGCGCTTGA CTTACGATTA CGCCGGCATG TTGGCGGTGG AACTCCTGCT
GCGCTCGCAT TTGGCACGGG GCGATCGTAA CGAACCAAGG GGCAGCCACG ACGAGTCGTC
GCGATCGACC ACGACCAACA AAAATGAGGA GGATTCCGAC GCGGAAGATC CCTGGGACCA
GGACGACGAC GACGACGACG ATGACGAGGG TAACTCTTTG GACTTGTACG TTCAAGTACG
AGACGCCCCA TCCTACCAGT ATGCTACTCA AATACTAGAC GATGCCTTGT ATCGAGTGCA
ATCGCACCAA GCATCCTCAC GCACACACTC GCTTAACGGT TCTCTCGCAT TCCCTCACGT
GCTCGGTGGT GGCGGTACAC AGGACGGCTT TTGGGAGCAA GATCGCGGTA GTTTGCAATG
GCAGACCAAA CGACAAGTAT CGGCGCAATG GACGGCAACG CTCGGGGCCG GTTCGGAAGT
ATCCCGGACG GCGCAAACGG AAATGTCCAC GCAACTTTCG CTCGCCTATA CCCGACCCGG
TTACGGACCG GTGGGATCCG TGGATGTCAT ATCGTCCTCC CGAATGCCCG CAGCACCGGT
CGTCAAAATT CAAAGTGGAC GTACACTTGC CAACCAGACC AACGTGTTGT TCAGTTTAGC
CGGATCCGTG GACAATCCGG AGACCTGGAC GTATTCATTT ATGTCCAGTC GGAATATTTT
GTGGAATTCG CCTAGTAGTG AAAGACGCAA GCAAAGTCAT TCGGACCCGT CGTCACCTAA
AACGATTCAC GCCTCGTGGC AGTTGGGAAT TTCGTTGCTG GGAAAATTGC AGTATTTTCG
GGTGGAATTG CGTCAACCTA CGTTGCCGCA TAAATGGAGT GCCCGCATCG GCCTCGATGC
GCTCGCTGGT ACCTACGAAA CCGCCACGTA CACTGTATCG TACGCGCGTC ACTGGATGTG
GACGCGTTGG AAGGCCCTTT GGCACCACAA GTGGGCTGA
 
Protein sequence
MKDRDRYDDG EALSIKKEEG YTALYAVLNV SPDASRADIQ KAFKRLSRVF HPDKRVRLGV 
STQNNGTNSM AEEAFQTIRQ AHDVLSDPVL RLTYDYAGML AVELLLRSHL ARGDRNEPRG
SHDESSRSTT TNKNEEDSDA EDPWDQDDDD DDDDEGNSLD LYVQVRDAPS YQYATQILDD
ALYRVQSHQA SSRTHSLNGS LAFPHVLGGG GTQDGFWEQD RGSLQWQTKR QVSAQWTATL
GAGSEVSRTA QTEMSTQLSL AYTRPGYGPV GSVDVISSSR MPAAPVVKIQ SGRTLANQTN
VLFSLAGSVD NPETWTYSFM SSRNILWNSP SSERRKQSHS DPSSPKTIHA SWQLGISLLG
KLQYFRVELR QPTLPHKWSA RIGLDALAGT YETATYTVSY ARPLAPQVG