Gene PHATRDRAFT_28684 details


Gene Information

Locus tag: PHATRDRAFT_28684
Symbol:
ID: 7202525
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 455523
End bp: 457613
Gene Length: 2091 bp
Protein Length: 474 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002181559
Protein GI: 219122453
COG category:
COG ID:
TIGRFAM ID:
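The Gene Length field can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed 2091 bp. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 455523
end_bp = 457613
protein_length_aa = 474

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1
print(gene_length)  # 2091

# Bases needed to encode the protein itself (stop codon not included).
coding_bp = protein_length_aa * 3
print(coding_bp)  # 1422

# Remaining bases in the gene span (introns, stop codon, any flanking bases).
print(gene_length - coding_bp)  # 669
```

The gene span is longer than the bases needed to encode 474 residues, which is what one would expect for a record marked as spliced.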


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTCGATCTAC CAACAATGGA CGAAAATCGC AAGCGCAAGG CTTCGGAAGA TATTGAGGAA 
GGGATTGTTT GTCCATATTT GGACACGATT CAGCGATCGT TGTTAGATTT CGATTTCGAG
CCGGCATGCA GTATTTCAAT GCAAACAGGC CCACACATTT ATGGTTGTCT CGTTTGCGGC
AAGTACTTTC GCGGACGTGG AATCCAGACA CCAGCTTACA CACATTCAGT AGAAGAATCG
CACTTTGTGT TTGTGCATTT GACTAACGGG ACATTTCACT GTCTGCCGGA TGATTACGAA
ATTAAAGATA CGTCACTAGT CGATATTCAA GATGCATTGC ATCCGAAGTT TTCCCCGAGC
GAAATTCGAA CAATCGACAC CCACACAGAG CTCAGTCGGG ACCTCTTTGG ACGACGGTAC
CTGCCAGGCT TTGTTGGCTT GAATAACCTT CACAAAACTG ACTGCGTGAA TGCCGCCGTA
CAGGCACTGG CACATGTCCA GCCATTGCGT GATTTCTTTC TATCGAAAAG CCATAACGAG
TCACTCCTGT CGTCGAAAAA ATCCCAAGCG TCGAACCGAC TTGCTCATCA CGTGGCACAA
TGTTTCGGAG AGCTTGTGCG TAAAATTTGG AGTTCTAAGC GTTTTAAATC GACGGTCGAC
CCCCACATGC TGATCCAAGC AATTGCCACT GCCTCGAAAA AACGCTTCAA AGTCGGTGTA
CAGGCCGAAG CGGGGGAACT TGTGGCGTGG TTGCTGCATC GGTTGCATGT CGGGACAGGT
GGAGGTCGTA AGGCTGGTAG TAGTATTGTG CACAAAACAT TTCAAGGGAA AGTACGAGTC
ACGACAAGAG AAGCAAAGCG GAAAAGGTTG GAAGCGAAAG CTGAAGAAGA CGACCGATGG
GGAAGCGAGG ATGAAGGCGC GACTGAGCAG GAAGGTCTCA AAATGAATGA TCAAGAAGTG
TTAGTAGAAA TTGAAGAAAC CGCCACCGAT ACACACTTTC TACAGCTCAC TTTAGACATA
CCGGAAAAGC CACTATTTCG CGACGAAGAC GGTGGTTTGG TCATTCCACA AGAACCGCTG
GTGTCTGTTC TGAAAAAATT TGATGGTGTT ACTTTTTCAG ATGCCCTCAA CCGCAGCGGC
GTGGCCCAAC GGAAGCGCTA CCAACTCCTA AAACTACCGG ACTACTTAAT CTTACACTTG
GCTCGCTTCA AAGACAATCG GTATACAAAA GAAAAAAACC CTTCAATTGT CATGTTTCCG
GTAAAGAACC TTGATCTTGG CGAGTACGTG CACAAGGAAA AACAAAGTCT ACCAACTGAG
GAGCAAATTC GAGGAATGAC TGTACGTCAG GTGTGCGTTT TCGTATGATT TCGCTGCTAT
TTTTCAAATC GTTGTCTGAA TAGTGTTTTC GTGATGTCAA CAGGTAAAGG AGCTAATGGC
ACTGCTTGCG AAACACGACC GCACCGCTTT AGGAGTATCT ATGCTAGAGA AGAAGGAGCT
CGTTGACGCA ACCGTGGATT TTTTTTTGAA GAGTTTGCCC GACTTGCTCT CTGAGAAGTA
CGATTTGGTT GCGAATATAA CGCACGAGAG TCCTGCTGAC GTTGGTCGCG AAGGTCAACA
CGACCCATTG CAGGACGGCC ACTACAAGTG CCATGTGCAG CATCAAGCCA CGCGACAGTG
GTATGAAATT CAAGACTTGC ACGTTCAAGA GATTATGCCG CAGCAAATTG GACTTTCCGA
ATGCTATCTT CTGATCTTTC GAAAGTCAGG ATTGTAAGTA AATACTAGAT TTCAAAACTA
CCAACCTTCA ATGAACTTTA GAGACGTAGT CTTTGTCGGG TCCTGAAAGA GCCGGTGGAA
CAGCCAAGAA CAGTCCATAA ATATCCATGT TCTTACGAAC AAATGCTGGA TCCAACTCCT
GCGCTCGACA CCATTTCAAT GCGGCTTGTT CCATACGCTC TCGTCTCTGC AATGCCGTGT
CCGATTGAGA AGGCAAGTCG CTGTCTTCGT TTTCGTCTTC TTCGTCCGTC GACCGTCCCC
ACGGGAGTGA CGACGGAAAG AATGCACGAT CTCGTTGCGT ATTGGGATCC T
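
The gene sequence above is listed in space-separated blocks of 10 bases, 60 per line. A minimal sketch of how such a listing might be joined into one string and checked for length and GC fraction is given below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first listing line is used as an excerpt, so the printed GC value describes that excerpt and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, used here only as a short excerpt.
excerpt = "TTCGATCTAC CAACAATGGA CGAAAATCGC AAGCGCAAGG CTTCGGAAGA TATTGAGGAA"

seq = clean_sequence(excerpt)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full listing to `clean_sequence` should give a string whose length matches the Gene Length field, and `gc_fraction` on that string can be compared with the GC content field.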
 
Protein sequence
MDENRKRKAS EDIEEGIVCP YLDTIQRSLL DFDFEPACSI SMQTGPHIYG CLVCGKYFRG 
RGIQTPAYTH SVEESHFVFV HLTNGTFHCL PDDYEIKDTS LVDIQDALHP KFSPSEIRTI
DTHTELSRDL FGRRYLPGFV GLNNLHKTDC VNAAVQALAH VQPLRDFFLS KSHNESLLSS
KKSQASNRLA HHVAQCFGEL VRKIWSSKRF KSTVDPHMLI QAIATASKKR FKVGVQAEAG
ELVAWLLHRL HVGTGGGRKA GSSIVHKTFQ GKEGLKMNDQ EVLVEIEETA TDTHFLQLTL
DIPEKPLFRD EDGGLVIPQE PLVSVLKKFD GVTFSDALNR SGVAQRKRYQ LLKLPDYLIL
HLARFKDNRY TKEKNPSIVM FPVKNLDLGE YVHKEKQSLP TEEQIRGMTV RQKYDLVANI
THESPADDGH YKCHVQHQAT RQWYEIQDLH VQEIMPQQIG LSECYLLIFR KSGL
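
The start of the protein sequence can be matched to the gene sequence by translating from the first ATG. The sketch below assumes the standard genetic code, since the Translation table field in this record is empty. `translate` and the other names are illustrative helpers, and only the first 50 bases of the gene sequence are used.

```python
# Standard genetic code in TCAG order (assumed; the record lists no table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate complete codons of an uppercase DNA string, frame 0."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 50 bases of the gene sequence with the block spacing removed.
gene_start = "TTCGATCTACCAACAATGGACGAAAATCGCAAGCGCAAGGCTTCGGAAGA"

# 0-based offset of the first ATG in the listing.
orf_offset = gene_start.find("ATG")
print(orf_offset)  # 15

# Translate the first ten codons from that ATG.
print(translate(gene_start[orf_offset:orf_offset + 30]))  # MDENRKRKAS
```

The ten residues obtained this way agree with the first block of the protein sequence. Because the gene is spliced, translating the full gene sequence in one frame would not be expected to reproduce the whole protein.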