Gene PHATRDRAFT_26290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_26290 
Symbol 
ID7198120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp1110483 
End bp1113752 
Gene Length3270 bp 
Protein Length989 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178642 
Protein GI219115693 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.793152 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGCACGGAG ACCAAACCAC AGAAGTTCCT TCGCCCTCTT TACACCGAAC GCCACAACTG 
GCAGTGAGGA CAAAACGACC GTTTCATCCC TGACCGTGAC TCCTTTCTGC ACATTCTGCT
TACTCTCGCA CACACGAGTA GATAAGATAG CATAGCATGT CCTCTTTGGC TCTCGCACGT
CGACTTTCGT CGTCCCTTGG GCATGGTAAG AAGTCAATCC CTCACTCCGC CTTTGCGACG
CTGGCAAATC CTCTTGTTTC TTGGAACGGC GATGTGAGTA GGAATACTGA TGCTAGTCCG
AGCGTTAGTG CTACCTCTCG CAACCATGTC CATGCACTCT GTGCTTCCAG TTCCACCTCG
GCTACTGTAT TTGCGAAGAA TAGTACGTTC ACTGGTAGCA CCTGGTCTCC AGCCTCGGTC
CCTTTGATGC GCAGCTTTTC GACGGTGTCT TCGACGGACT TTCTCCAAGC CTACGAAGCG
CACGTGGCGG AGCGTCTCAA CGAAGCCGAC GGTCTCGGCA TTGCGCCAAA GCCTCTCAGT
GCCGAGCAAG TTTCGGATCT CATTGCGGTC CTAAAAACGG AAACTCCCAA CGCTGAAGCC
TTGCTCGACC TCCTCGTCAA TCGCGTTCCT CCCGGAGTCG ACGAAGCTGC CTACGTCAAA
GCTACCTGGC TCAGTGCGTT GGCCAAGCAA GAAGAAACTA ACCCCTACAT TACACGAGCC
CGCGCCGTCG AGCTCCTCGG GACTATGCAA GGAGGCTACA ATGTTGCTAC TATGGTGGAA
CTTTTGGAAG ACTCCGATAA CGAGATTGCC ATGATTGCCG CCGACAAGCT GTCGCATACG
CTTCTCGTCT TTGACGCATT TTACGATGTG CAGACCATGC ACGAAAAGGG CGTCGCCGCC
GCTACCAAGG TGATGGAGAG TTGGGCCAGT GCGGAATGGT TCACGAAGAA ACCCAAGGTC
CCCGAAGTTA TGACCGCCAC CGTCTTTAAG GTAACAGGAG AAACCAACAC GGATGATTTG
AGTCCGGCAC AGGACGCTTG GTCGCGTCCC GACATCCCTC TCCACGCCGT GGCCATGCTC
AAGAACGCTC GCGAAGGTAT CAACCCGGAT AAAGAAGGAG AAATCGGTCC GATTCAACAG
ATCCGCGCCT TACAAGAAAA AGGATTTCCT CTTGCATACG TCGGAGATGT AGTCGGCACC
GGATCGTCCC GCAAGTCCGC GACGAACTCC GTACTCTGGT ACATGGGCGA CGATATCCCC
TACGTGCCCA ACGTCAAATC TGGTGGACTT TGTTTGGGTA GCAAGATTGC GCCAATCTTT
TTCAATACCA TGGAAGATTC TGGAGCCCTG CCCATTGAAT TGGATGTGGG CGAAATGAAT
ATGGGCGACG TCATTGACGT CTACCCCTAC GAAGGTGTTG TCAAGAACCA CGACACGGGC
GAAGTCGTTA CGGAATTCAA ACTCAAGACT CCCGTCATCA TGGACGAAGT CCGAGCGGGA
GGACGAATTC CTTTGATTAT TGGTCGTGGC TTGACCGTCA AGGCCCGCGA AGCCTTGGGT
CGAGATGCCG CCGTAACGGA TGTCTTTCGT ATGCCGGCAC TTCCCGAAGG CAGCAAAAAG
CCGGAAGGCT TTACCTTGGC CCAAAAAATG GTCGCCAAGG CCTGTGGACT TCCCGATGGT
GAAGGTGTCG TGCCTAATCA GTACTGTGAA CCCCGCATGA CCACGGTGGG TTCGCAGGAT
ACAACTGGTC CAATGACTCG CGACGAATTA CGCTCCTTGG CTTGTTTGGG ATTCTCCTCT
GATTTAGTCA TGCAAAGTTT CTGCCACACC GCGGCCTACC CCAAACCGGT CGATGTCGTG
ACTCACCACA CGTTGCCTGA TTTTATCCGC ACTCGCGGTG GTGTTTCCCT CCGACCAGGT
GACGGTATCA TCCATTCCTG GTTGAATCGT ATGTTGCTGC CCGATACGGT AGGAACCGGG
GGAGATTCGC ATACTCGTTT CCCCATTGGT ATTTCCTTTC CTGCTGGCAG TGGCCTGGTG
GCTTTTGGTG CGGCGACGGG AATCATGCCA CTCGACATGC CGGAGTCCGT CTTGGTACGA
TTTTCTGGAA CTGTGCAGCC CGGTATTACT CTCCGCGATC TGGTGCAGGC GATTCCGTAT
ACGGCAATCC AGATGGGACT GTTGACGGTC GAAAAAAAAG GCAAAAAGAA CATCTTTTCC
GGACGAATTT TGGAAATCGA AGGTCTTCCT CAGCTCAAGT GCGAACAAGC CTTCGAATTG
TCCGACGCAT CTGCCGAACG ATCCGCCGCA GGTTGTACCA TCAAGCTCGA CAAAGAACCC
ATCATCGAGT ATCTTAACTC CAATGTTGTA ATGCTCAAGT GGATGATTGC GGAAGGATAC
GGAGATCCTC GCACGTTGGA ACGCCGCATT GCTCGCATGC AGGAGTGGCT CGCCGACCCC
GTTTTGATGG AAGCCGATCC GAAGGCGGAG TACGCTGCCG TTATTGATAT CAACCTCGAC
GAACTGAAAG AACCCGTGTT GGCCTTGCCA AACGACCCAG ATGCTTCGGC CTTGTTGTCC
GAAGTGCAGG GCAGTCACAT TGACGAAGTT TTTATTGGCT CGTGCATGAC CAACATTGGA
CACTTTCGCG CCGCCGGAAA GTTGCTCAAC AAATTGGAGA AGCCCATCCC GACACGCCTT
TGGATCGCTC CTCCTACCAA GATGGACGAA GCGCAGTTGG TCGAAGAGGG CTATTACAGT
ATCTTTGGTT CGGCCGGTGC CCGTACGGAA ATGCCTGGTT GCAGCCTTTG CATGGGCAAC
CAAGCACGGG TCGCTCCGGG ATGCACCGTC GTGTCCACCT CGACGCGGAA CTTCCCTAAC
CGTCTTGGAC AAGGTGCTAA CGTGTATCTT GCCAGTGCTG AACTCGCCGC CGTCGCCGCG
ATCGAAGGTC GTCTGCCGAC AGTGGAAGAG TATATGAAGT ACATGGACCA GGTTAAGGAC
GACGCTGCTG ATACGTACCG TTATTTGAAC TTTGACCAGC TTCCGGACTT TGTCAAAAAG
GCCGACTCTG TGGAGATAAG TGCAGAAATG AAAGATGCGG CCCACAAACT TTCCATGGGG
GAGTAAAGAA TAAGCAAACA AAGGAATGCG GATATAAGCA CAAGAATTCG ATTGTTTTAC
AGTGTGCACG GATACGCATC AACATTCAAT ATACCATACA TAATACGAAA AGGATGGTTC
TAGCAAAGGT TTTGTGGTCT CTGTTACATA
 
Protein sequence
MSSLALARRL SSSLGHGKKS IPHSAFATLA NPLVSWNGDV SRNTDASPSV SATSRNHVHA 
LCASSSTSAT VFAKNSTFTG STWSPASVPL MRSFSTVSST DFLQAYEAHV AERLNEADGL
GIAPKPLSAE QVSDLIAVLK TETPNAEALL DLLVNRVPPG VDEAAYVKAT WLSALAKQEE
TNPYITRARA VELLGTMQGG YNVATMVELL EDSDNEIAMI AADKLSHTLL VFDAFYDVQT
MHEKGVAAAT KVMESWASAE WFTKKPKVPE VMTATVFKVT GETNTDDLSP AQDAWSRPDI
PLHAVAMLKN AREGINPDKE GEIGPIQQIR ALQEKGFPLA YVGDVVGTGS SRKSATNSVL
WYMGDDIPYV PNVKSGGLCL GSKIAPIFFN TMEDSGALPI ELDVGEMNMG DVIDVYPYEG
VVKNHDTGEV VTEFKLKTPV IMDEVRAGGR IPLIIGRGLT VKAREALGRD AAVTDVFRMP
ALPEGSKKPE GFTLAQKMVA KACGLPDGEG VVPNQYCEPR MTTVGSQDTT GPMTRDELRS
LACLGFSSDL VMQSFCHTAA YPKPVDVVTH HTLPDFIRTR GGVSLRPGDG IIHSWLNRML
LPDTVGTGGD SHTRFPIGIS FPAGSGLVAF GAATGIMPLD MPESVLVRFS GTVQPGITLR
DLVQAIPYTA IQMGLLTVEK KGKKNIFSGR ILEIEGLPQL KCEQAFELSD ASAERSAAGC
TIKLDKEPII EYLNSNVVML KWMIAEGYGD PRTLERRIAR MQEWLADPVL MEADPKAEYA
AVIDINLDEL KEPVLALPND PDASALLSEV QGSHIDEVFI GSCMTNIGHF RAAGKLLNKL
EKPIPTRLWI APPTKMDEAQ LVEEGYYSIF GSAGARTEMP GCSLCMGNQA RVAPGCTVVS
TSTRNFPNRL GQGANVYLAS AELAAVAAIE GRLPTVEEYM KYMDQVKDDA ADTYRYLNFD
QLPDFVKKAD SVEISAEMKD AAHKLSMGE