Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_26290 |
Symbol | |
ID | 7198120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 1110483 |
End bp | 1113752 |
Gene Length | 3270 bp |
Protein Length | 989 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178642 |
Protein GI | 219115693 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.793152 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGCACGGAG ACCAAACCAC AGAAGTTCCT TCGCCCTCTT TACACCGAAC GCCACAACTG GCAGTGAGGA CAAAACGACC GTTTCATCCC TGACCGTGAC TCCTTTCTGC ACATTCTGCT TACTCTCGCA CACACGAGTA GATAAGATAG CATAGCATGT CCTCTTTGGC TCTCGCACGT CGACTTTCGT CGTCCCTTGG GCATGGTAAG AAGTCAATCC CTCACTCCGC CTTTGCGACG CTGGCAAATC CTCTTGTTTC TTGGAACGGC GATGTGAGTA GGAATACTGA TGCTAGTCCG AGCGTTAGTG CTACCTCTCG CAACCATGTC CATGCACTCT GTGCTTCCAG TTCCACCTCG GCTACTGTAT TTGCGAAGAA TAGTACGTTC ACTGGTAGCA CCTGGTCTCC AGCCTCGGTC CCTTTGATGC GCAGCTTTTC GACGGTGTCT TCGACGGACT TTCTCCAAGC CTACGAAGCG CACGTGGCGG AGCGTCTCAA CGAAGCCGAC GGTCTCGGCA TTGCGCCAAA GCCTCTCAGT GCCGAGCAAG TTTCGGATCT CATTGCGGTC CTAAAAACGG AAACTCCCAA CGCTGAAGCC TTGCTCGACC TCCTCGTCAA TCGCGTTCCT CCCGGAGTCG ACGAAGCTGC CTACGTCAAA GCTACCTGGC TCAGTGCGTT GGCCAAGCAA GAAGAAACTA ACCCCTACAT TACACGAGCC CGCGCCGTCG AGCTCCTCGG GACTATGCAA GGAGGCTACA ATGTTGCTAC TATGGTGGAA CTTTTGGAAG ACTCCGATAA CGAGATTGCC ATGATTGCCG CCGACAAGCT GTCGCATACG CTTCTCGTCT TTGACGCATT TTACGATGTG CAGACCATGC ACGAAAAGGG CGTCGCCGCC GCTACCAAGG TGATGGAGAG TTGGGCCAGT GCGGAATGGT TCACGAAGAA ACCCAAGGTC CCCGAAGTTA TGACCGCCAC CGTCTTTAAG GTAACAGGAG AAACCAACAC GGATGATTTG AGTCCGGCAC AGGACGCTTG GTCGCGTCCC GACATCCCTC TCCACGCCGT GGCCATGCTC AAGAACGCTC GCGAAGGTAT CAACCCGGAT AAAGAAGGAG AAATCGGTCC GATTCAACAG ATCCGCGCCT TACAAGAAAA AGGATTTCCT CTTGCATACG TCGGAGATGT AGTCGGCACC GGATCGTCCC GCAAGTCCGC GACGAACTCC GTACTCTGGT ACATGGGCGA CGATATCCCC TACGTGCCCA ACGTCAAATC TGGTGGACTT TGTTTGGGTA GCAAGATTGC GCCAATCTTT TTCAATACCA TGGAAGATTC TGGAGCCCTG CCCATTGAAT TGGATGTGGG CGAAATGAAT ATGGGCGACG TCATTGACGT CTACCCCTAC GAAGGTGTTG TCAAGAACCA CGACACGGGC GAAGTCGTTA CGGAATTCAA ACTCAAGACT CCCGTCATCA TGGACGAAGT CCGAGCGGGA GGACGAATTC CTTTGATTAT TGGTCGTGGC TTGACCGTCA AGGCCCGCGA AGCCTTGGGT CGAGATGCCG CCGTAACGGA TGTCTTTCGT ATGCCGGCAC TTCCCGAAGG CAGCAAAAAG CCGGAAGGCT TTACCTTGGC CCAAAAAATG GTCGCCAAGG CCTGTGGACT TCCCGATGGT GAAGGTGTCG TGCCTAATCA GTACTGTGAA CCCCGCATGA CCACGGTGGG TTCGCAGGAT ACAACTGGTC CAATGACTCG CGACGAATTA CGCTCCTTGG CTTGTTTGGG ATTCTCCTCT GATTTAGTCA TGCAAAGTTT CTGCCACACC GCGGCCTACC CCAAACCGGT CGATGTCGTG ACTCACCACA CGTTGCCTGA TTTTATCCGC ACTCGCGGTG GTGTTTCCCT CCGACCAGGT GACGGTATCA TCCATTCCTG GTTGAATCGT ATGTTGCTGC CCGATACGGT AGGAACCGGG GGAGATTCGC ATACTCGTTT CCCCATTGGT ATTTCCTTTC CTGCTGGCAG TGGCCTGGTG GCTTTTGGTG CGGCGACGGG AATCATGCCA CTCGACATGC CGGAGTCCGT CTTGGTACGA TTTTCTGGAA CTGTGCAGCC CGGTATTACT CTCCGCGATC TGGTGCAGGC GATTCCGTAT ACGGCAATCC AGATGGGACT GTTGACGGTC GAAAAAAAAG GCAAAAAGAA CATCTTTTCC GGACGAATTT TGGAAATCGA AGGTCTTCCT CAGCTCAAGT GCGAACAAGC CTTCGAATTG TCCGACGCAT CTGCCGAACG ATCCGCCGCA GGTTGTACCA TCAAGCTCGA CAAAGAACCC ATCATCGAGT ATCTTAACTC CAATGTTGTA ATGCTCAAGT GGATGATTGC GGAAGGATAC GGAGATCCTC GCACGTTGGA ACGCCGCATT GCTCGCATGC AGGAGTGGCT CGCCGACCCC GTTTTGATGG AAGCCGATCC GAAGGCGGAG TACGCTGCCG TTATTGATAT CAACCTCGAC GAACTGAAAG AACCCGTGTT GGCCTTGCCA AACGACCCAG ATGCTTCGGC CTTGTTGTCC GAAGTGCAGG GCAGTCACAT TGACGAAGTT TTTATTGGCT CGTGCATGAC CAACATTGGA CACTTTCGCG CCGCCGGAAA GTTGCTCAAC AAATTGGAGA AGCCCATCCC GACACGCCTT TGGATCGCTC CTCCTACCAA GATGGACGAA GCGCAGTTGG TCGAAGAGGG CTATTACAGT ATCTTTGGTT CGGCCGGTGC CCGTACGGAA ATGCCTGGTT GCAGCCTTTG CATGGGCAAC CAAGCACGGG TCGCTCCGGG ATGCACCGTC GTGTCCACCT CGACGCGGAA CTTCCCTAAC CGTCTTGGAC AAGGTGCTAA CGTGTATCTT GCCAGTGCTG AACTCGCCGC CGTCGCCGCG ATCGAAGGTC GTCTGCCGAC AGTGGAAGAG TATATGAAGT ACATGGACCA GGTTAAGGAC GACGCTGCTG ATACGTACCG TTATTTGAAC TTTGACCAGC TTCCGGACTT TGTCAAAAAG GCCGACTCTG TGGAGATAAG TGCAGAAATG AAAGATGCGG CCCACAAACT TTCCATGGGG GAGTAAAGAA TAAGCAAACA AAGGAATGCG GATATAAGCA CAAGAATTCG ATTGTTTTAC AGTGTGCACG GATACGCATC AACATTCAAT ATACCATACA TAATACGAAA AGGATGGTTC TAGCAAAGGT TTTGTGGTCT CTGTTACATA
|
Protein sequence | MSSLALARRL SSSLGHGKKS IPHSAFATLA NPLVSWNGDV SRNTDASPSV SATSRNHVHA LCASSSTSAT VFAKNSTFTG STWSPASVPL MRSFSTVSST DFLQAYEAHV AERLNEADGL GIAPKPLSAE QVSDLIAVLK TETPNAEALL DLLVNRVPPG VDEAAYVKAT WLSALAKQEE TNPYITRARA VELLGTMQGG YNVATMVELL EDSDNEIAMI AADKLSHTLL VFDAFYDVQT MHEKGVAAAT KVMESWASAE WFTKKPKVPE VMTATVFKVT GETNTDDLSP AQDAWSRPDI PLHAVAMLKN AREGINPDKE GEIGPIQQIR ALQEKGFPLA YVGDVVGTGS SRKSATNSVL WYMGDDIPYV PNVKSGGLCL GSKIAPIFFN TMEDSGALPI ELDVGEMNMG DVIDVYPYEG VVKNHDTGEV VTEFKLKTPV IMDEVRAGGR IPLIIGRGLT VKAREALGRD AAVTDVFRMP ALPEGSKKPE GFTLAQKMVA KACGLPDGEG VVPNQYCEPR MTTVGSQDTT GPMTRDELRS LACLGFSSDL VMQSFCHTAA YPKPVDVVTH HTLPDFIRTR GGVSLRPGDG IIHSWLNRML LPDTVGTGGD SHTRFPIGIS FPAGSGLVAF GAATGIMPLD MPESVLVRFS GTVQPGITLR DLVQAIPYTA IQMGLLTVEK KGKKNIFSGR ILEIEGLPQL KCEQAFELSD ASAERSAAGC TIKLDKEPII EYLNSNVVML KWMIAEGYGD PRTLERRIAR MQEWLADPVL MEADPKAEYA AVIDINLDEL KEPVLALPND PDASALLSEV QGSHIDEVFI GSCMTNIGHF RAAGKLLNKL EKPIPTRLWI APPTKMDEAQ LVEEGYYSIF GSAGARTEMP GCSLCMGNQA RVAPGCTVVS TSTRNFPNRL GQGANVYLAS AELAAVAAIE GRLPTVEEYM KYMDQVKDDA ADTYRYLNFD QLPDFVKKAD SVEISAEMKD AAHKLSMGE
|
| |