Gene Information |
Locus tag | PHATRDRAFT_55108 |
Symbol | |
ID | 7198370 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 346647 |
End bp | 349341 |
Gene Length | 2695 bp |
Protein Length | 718 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | heat shock factor |
Protein accession | XP_002184603 |
Protein GI | 219128822 |
COG category | |
COG ID | |
TIGRFAM ID | |
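
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported Gene Length of 2695 bp. The helper names `span_length` and `min_cds_length` are hypothetical and are not part of the database. The second helper estimates how many nucleotides are needed to encode the protein plus one stop codon; the remainder of the gene span would then be non-coding (for example introns), which fits the "Is gene spliced | Yes" field.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein of this length plus one stop codon."""
    return (protein_aa + 1) * 3


# Values copied from the Gene Information fields above.
gene_len = span_length(346647, 349341)
cds_len = min_cds_length(718)

print(gene_len)            # 2695, matches the Gene Length field
print(cds_len)             # 2157
print(gene_len - cds_len)  # 538 bp of the span not needed to encode the protein
```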
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.260689 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGGAA GCGCCTCGAG CCCGGTTTCC GTACGTTCCG TTCCAACTTG CGGGGGTTTG GTCGAGAGTC TTGGTGTACA ATCGTCCGAG TCTGCGCGTG TGTGTGTGTG TCCACCATGG CCGGCCCCGG CCCCGGTTGA AGGACGGTAG GTTTTTTGTA AAAGTACAGA CGACTTCTAT GTATAGTAAT AGACGTCAGA CGGGCTGACT GTGTGACTGT GACTGTCTGA GAACGAATCG AGACCCTGTG CATTCCATCC TGGCCTATTC AGAAGTGGAT GCACAAGTTA CCCTACAGGG ACTGTTCATA CTACTGTGGA AGGGTTGGTA GTGCAGTTCC AAATCAATCG ACTCGCAGCT CTTGGAAAGC CCCCCTCTTA CCCATTTACC CTACCCCTGT CTTCGTCACA CCCACACAAA CCGACAACCC CCTTTTTACA TATCCTCCAA AAACAGACAA CGTGCATTGC ATTCACTTGT CCCTCCTTCG ATCCCCGTTG CCCACCGGTC GACCAACCCG CGCTTCCTGG CGTCTCGTCT ACAAAAACCT AGAGGCACTC TCGTCCGTCC GCTTGCGTCT CTCAATGGAC AACCACAATC ACAACAACAA TCCGTCGCAT GTCCGGAACG CCTTCGCTCC GGGAGCGTGG ACTCCGTGGC GCGTTCCACC ATCATCGCAC GAGCCTCGTC GTGACGACGC GTCCGACACA CTCCGAAACC AACCCTTGTT CTCGGCGTCG CGGACTTCGA CTACTACCAG ATCCAGTACT ACTATTACTA CCACTAATAC TACTACAGGC GGCACCGTAT CAATGTCGGA CAAAAAACGG CGTGGTCGTC CTAGAACCTC CAAGGAGGGG ACCGCCTCGT CCACCGCCTC GCCCAGTACG GTGGGATCCA AGAAATTGCC TCAATCGTGT CGATGTAATT TTGGACTCTT TCTGGATACC ATGGTTCGCG AAGTTAGCTT AACCCGACCA GACTTGGTCC ACTGGACCAG GGATGGTTCC GGATTCTTCA TTCATCAGGA ACGCAGTGCC GACGTGAGTC AAATACTGGC CAAATACTTT CTCCGTACGT ACCGGGACGC CGCAATCGAC CCCCGACAAT GCATTCCGTT CTACTCACGG TCCCACGAAT CACTCACACT CGTTGTCCTG CTTCACAGAC GGGAATTTTG ATTCCCTTCG GCGACAATTG AACGTCTACG GCTTTCGCAA ACAAAAGAAG CCCGGTCCGT AAGTAGTACA ATAGTGTCAG AGCCCTGCTC GGATTGGATT CCCGTGTTTC ACCCCACGAA CCAAACCGTA TTTTACACTC AATCCAACTT TGCTCACATA CATACACACA CACTCACACC GTCATTGTCA ATGGTTTCGT ATCTCTACTC ATTTTGCCCC GCAGGAATAG AGGCAGTTTC CATCATCCCG ATTTTCACCG GGACCGACAG GGGGATTTGG ACGAAACCAA TCAGCCCCGT GCACCCTCTA TCAAACCCAG CAAGCGCGAG CGACGCAAGA AAAAAGAGTC CGAATCGGAA TCCGCCTCCG ACACCCGGGG CGGAATGGCA CATCGAGTCC GCCACACACC CACTCCCCAA CGGAAAGACG ACCCTATCCC TCCTAGCGTG GTTGCCGCCA CGATCTCCGC CGTACCGGTC CACGAACAAT CGACGGGAGA GCCAGAACCG AAAAAGAAAA AATATACCAG GAAAAAAGAC GAGACTAACA CGGTGGCTTG CGTTAAGGAA GCGATGCCCC CGCCACCCTT GGTTCTGCCG GAGGGGTCTG CCGATGTGAC 
GTCGTTAAAC GCCATCATAG CCGACGCCCT CCACGAATCC ACAGCCGCGG CCAAGACGAA AAGTCGCCGA CGAAAGAAAG CACGACGCCT CAATCAAAAC TACGCGACCT TTCATCATCT GTATACTACG CCGGTCTTTT CCGAAGATTA CTCTACCGGG GTTTACACTG GTGCAGTCCC TAACTGTGCA GTATACGGCC CACCGTACTT GCCCGTGATT CCGGAGGATG ATACCATTGG TCACGGCATC TGGCTGGACG GCGTTGCTCA GGAAGAGCCG TTGCGCGAAG GACCAACAAA AGTTGATTCG GACGTCGTGC CGGAACGGAA ACCAAGGAAA ACATACACTA GGAGAGGCAC CGCGGCTACG AAGGATGCTG CCAAACAAGC GAAACACAAG GACAACAAAA CAGTCAAATC AAAGCCGCCC CGCCAAAAAA GTGGACAACG TGAACCCCGT GTTCGACCCA GACTCGAATT CCAGAAAAGG GTTCCGCGGG CTGAGAAATT ACCCGTCAAG AAAGATAGCA GAAACATCGC GAGGCGTCCC GTCATGGATG TAGACATGGG TGAAATCGAT TTGACCGACA TATCCACGCT GCCGCGGGGC GTTACGATGC GCCCCTCAGG AAAGTGGCAA GCTCAGCTCT ACTTCAACGG TCGATCACGC TATATTGGAG TCTTTGATAA TGCGCGATGC GCCGCGTTCG CGTACAGGAT TGTGCACCAA CGGCTGCGGC ACGACAAGTA TCATTTCCTG AGCAGCGATG TTGCTACCGA AGTTTTTGCC AAAGTTCGGA TCGAGGCAAA CGATGCCGTG GACAACTTAC TGAGGGGGAA AACCACTGAA AAGGAGGCAT TAGATAGGCT CGCTGAAAGT GAATCTCTGC TCTAG
|
Protein sequence | MHKLPYRDCS YYCGRVGSAV PNQSTRSSWK APLLPIYPTP VFVTPTQTDN PLFTYPPKTD NVHCIHLSLL RSPLPTGRPT RASWRLVYKN LEALSSVRLR LSMDNHNHNN NPSHVRNAFA PGAWTPWRVP PSSHEPRRDD ASDTLRNQPL FSASRTSTTT RSSTTITTTN TTTGGTVSMS DKKRRGRPRT SKEGTASSTA SPSTVGSKKL PQSCRCNFGL FLDTMVREVS LTRPDLVHWT RDGSGFFIHQ ERSADVSQIL AKYFLHGNFD SLRRQLNVYG FRKQKKPGPN RGSFHHPDFH RDRQGDLDET NQPRAPSIKP SKRERRKKKE SESESASDTR GGMAHRVRHT PTPQRKDDPI PPSVVAATIS AVPVHEQSTG EPEPKKKKYT RKKDETNTVA CVKEAMPPPP LVLPEGSADV TSLNAIIADA LHESTAAAKT KSRRRKKARR LNQNYATFHH LYTTPVFSED YSTGVYTGAV PNCAVYGPPY LPVIPEDDTI GHGIWLDGVA QEEPLREGPT KVDSDVVPER KPRKTYTRRG TAATKDAAKQ AKHKDNKTVK SKPPRQKSGQ REPRVRPRLE FQKRVPRAEK LPVKKDSRNI ARRPVMDVDM GEIDLTDIST LPRGVTMRPS GKWQAQLYFN GRSRYIGVFD NARCAAFAYR IVHQRLRHDK YHFLSSDVAT EVFAKVRIEA NDAVDNLLRG KTTEKEALDR LAESESLL
|
| |
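
The sequences above are printed in space-separated groups of ten characters, so they need to be cleaned before any counting. The following is a minimal sketch of how a GC percentage such as the one in the GC content field could be computed. The record does not state how the database derives its 54% figure, so the plain G+C count used here is an assumption, and `clean_sequence` and `gc_percent` are hypothetical helper names. The example input is only the first two groups of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (plain count, assumed method)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence shown above.
example = "ATGGTCGGAA GCGCCTCGAG"

print(len(clean_sequence(example)))  # 20
print(gc_percent(example))           # 65.0
```

Pasting the full gene sequence into `clean_sequence` and taking its length also gives a quick way to compare the printed sequence against the Gene Length field.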