Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43394 |
Symbol | |
ID | 7197422 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 330003 |
End bp | 334359 |
Gene Length | 4357 bp |
Protein Length | 1071 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177912 |
Protein GI | 219112321 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.560278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACAGCACTT CCCTCCGCAA TGGTGGAGCA CGTTGTCTCG GATTGGCCAG ACGACTTGCA TCGCGTCTTA AACGGCACCG TTGCCACTAC TGTCACCACG AATGACGATG ATCTGCTCAC ACAGAATGAT TCCAGCATCG TCCGCGAAAC CTTTCGAATA TACGGAACCT TGTATATTTC TGCCTTTGTG CTGTACGCAA TTTTGAGACG GTGCTATCCG TGGCTGTTCA ATGTACGCGG CTGGGTTCCA GAGCTACGAT GTGATCTAGC GCAAACGAAA TATGGACTCA TATCGTGGTT CTGGAAAGTC TTTCGCGTGA AAGACCAAGA TATCTTTGAG CAATGTGGAA TGGACGCCCT TTGTTTCATT CGAGCCATGC GCTTCGGTCG TAATCTAGGC CTTATGGGAT CCTTCAATGC TCTGTGGCTG ATACCGGTCT ATATGACAGC AAAGGAGGCG CCCGAAACTG AGTATCTGAA CGATTGGCTG ACAGAAATTT CGGTCGCACA TCTTCCGAAC AGGTCATCAC GCTACACTGG AACGCTGCTG GCCACCTACA TTACATTTCT GTTCACAATG TACTTGATTC TACAAGAGAT GCATTGGTAC ACGTACTGGA GGCATCGCTT TTTATCACAG AGGGAACCCC GAAACTACGC CGTATATGTG GCAGGAATTC CTGACGAGTG GCGCAGTAGT AAAGACCTGA CCGCTTATTT TCATAACTGT ACTTCAAAAG ACTCCGTTTT GGAAGCACAC GTTGCGATGG ATCTACCGAG CTTGGAGGCA AAGCATTTAC GACGAGGTGA TGTCGTTCGA AAGCTTGAGC ACGCCGTGGC TTTGGAGAGA AGGACGGGAA CTACTGAAAA GCACCACACT ATCAGCCTCA GCTCCGGTAT CGAGAAGGTA AAAACTGTTG CAGCCTTAGA GGCGGAACTG GAGAGATTGA ACCGCGAAAT TCCGGTAGAT ATTCAGAGAA TTAAAAGAAG CAGTGACCGA GATATTTCGC TGCCGTCTAG AGCAGCCGGA GTACGCGATT TGATTGGAAG TGGCAATGCG GACATCGATT GGTCCCCTAT CGACGATCAA GCCACTGACC ACGATCGACC TGCAGTTGTC ACTTTTATTG AGATCGAGAA TTCAGACCCG GACAAGCCTA TAGCCATTAT TGATGAAGAC TACTGTCCTC GAAAACGGAA TTTGACGATG GTGGATGAAT ACGCTGACTC AGATGTCGAA ACTAGTGATG AATCTGAGAC TGCAGACGAT ACTTCACCGA AGCATGATAA CAAAAGTGTC TCGTCTACAG TCAATGGAAA TTGTGTTGCG GCGTCGACAT CAGAGGATGC TGAGTACGTC TTGCAAAGGT TCTCTCGCGA GGACGGTAAA ATGTTTGAGG AACACCTTGG CATGCATCGT GCTTTGCATC GGCAAGATCA AATATCTGGT CTCTCTTCAT CCGACATTGA TATTGCAGAT ACTACGGACA TTGATGAGGA CATTATTCGT TCTGCGGTGG GAAGACTCGT TGTGGAACGA AGTCGGTGTT GCTCTGAAAA TGAGAGTCAG GGGGACACCA GCTCATATGA CGGATTTAAG CGTGAGTCTA GCAATGACAG TTCACAGTTT CAGTCGCGAT CGAGTATTCG CTCTGGAGCA AAACAGCGTC CTTCTTCTCG AGGGCGGAGC CGCTCCGCTA CAAGGAACGA CTCCGAGCGA AGCGATAGGT CTCGATCAAA CTCCAGGATT AGAAATTCAA TCTCAGCAGG TGTAGAGGTC GTAAATACCG GGACTCGGGC CCTGGGTCAG TCACTTTCAG TCGCAGGAAA CTCGATTACT TCAGGCTCAC AAGCTGCTCG CCAATCGTTT GTGGCAAGCT CCAGAGCCGC AGGCGGAGCG ATTAAAAAAG TTGCGAAAGA TGTCAACGTT GACAGGGTCA TTAAATCAGC GGCCGAAGGC GGAGCACAAA GTATCATCAA AGTAGGAACA ACAATTGTCG CGTCTGCCAG TGCAGTTGTT CCTAGTCTCC GAATTAAAGG AGAGGGGTCT ACGAGAAACG CCGGTTTTGT TGTCTTCAAG AATCTTTACA CAGTACAAAG CGTTTTGCAA ATGGTTCACG ATGCGCGTCC TTATGTTATG GACTGCTTTG AAGCGCCTGA ACCCGGGGAC ATCTTTTGGA GAAACGTTGG TCTCGTTGCA AAGGCACGCC GTGTAGGAAA TCTTTTGAGC GTCAGTGCAA CGGTTGTTAC GTGCATCTTT TGGTCTATTC CGATGACAGT AATCGCATCT TTAACAGAAG TGAACTCCCT GAAAGAAGAG CTTCCCAAGC TGGGTCGTTT CATTGACAGA CACCCTAAAG CTGAGACTGT CATAGTGCAG CTTGCACCCC TCATCCTCTT GATATTCAAT GAAACTATTT TGCCCAGTGT CCTCAAATAT TTCGCTCGCT GGGAAGGACA TATTTCGGCA ACCATGCTGG AGGCATCATT ATTTGTTAAG CTTGGCTTTT TCATGGTGAG AAATATAGAC AGTCCGCAAC ATGACGATTG CAACGATGTG CTAACAATTG CGGTATTTGT CGTTATATAG ATTATTCAAA CTTTCTTTGT CTCGGCAATC TCTGGAGGAA TTACATCGGA ACTATCAAAT ATACTGTCAA ATCCAGAGAT GATTATCGAT TTGCTCGCGA ATTCGCTCCC TGCCCAGTCG ACATACTTTG TTCAAATTAT TCTTGCGTCA ACATTTCTGC TTCAATCTCT TGAACTTCTA CGGGTCTATC CGCTCGGAGT TGCATTGCTA CGCCGCTTTT TTGGACCGCA GCTCACGGCT AATGAGCGTC GCCGAACCTG GTGGTGGCTC AATTCTCTCG AAGACCCACC GGACTTTTGG CATGCAGAGA CTTTCGCACA AATCATTCTT TACTTTATGG TGTTTTTTGT GTACGCTGTG ATAGCTCCGT TTACGTCCTT TGTGGTGCTC CTGTGTTTCA CAATTTTGGA AAGTGGATAC CGGTATCAGC TTATTCATAA CTACCCAAGG GCATTTGACA CGGGTGGGAA ATTGTGGTAC TACTTTATCC AATTCATCCT GGCAAGCATG GTTATCGCTC AGCTTACTCT GATTGGTCTG ATGGCTCTCA AACAGAGCAC ATATGCCAGT CCAGTTCTTA TTCCTCTGCT AGTTGTCACC TGCTGTAAGT CGTGTCTCTT GTGATGCTTC TTTGAGCAGC CGCTTTTCCT AACCAAGCTT TACTCTTGAC AGTATTCATA ATCTACATTA ATTCGCGGCA TTCAGTGGTC GCACGTCATT TACCAACAAG AAACTGTATT GAAGCAGACC AGCATTACGT ATTGGTATCT GAAGATGATG AGGAAATTGG TGTCCACCTC AGCGATTTTA CATTTGTGTA CGGTAAATAT CTTCAGCCGG CTCTTCAAAA CGAGCAGGTC TGTCCGGACT ATGAAGACGA TGAGTACGGT GACGCCCCGA ACAGTTTGGA TGGCCAACAA TTGGACGACG ATGACTTGTC TATTCTCGGG CAGTAGCAAA TACTTCTGTT TACAGTTAGA CGCTCTCCTT TTTACGCGCC ATCGAATCTG TCTATAGAAA TTGGTTCACC AACACATTTT GGGCTCATTT GCTATCGGTA AACACACCAT TGTCTCCAGT ACCATGACCA CGATACTATA ATCAGATGTA CGCAATTTAT TGTTCTAAGC GCGAGGAGTA CGGAAAGGAT TTATACGAGC AAAGATACTT TGGCCCCGGA AATAATGCAT AAAGGCAGCC CCTGCGTGAA GAGGGACGAG ATATTTTCCG TAGGTGCCCA ACGTTTTGTG GATGTTGTAA GTCTGTTTTG CGATTTGCCC GTTGCGCTTT TTCTCGAGAT CGTCGGCTGG AGTAGCACCC TCGAACGTTT TGTAAAAGAA AGGCAACCCC TTACCACCAT AGTAGCCCAT AGCGATACCG CTTGCCGGCA TGATTGTCAT GAATGCATAG AGGCCGTAGT GCGTGACCTT GCTAGCCGCG TTTTCCCAGG TCGCGTTGCC GATCATGTTT CTGACGTTAT AGTCGGAACG AGAAAGCAGC CGATAGGCGA ATCGAGGCGC CACAACCATA CCAGTCAATA AACCGAGCGA TTTGTGACGC CACATCCATT TGCCTTTATC TTCCTTAGCG GCATCTTGAG CCTTCAGCAC GGATCCAACA GAACCGATCA GAGGAATCGC AACCATCCAA TGAAAAGCCG AAGCCGTCAA CGAATACGCA CCCGCGGGAA CAACACTCAT AACAAAGGCG ATTGAAACAG TATTCAGTTG AATAGGGTCG ACCGCAATTT CATATAAGAT AGGGGAC
|
Protein sequence | MVEHVVSDWP DDLHRVLNGT VATTVTTNDD DLLTQNDSSI VRETFRIYGT LYISAFVLYA ILRRCYPWLF NVRGWVPELR CDLAQTKYGL ISWFWKVFRV KDQDIFEQCG MDALCFIRAM RFGRNLGLMG SFNALWLIPV YMTAKEAPET EYLNDWLTEI SVAHLPNRSS RYTGTLLATY ITFLFTMYLI LQEMHWYTYW RHRFLSQREP RNYAVYVAGI PDEWRSSKDL TAYFHNCTSK DSVLEAHVAM DLPSLEAKHL RRGDVVRKLE HAVALERRTG TTEKHHTISL SSGIEKVKTV AALEAELERL NREIPVDIQR IKRSSDRDIS LPSRAAGVRD LIGSGNADID WSPIDDQATD HDRPAVVTFI EIENSDPDKP IAIIDEDYCP RKRNLTMVDE YADSDVETSD ESETADDTSP KHDNKSVSST VNGNCVAAST SEDAEYVLQR FSREDGKMFE EHLGMHRALH RQDQISGLSS SDIDIADTTD IDEDIIRSAV GRLVVERSRC CSENESQGDT SSYDGFKRVE VVNTGTRALG QSLSVAGNSI TSGSQAARQS FVASSRAAGG AIKKVAKDVN VDRVIKSAAE GGAQSIIKVG TTIVASASAV VPSLRIKGEG STRNAGFVVF KNLYTVQSVL QMVHDARPYV MDCFEAPEPG DIFWRNVGLV AKARRVGNLL SVSATVVTCI FWSIPMTVIA SLTEVNSLKE ELPKLGRFID RHPKAETVIV QLAPLILLIF NETILPSVLK YFARWEGHIS ATMLEASLFV KLGFFMIIQT FFVSAISGGI TSELSNILSN PEMIIDLLAN SLPAQSTYFV QIILASTFLL QSLELLRVYP LGVALLRRFF GPQLTANERR RTWWWLNSLE DPPDFWHAET FAQIILYFMV FFVYAVIAPF TSFVVLLCFT ILESGYRYQL IHNYPRAFDT GGKLWYYFIQ FILASMVIAQ LTLIGLMALK QSTYASPVLI PLLVVTCLFI IYINSRHSVV ARHLPTRNCI EADQHYVLVS EDDEEIGVHL SDFTFVYGKY LQPALQNEQV CPDYEDDEYG DAPNSLDGQQ LDDDDLSILG Q
|
| |