Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45710 |
Symbol | |
ID | 7200482 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 1012498 |
End bp | 1015462 |
Gene Length | 2965 bp |
Protein Length | 890 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179766 |
Protein GI | 219117963 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.262092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTCG CGAATGGCTT TGGAGACAGT AGTGATGATC TGATGGAAAA CGCAAGGATC GTTAGCCCTT ACGGCGATGG AGAAACCCAA GCGTTTCCTG CTGATCCTTG GGCAAATTTT CAGTACGCAC TGCCTGAAGG GCCCTCGCAG TATAAACATC GTGCTTCTAG ACAAAGACCA ACGTTTGGAG GATCACGATC TCCCTCGGAG CCAATTCCCA CCCCACGAGA AAGGATGGTG ACATCTTCTA AGATCTCAAC TCCTCACCGA GACGGTATGG TAAATCGAGG CCATGCTAGT CGATACGAAT ATTCTAGTGA CCAGCGATGC GATGACGTAC ACGTAATATT TACAGATCCC TCCGACGGTG TTGGGATCAG CGTTGAAAAA AGGAGCCAAC AGGACTTGTT GCGACAAAAG GTTCGGGAGG ACACCAGTCG TAGGAGCTTA CAACATTCAA CCAGCGCACA AAGTATGTAC CGGAAACGTA GCGAAAACAA AGGTGCGTTT CCTTCATTTC CACAAGTAGC GCCGAGCAAA GAGAAAGCAC TGGTACGTAC TCCGAACACC TAAACTTGAA TGCATTGACC AATTTTGAAA AGTGAACTAC ACTCCTCTAA ATTACGCCTT CAATTCGTAT TCTTTTGTCT GAAGCGAGGC CAGCGTCCAT CTAAACATAC TCGGGACCAT ATCTCACGGC ATACTTCGCC AAAATCATCA GTGGAAATAA CTTCAGAAAG CACAAACGCA GAGACCGATG GTCTAAATTT GAATTGGGCA TTCTCGCGTG TTCAAGTTCG AGCGAATAGT GAAACGGAGT CAACGACGGT TTGGCTAAAT ACTGATCACA GGCAGGATTT AGAAGGAGAA GACATTTGGT TACAAGAAGA ACATGCATTC CCTTTGTTGT CAACGGGGGC TTCAAGCAGA TTTTCGGGCC CATCTGGTAA AGATGGCCTC CGAAAACGTA CTGATAAGAA GTCAAATCGT GTCCGTTTTG CCGAACCGAT ACAGAAGCCG CGATCTGTCC CTTTTCTGAA AACTGTTTCA CGGGAAACTA TACGGGAAGA ATCCTCAGAT CCTCGTTCCA AAAAGATCAC ACATAGTGAC AATCGCGCGT GGTTCGTGGC CCAACCGAAG TCAATTCTCC GTCGGCGGCG TTTCGCTGGC GAAACCGTGT CCCATGACCC ACAGTACCCC CAGAAGAATC GCCCATCCTC TCATCGCGCA GCTCCACAAC GGAAGTCTGC TACATCGTTT TTGGATACAC AAGGATCCCT ACTCTCTCCC ATTCATTCAG ATAGGCGGCC TTGGGACCGC ATCTCTGAAA CAGGCTCGGA GTCACTCAGT CCTTCGTACA GTGATGTTGA GCGAGAGAAA CGCGTTAGTC TTGGTCCCTA CCACCTGCAG GAGCTAAATG AGATGTATCC CGATCCTCCT CTTGAGTTGC AGGTAAGACA TTTTTGACTT TACAAAGCCC TATCTTTTGT AGCCGACCTA CTCACCGACA TACATTTTCT TTTAGTTCGA CGACGAGTCA ACTGTAGTAC CAGCACGCGC TTCTTTCATC GACACTGTCG CTGCTGTTGT CGTTCAAGCC GCTGTTCGTA GATTTCTTGC CCAAAAAGTG ATGCATGAGA TGGTCGGCAA AGCATATTCC TTTCCGCATT TAGAATCTGA CGATAAGAAA TATCGACCAT TGTCGTCGCG AAAGGTAACT CCTGAAAAAA GGTCGTCCCG AAAAAGCTAT GCAGAGAGCC CGTATGGTAG GAATTTGGGA GGAGCAGTGT TCATAGAAGT CATGGCTGCG ATAAAAATCC AATCTGCCTT CCGAGGCTTT TGGGTTCGAG ATTCGTTGAA TGTGGATCAT TTTTGCGCGA CTATGATCCA GAAATGGTAT CGACGACATC ATCAGAGGCA CCACTATTTT GCAGATCTTT CTCGGATCAT ACTGGTCCAG TCCATTTGGA GGCGCAGTAT AGCCAGGGAG CACGCTGCCT TTTTCCTTGG GAGCGTAATT ACAGTTCAGT CGCTGTTTCG CTCGTACAGC GCTCGCAAAA AGCTCTACTC AGGACTCACT TGCCTACGAA AGGATACTAT GGCAGCTGTA GTGATCCAAT CGCAATGGCG TACATATGCT TGCGAATGCA ACTTTATTCG CGATCTTGTC GATATTTTGA TCGTTCAAAG TGTTGTGAGA ACTTGGTTAG CAAGACGACA CCTGTCATCA CTACGCTCCA GGGCCCAAAG TATTTCCGGC AAAAAGTCAC CAACAGTATC AAAAAAATAC GCGAATCAAG TGGCGGCGCA ACCTACTGGA AGTCCTCGAC CTGGAGAGGC CAATCGTAAA TTGGCGACAG GGCAATGCTA CTCCTCGTAT AGGTCTGTCG AAGAGAGTTC GTTCAGCGCT ATTCTTGGCA ATATAAAGAG CAAGGAGAAC AATCACCTCA TTGTGTTGAT TACATCTCAG TCTCTCTCGC GCAATCAAGC TTCCACAAGA AGTAATATTG GTACAATCTT ACGCGTCCAT AATGTCTCAT TCGAGGAAGT GGATGGAGCA AATCCGCTAA CCCGAGGACG ACGCGACGAA CTCTTTGCTA TATCACAAAT GCGCGGCGTG TACCCGCAGT TCTTTGTGGT AGACTATGAA ACAGGGCTCA CGTTATTTTT CTGCAACAGT GATTCTTTTT TCGGTGCCAA CGAAGAAGGC TCTCTACCCA GGATACTCAA TATTGCTGGT GTTGTGCAGA GCGCGATCGG AGGACATCAA GAAAGAAATA GTACCATAGA CGAAGCTCCT AAAGCAAACA AGCACCTGTT TGAGCCAAAG AGGCAAAGCT CACATACTAC GGTTTCAATT GACAGTGAAA CTTCGGAGCC CTCTGTAGGA CGGAACAGTT TGCTTTCGAT GTGGAAAAAT CTTGACAAGA AGAACACATT AGTATTAAAT GGACACAGGA ATTGA
|
Protein sequence | MDFANGFGDS SDDLMENARI VSPYGDGETQ AFPADPWANF QYALPEGPSQ YKHRASRQRP TFGGSRSPSE PIPTPRERMV TSSKISTPHR DGMVNRGHAS RYEYSSDQRC DDVHVIFTDP SDGVGISVEK RSQQDLLRQK VREDTSRRSL QHSTSAQSMY RKRSENKVEI TSESTNAETD GLNLNWAFSR VQVRANSETE STTVWLNTDH RQDLEGEDIW LQEEHAFPLL STGASSRFSG PSGKDGLRKR TDKKSNRVRF AEPIQKPRSV PFLKTVSRET IREESSDPRS KKITHSDNRA WFVAQPKSIL RRRRFAGETV SHDPQYPQKN RPSSHRAAPQ RKSATSFLDT QGSLLSPIHS DRRPWDRISE TGSESLSPSY SDVEREKRVS LGPYHLQELN EMYPDPPLEL QFDDESTVVP ARASFIDTVA AVVVQAAVRR FLAQKVMHEM VGKAYSFPHL ESDDKKYRPL SSRKVTPEKR SSRKSYAESP YGRNLGGAVF IEVMAAIKIQ SAFRGFWVRD SLNVDHFCAT MIQKWYRRHH QRHHYFADLS RIILVQSIWR RSIAREHAAF FLGSVITVQS LFRSYSARKK LYSGLTCLRK DTMAAVVIQS QWRTYACECN FIRDLVDILI VQSVVRTWLA RRHLSSLRSR AQSISGKKSP TVSKKYANQV AAQPTGSPRP GEANRKLATG QCYSSYRSVE ESSFSAILGN IKSKENNHLI VLITSQSLSR NQASTRSNIG TILRVHNVSF EEVDGANPLT RGRRDELFAI SQMRGVYPQF FVVDYETGLT LFFCNSDSFF GANEEGSLPR ILNIAGVVQS AIGGHQERNS TIDEAPKANK HLFEPKRQSS HTTVSIDSET SEPSVGRNSL LSMWKNLDKK NTLVLNGHRN
|
| |