Gene Information |
Locus tag | PHATR_46837 |
Symbol | |
ID | 7204690 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 590628 |
End bp | 593844 |
Gene Length | 3217 bp |
Protein Length | 997 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185738 |
Protein GI | 219121013 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
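The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the record: the function names `gene_span` and `coding_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive and that the stop codon is counted inside the span.

```python
def gene_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the Gene Information table above.
start_bp, end_bp, protein_aa = 590628, 593844, 997

span = gene_span(start_bp, end_bp)   # compare with the "Gene Length" field
cds = coding_length(protein_aa)      # coding bases implied by "Protein Length"
non_coding = span - cds              # bases inside the span that are not translated

print(span, cds, non_coding)
```

Under these assumptions the span equals the reported Gene Length of 3217 bp, and the bases left over after accounting for 997 codons plus a stop codon are consistent with the record flagging the gene as spliced.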
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAAAC GCAAGCAAAG ATCCAAGCGT TCTTTGGCTA GTCCTCCAAA AGGTCTTGCT ATTCGAGGTG GTACGAACGG AGCCACGAAT CCGTTTGAGA TCACATCTCG TCACAAGCGA CCGAAACACG AAGTGCACAA TCGCAAGATT CCCACAAAGA ATGTCGCCAA GCCGTCCGCC TTGGCCCAGG CGGTCTCTCG TCGCAGGAAT GCACTCAGTG AAGCTTTGCA GAAGTCAAAG AAGAACAATA CTTTTGCCGA TAAGCGCATA GGAGAGTACG ATCGGAGCAT GACGTCGGAC GAGCAAAATT TGGCTCGATT GGTTCGGGAA CGGGCTAGGC GTAGCAAACG CTCTACAAAG TTTAGTTTGA ATGACGAGGA TGAGGACGGC GGCCAAACAA CCCTTACACA CAAAGGAAAA TCCATCAGTG ATATGTCTGC TAGAGACCAC GTTATTCTGT CGGATGATGA CGAAGACGAC ATTGGAAACC TAGACGCTGC CGATACGGCA CTGCATTTCG GCGGCGGATG CCGGTCGGCC GTCCCAGATG ACGGATACTA CGGTCCTACC GCTGGCGCTC AGGGTAAAGA TATGTCAAGC ATGTATATGA CGCGAAAAAT GGAATTGGAT GATCTCATAT TGCGGCGAAA AATTAAGAAA GCCGAAAGGA TGAAATCTCG CGAAGATCAA ACCGAAGCCT TTGAGGCCAT GGACGAATCG TTCGCCGAGC TGTCGAATAT GCTATCTTTC CGTGACAAAG AAAAGGAGAT CCGGGAGCAT GTTAAGTCCA AACGTGCTGG CATGCTTGCA CCGGAAGACC AAGAAATGGA GGACTGGGAT AAGCAAATGA AACAATATCA GTTCAATGAG AGAAAAGTGA AAGCAACTGA TCGCGTAAAA ACACCAGAAG AGATTGCCAA GGAAGAGGTA GATCGTTTGC ACGAGTTAGA GACCCGTCGC TTAGCTCGCA TGAATGGTGA CTTTGTCGAC GATGACCTTT CGGATATTTC CGATGGTGAG CGGAACCTGC AGAAGCGAAA GACAAGAAAG AGTACCCTAG CTAGTACCGC TGAAGCGTTA GATGATTCCG AAGTGGAGGA TGACGCGGAA GAAAAGGTAA CAACTCGTTT TACTGCCGAC GGCCTTGTGG AAGTCGACAG AAATGGTGTT GTTGCAAAAA AGGTTAATGC ACCAGATAAT GACGCTCTGA ATGGTGCGTT TGCAAAAGGA TCAAGAGTTA GCGCTTGCTA CCACGCGAAA GAACAGCTCG ACGACGATGC GACCTGGTTC AATGGGGTTG TTTCGGGTGT CTACGCACGA GACGACGGCT TGGTAGTATA CAACATCGAA TACGATGATG GAGATTTCGA AGATGGTGTT GAGCATCGAC ATGTTCGCCC CGCGGATGAA ACGAAAAGTG TTAGGAAGGT GGAGGAAATA GAAGCAAACA AGCAGGAACA GGAGCTTGAG CTGAAAAGAA AACGCCTACG AGCGAAAGAG AAAGCAAGGT ATGTAAAACA ATCTGTAGTA TGGGGTGATC GCACGGATTT CGCGCTTGCA TCTTTGTTGC ACCTGTATGG GGCTTGGTAG AGCGTTCTGC TCTCAATTGC TTCTCGAGTC GCGCTTCCCC TATATTGAGA ACCGTCCACA ATAGTTCTTG CAAGACAAAG TCACGCCTTG TGAGTCTCCA TGGAATTTAC ATTTCTAACT TTTAATGGTG CATTCTTCCA GGGCGGAACT TCCTTTTGTG TTTGAAGTTC CAACAACGTT GGAGGCACTG CATGATATGA TTGGTACATA 
CGCCTCTACC GGCGCGGATG CTTCTCTAAT CATAAAACGA ATTCACGCAT CCAATTCAGT CAGGTTGGAC AGAAGAAATG CTGAAAAGAT GCAGAACTTT TACGATGTGG CGATTCGACG GTTTGTTGCA GTTGGTGATG CGATTTATAG ATCAGGTGAT GGCGATTCAG AGTTAGGTCG ATTCAAACAG CTCGATGAGT TGGCCAGGAT CATGTACACA ATGGCGCAAG ACTCGGCTGA AAGTGCAACT GCCGTATGGG GGCGCCGTTT AGGTGTTTTC CAAAATGCAC ACGCCAAGAG ACTGCGCGAC GCTGAGTTAG ATCGTGATGA AGACGATGAG GATCAGTCTG CGTGGCCGTC GATTGGAGTT TTCTTAGCCC TCCGTGCGAT CGGTCATATC TTTCCAGTGA CCGATCAGCG TCACCAGATT ATCACTCCCA CGCTGCTAAT GCTTGGTCAA ATAGTTTCTC AAACACCTGT TCTTTCCATA TATGATCTTG TTGTCGGAAC AATGTGCTCA GCACTACTAA TTGAGTACAC AAAAGCTGCC AAACGTATTT CACCGGAAGC GATTGCTTTC ATCGCCGGTG TTTTGCGAAT GTTCGCATCG GATTCCTTGA AGCGACAAGG GCCATATTCT CTTCCAAGTT TGGAGAAAGC GGTGACGGGA GAGCAATTTG ATTCATTTCG AGCCCTTGCT AGTAGCTATC AAGAGGTGTA CCCACCAAAT CTGAGTTTTG AAAGGATTGG CATGTTTAGT GCGGAAACAC CCGCTGCGCT GCTCTATGCT GCGCTACATT TGGTTGAGGT CACCGTGCTT TGCCTTTCGG GCTCAGGCAT CGACGCTGAA CGAGAACTTT TTCACACTTT GGCGGAGTGT GTATTGAACA TTAAACCATA CAGTAAGAAG CACACACTCT GCAAGTGTTT GCAAGAAAAG GCTGCTTCAG CTGCTGAGTG CCTTGAGAAA GCGTGTAAGC TTGAAAATGC CCGTCCTCCA CTGAGGCGCA GATCGCTACC GACAATTCGT GACACTATGA TTAAGTCACT TGCTCCACGT ATCGAGAATG CTGAAAAATA CTCCATGTCG AAAGACAAAG GCAAGAATGC CCCACAAGCC GCTTTGGATC GCACTCGCCG AGAACTTCGC CGTGAGCACA AGGCTGTTTC TCGCGAGCTG CGGCTGGATT CTTCTTTCGT GGAGAGGGCT CGGCGCGAGG AACAGACCAA GAAGGATTCC GCTGCTCAAG CCAAACGTCA GAAGGCTTAC GGATGGCTTG AGAATGAGCA GGCTACCATG AACCAAGAAG TGCGGATGGG AGGCGGCTTG TTGAAGGGTG GTGGTATGGG AGCTGCAAAA GCGAAGGCGG CTACTGGGAA ACTAGGCATG AAAAAGGGCG GCAAACTAAA GCGGTAG
|
Protein sequence | MGKRKQRSKR SLASPPKGLA IRGGTNGATN PFEITSRHKR PKHEVHNRKI PTKNVAKPSA LAQAVSRRRN ALSEALQKSK KNNTFADKRI GEYDRSMTSD EQNLARLVRE RARRSKRSTK FSLNDEDEDG GQTTLTHKGK SISDMSARDH VILSDDDEDD IGNLDAADTA LHFGGGCRSA VPDDGYYGPT AGAQGKDMSS MYMTRKMELD DLILRRKIKK AERMKSREDQ TEAFEAMDES FAELSNMLSF RDKEKEIREH VKSKRAGMLA PEDQEMEDWD KQMKQYQFNE RKVKATDRVK TPEEIAKEEV DRLHELETRR LARMNGDFVD DDLSDISDGE RNLQKRKTRK STLASTAEAL DDSEVEDDAE EKVTTRFTAD GLVEVDRNGV VAKKVNAPDN DALNGAFAKG SRVSACYHAK EQLDDDATWF NGVVSGVYAR DDGLVVYNIE YDDGDFEDGV EHRHVRPADE TKSVRKVEEI EANKQEQELE LKRKRLRAKE KARAELPFVF EVPTTLEALH DMIGTYASTG ADASLIIKRI HASNSVRLDR RNAEKMQNFY DVAIRRFVAV GDAIYRSGDG DSELGRFKQL DELARIMYTM AQDSAESATA VWGRRLGVFQ NAHAKRLRDA ELDRDEDDED QSAWPSIGVF LALRAIGHIF PVTDQRHQII TPTLLMLGQI VSQTPVLSIY DLVVGTMCSA LLIEYTKAAK RISPEAIAFI AGVLRMFASD SLKRQGPYSL PSLEKAVTGE QFDSFRALAS SYQEVYPPNL SFERIGMFSA ETPAALLYAA LHLVEVTVLC LSGSGIDAER ELFHTLAECV LNIKPYSKKH TLCKCLQEKA ASAAECLEKA CKLENARPPL RRRSLPTIRD TMIKSLAPRI ENAEKYSMSK DKGKNAPQAA LDRTRRELRR EHKAVSRELR LDSSFVERAR REEQTKKDSA AQAKRQKAYG WLENEQATMN QEVRMGGGLL KGGGMGAAKA KAATGKLGMK KGGKLKR
|
| |
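The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string could be normalized and its GC percentage computed follows. The helper names `normalize` and `gc_percent` are hypothetical, and the record does not state how its own GC content figure was calculated, so this is only one plausible definition (G plus C over all bases).

```python
def normalize(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    cleaned = normalize(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# The first two ten-base blocks of the gene sequence above, used as a small sample.
sample = "ATGGGGAAAC GCAAGCAAAG"

print(normalize(sample))
print(gc_percent(sample))
```

Passing the full gene sequence through `gc_percent` would give a value to compare against the GC content field, bearing in mind that the record reports a rounded figure.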