Gene Information |
Locus tag | PHATRDRAFT_48535 |
Symbol | |
ID | 7194777 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 141363 |
End bp | 144776 |
Gene Length | 3414 bp |
Protein Length | 1112 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183038 |
Protein GI | 219125546 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.12643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAAGT CGAACGCATC GGCAACACAT CGAACGGCCT CTATGATTAG CGATATTTCT CACGAGTTCG AAGAGGAAGG CTTCGGACGA GTACGATTGT TAGCCCCCTT GAGTTCCTCG CGACGCGGCC CTTTTTATGA ATTCCGTTTG AATGAAATGA CGGTCAATAA GCTCCAGTTT GCTTCGCTCG GTCTCTACGG ACGCGACGAG GAGATACGGA AGCTTCAGGA AGTCTTTGTC AAAATGAAAA GCCAATCTCG GGAACTTGTA ATGATTTCGG GCGCGGCCGG AACGGGGAAA TCCGCTTTGG CCCAGAATTT ACAGAAATCT GTACAAAAGG CCAAGGGATT TTACATTTCG GGAAAATTCG ACGTCAAGCA ACGTGATGAA CCTTACGCAG GCATAGTCAC CGCCTGTCGT CAGCTATGTG ACGATGTTTT GTCCTTAACC GAGCCTTTAC CAGAAATTGC ACCAGAACAG ACAGTCGAGC CGTGTACAGG ATTTCTGCCT CCACAAAGCT CGCTATATCG TTGGAACTTT TCTTTCGAAG AAATGAAAGC AAAATTGCAG CAGGAACTGG GTCAAGAAGC TGAGGTTCTG TCGACTGTGA TACCAAACCT GAAACAAATT GTCGGGGGTG ACTATTTGTT GGCACCGGAG TCTGGAGTTG GCCGTCTTGA AGCAAGGAAT CGTTTCAACT TCGCCTTTCG GCGATTCATG AGAGTAATGG GTAGTTTTGG ACCTCTTGTC ATGGTGCTGG ATGATCTACA GTGGGCGGAC TTGGCGTCCC TAGACTTGGT ACAGCTTCTC ATCACGGATC GGGAAAATCC ATCCCTGATG GTGGTAGGCT GCTATCGTGA CAACGAAGTT CACGATGCGC ACTTGATGAC AAAAATGATC CGTGACATCA AGACGAGCAG TGATGAAGAT GGCGTTTCAT TAACATCAAT TACTGTGGGA AATCTCGAAG TTCCACAGGT TGACAAAATG CTGGAAGATT TGCTGTCCTC TACCCCCAAA AAGACGATGG AGCTTGCCGA GGTCGTTCAT AAGAAGACAC TCGGTAATGT GTTCTACGTC ATACAGTTCC TTTCACAACT AGCGGAAGAT GAACTGTTAT CTTTCAATAT TGGATCTCTC AAATGGACCT GGGACATCGA AAAAGTACTG TTGAGTACTG CATCGACAGC AAACGTCGTC GAACTGATGC AAAAGAAACT CAACAAGCTT CCAAAAGGGG TTTCAGAAGT GCTTCAGTTG ATTGCTTGTC TCGGATCTAC GTTCAACTAT GGACTTTTCC AACTTGTGGT TCATCATCTT CACGCAATCA AGCCACCAAA AATTCCGATC GAATCGCCCC AAGTCGATAC CTCCTCTGAA AACCCAACTT CCAACGGATT TCGTGCGATT GATTGCTTGA ACATTTCCGT CAACGAAGGT ATAGTCGAAT GTTGCGGAGA GTACTCGTTT CGCTGGGTAC ACGATAAGAT ACAAGAAGCA ACCTACTCAT TGATAGACCG GGATGAGCTG GTAGGGATTC AATATCGAAT TGGTCAATTG TTGATGAACA GTTTGACGCC TGACGATATC GACACCATTA TTTTCGTCGT GGTCAAGTTG TTGAACGAAG GAGCAGCAGC AGGTCTTGCC ACGGACTATG CGATGTGCGA TCGAGTGGCC AAGTTGAATC TTCGTGCTGG AGTGAAAGCA ATTGAGACTT CTGCATTTTT GACCGCAACA GTCTACCTGG CAAAAGGAAT TGAGCTACTG TCCAATGACC ATTGGAACAG CCAGTACGAT 
CTGAGCCTAG AGTTATATTC TACCGCTGCG GAGGCTGAAT TTTGCGTCGG AAATTTCGAT CGCATGACCA ACCTTTGCAA TGAAGTCCTG GATTTGTCGA ATCGGCCTCT TTTGGACAAG AGACGGGCGT ACAATGTGCT CTTGGATTCA TTGGGTGCGC AAGACCGTAT GAGCGAAGCT TCCGAGTTGT GTTTGTACGC TCTATATTTA CTGAACTGCC AACTTCCAAA ATACCTCATC AAAATGTTTG TAGTCGCTGG ATTTTTACGA ACCAAGGTTT TGCTCAAGCG TTTGACATCA GAAAAAGTGT CCGATCTACC AACAATGAAA GAAAAGTCCA AAATATGGGC AATGGCCTTG CTTGGAAGGC TGACAACTTT TTCATACTTG GAAGAATTGG ACATACTACC TCTGGTTATT CTCAAAAGCT TGAGATGGAC TCTCCGGTTC GGTGTCAGCG AATTTTCTCC ACCTGCCTTT GCTCTCGTCG GCCTATCCCT CGTTGCCCAT CTGAATGATT TCACTGGAGG GAAAGTGTAC GCTCAACATG CTTTGAGTTT ACTGGAATCA ATCCACTCCC GTAAAGTAGA GTCGCGCACC ATATTTGTCA CCCAGGCTTT TGTTATGCAC TGGACGGACC CAATTCAAAG CACTCGGAAG CCATTGTTAC ATGGTTACCA AGTTGGAATG GCAATGGGCG ACACGGAAAG CGCCATGTGG TGCATCTACT TCTACCTAGA AAACTCTATG CATGGAGGTG GAACCCTTCC GTGTCTGGCC GCTGATTTAC GGATATACAG CAATCAAATG AGGGAATTCA AACAGTCAAA ACAACTCGCT TCTACTCTTC TCCTGTGGCA GGTAGTTTTG AATCTGATGG GCGAGTCGGA AAACTCTGTA GTTTTGACGG GGGAAGCTGT TAACCAAGAC CTAACTTTGA GGCAAGCACA TGATAACGTT CATCTGTACG CCGCGCTCCG ACGAATGCAG CTTTACATCG CATGCTTGTT TGAGGAGTTT GAGCTAGCTG TCGGTTTGAT TAAGGAGACG GGGATCAACT ATTTTGAGAA AGTAATGCCG GCAGTCTTTG GACTGTGCCC ATTGGCCTTT TTGAATGGAA TGGCATGTTT TGCCGTTGCA TCGAAAAGAT GCGGAAGAAA ATATCGGATG TTGGGCAAGA AATTTCTCAG GAAAATTTCG GGCTGGGTGG AAAAGGGGGT AAGTGTATTT CTTAGCATCA TGCTTTGTTG ACATGGCGGT AGGTCTCACA ACTTGAAACC TTGCACGCGC CAGAATGTTA ATGTCCGTCA CTATGAGTCT CTTCTAGAAG CTGAACTCGC TGCCTTGGAT GGACGGCGTA GTATCGCCTT GAAGCACTAC GATATTTCTA TTCTTTTGGC TGGTCGTCAA GGATTTATCC AAGACCAGGC TATGGCGCAC CAAAAATTGG GCGAATTTCA CTTAAAGTCT GCAAACATGC AAGATGCGGA ATATCATGTG GGGGAAGCAA TAAAGCTCTA TGAGGAGTGG GGCGCTTTCC AGAAAGCTCA CCATATTAAA GTTAAACACG AAAAGCTTCT TGCGCCTCCC TCCGAAATTC TTGTGGCTAC GTAA
|
Protein sequence | MFKSNASATH RTASMISDIS HEFEEEGFGR VRLLAPLSSS RRGPFYEFRL NEMTVNKLQF ASLGLYGRDE EIRKLQEVFV KMKSQSRELV MISGAAGTGK SALAQNLQKS VQKAKGFYIS GKFDVKQRDE PYAGIVTACR QLCDDVLSLT EPLPEIAPEQ TVEPCTGFLP PQSSLYRWNF SFEEMKAKLQ QELGQEAEVL STVIPNLKQI VGGDYLLAPE SGVGRLEARN RFNFAFRRFM RVMGSFGPLV MVLDDLQWAD LASLDLVQLL ITDRENPSLM VVGCYRDNEV HDAHLMTKMI RDIKTSSDED GVSLTSITVG NLEVPQVDKM LEDLLSSTPK KTMELAEVVH KKTLGNVFYV IQFLSQLAED ELLSFNIGSL KWTWDIEKVL LSTASTANVV ELMQKKLNKL PKGVSEVLQL IACLGSTFNY GLFQLVVHHL HAIKPPKIPI ESPQVDTSSE NPTSNGFRAI DCLNISVNEG IVECCGEYSF RWVHDKIQEA TYSLIDRDEL VGIQYRIGQL LMNSLTPDDI DTIIFVVVKL LNEGAAAGLA TDYAMCDRVA KLNLRAGVKA IETSAFLTAT VYLAKGIELL SNDHWNSQYD LSLELYSTAA EAEFCVGNFD RMTNLCNEVL DLSNRPLLDK RRAYNVLLDS LGAQDRMSEA SELCLYALYL LNCQLPKYLI KMFVVAGFLR TKVLLKRLTS EKVSDLPTMK EKSKIWAMAL LGRLTTFSYL EELDILPLVI LKSLRWTLRF GVSEFSPPAF ALVGLSLVAH LNDFTGGKVY AQHALSLLES IHSRKVESRT IFVTQAFVMH WTDPIQSTRK PLLHGYQVGM AMGDTESAMW CIYFYLENSM HGGGTLPCLA ADLRIYSNQM REFKQSKQLA STLLLWQVVL NLMGESENSV VLTGEAVNQD LTLRQAHDNV HLYAALRRMQ LYIACLFEEF ELAVGLIKET GINYFEKVMP AVFGLCPLAF LNGMACFAVA SKRCGRKYRM LGKKFLRKIS GWVEKGNVNV RHYESLLEAE LAALDGRRSI ALKHYDISIL LAGRQGFIQD QAMAHQKLGE FHLKSANMQD AEYHVGEAIK LYEEWGAFQK AHHIKVKHEK LLAPPSEILV AT
|
| |
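The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the coding sequence includes one stop codon (the gene sequence above ends in TAA). The function names `span_length` and `coding_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table above.
START_BP = 141363
END_BP = 144776
GENE_LENGTH_BP = 3414
PROTEIN_LENGTH_AA = 1112


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, stop_codon: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    return 3 * (protein_aa + (1 if stop_codon else 0))


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
non_coding = gene_len - cds_len

print(f"Span from coordinates: {gene_len} bp (record lists {GENE_LENGTH_BP} bp)")
print(f"Coding length incl. stop codon: {cds_len} bp")
print(f"Non-coding bases within the gene span: {non_coding} bp")
```

Under these assumptions the span computed from the coordinates matches the listed gene length, and the gene span is longer than the coding length implied by the protein. That difference is consistent with the record's "Is gene spliced | Yes" entry, since intronic bases are counted in the gene length but not translated.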