Gene Information |
Locus tag | PHATRDRAFT_49723 |
Symbol | |
ID | 7198404 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 66437 |
End bp | 68565 |
Gene Length | 2129 bp |
Protein Length | 658 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184481 |
Protein GI | 219128567 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.382168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAACT TCGGTGACGT CGGCATGTTG ATGCCGAATC GTCCTCCAAC AACCGATATG AGAACGAGAA CACCTTGTGA GTGGCAGCCG GTGCTGGACT GCACCCACCG GCCATCTGTG GACGACCCCA AAAATATGGC TTCCCTCATT GGAGGACTGT CTGTGCTGCG CAGGTCTGGT CCCACTACCT TGAAATTCGA ATCTAGCTCT AGCTTGAATT TTGAACAATA ATAGTTCGAA AGTTTTGTGC GCCGTGGTCC TGTGGACCGT GCACCACCGG GAATCACTGT CCGCATCCAC GTGGAAGAAT CGAGAAACAC GAAAACGTCG TGTGGTGTTT TGGTGCGTGA CGGTGAGAGT GCGCACAGTG TGGCGTTCGG GTCTCCGATG GATGAAACCC GTACACCCCA ACAAGAAGGA CAACGTAGTA AGCCAAACTC GTGACAAGCT CCCGTACGCT CATGCCCATT CCAAGCAATA CCAAACATGC CTACGTCGTT TACAATGAAC CCCGCACGGA GTCGACATGG ACAGCTGCAC CGTTCTAGCG TGAGAAAAGC GCTTCTTCCG TTGCTGGTGT CGATGGTGCA CGCGGGTCGA TTGCCGTTGG GAATTCTCCT TTCGTCGCCG TCGGCTTTTT CGGAAATAAG AACGAATAGT GAATCAGCGT TAGGCACAGT ACGGCACCAG TTACACAACG AAGCCTTTGG TGGGTCCGGA GACCAAACGC GGTTCAGATA TCCGGGTCGA AAAGGACCGC TTTGCGCTGT CACCAAGTTG CGAGGAGGTG ATGTTTCCGA CGGGGAAGAC GAATCTTACG ACAGTGACGG GGAAGATACC AATGATTCGG ATGAAGAGGA GGAAGCTGAG ATTGCCGCGG TACCGGAGTC TCGAGATCCA CAAAGCGAAC CAGAAGACGA ATCCGAGGCC CATCCTTTAT CTTCGGTCAT TTCTATGGAA CCGGTACCAA TCACCATCAA AACATCGTTA GGTACCAAGG CTCTAGACCA CATAGTAGAG CTGACGGTAC ATCGCTCGAG GAATATTGCT TCGCTTAAAC TGAGTGCTAG TCGCCAGCTA CCAGGACGGC CACCAGTTTC TGTGATGCAC ATGTTGCTCA ACGGAAAAGT ATTGAGTGAC GAAATGCTAC TGGACGAGTT GATTGATGAC GACGACGAGA AAGACGATAC CGACGGTACT GGTAGTGAAA GCCAGCTCAC TTTAACGTTG GATATGCTAC CACCTGTCGA TCCCAAATTT GTCGGCCAAC TAGAGTCACA AATGAAGGAT ATGACCACCG CAGAGCTGTT GGGAACATTC GCTGCCAATG AGGCTGCATT GTACCAGAAC GCGGCACTCT TGCTTGCCGA GCAAATGGAG CCAGTATACG ACGATGATCA AGTGGAGGTA GGCATTTCCG AGGTGGCGAC ACACCCACCG CCGCTCGTGA ACGTGCAAGT TCGCGAGCAA GCAGCACGCA TCCGTCGAGA CTTGGAAAGT AAAATCTTGG CTTCGGAGCA CTCGCAAAAG ATTCTCGCTG ACCCTTTGCC CCCTTCCGCC AAACTAGCAG ATCTACAACG TGTCGAGCGT CGTGGCCAAC GGGTTCGACG AGTCGCGGGA TCGGGTGGCG TAACGACCGG CTTGAAACGA TCGATTCAAA AAAATCTGAA CGTACACTGG GGTGACGCTA TACGGAATTT TTGTCTCTTT TTGTTCTTTG GATACTTTGG TGGGCGTACA CCCGTAAGTC GGGCTATTCT GTTGCTGGGT GCACCAAGCG TCTTTGTGCT ACAGGCACGG 
CCCGTCAAAC TGTGGATCAA ATGTCTCATG TACGCAATGC TCGACCATCC GCCTGGAATT TTCTTGAGTT TGCTGCCCGC CCCCCAACAG GCCATTTTGA GTCTGAATGT GGGCGAGGAA ATGAAAACTA TTTATGGTAA TACACTGACC AACACGGTGG TGAACGAGGT TGATGCAGAG CCGGAAGAAC TGGCAGACCT GTACGAAATG ACCGACGTAA TAATTGACGG CGAAGATGAT GACGAGTTCT ATGCTGTCGA CGAATACGAA AGTAGCTATG ATGATGACGA CGAGTAATTA TGTATACTAA TTGCTGTTTC GAAATCGCT
|
Protein sequence | MSNFGDVGML MPNRPPTTDM RTRTPCEWQP VLDCTHRPSV DDPKNMASLI GGLSFESFVR RGPVDRAPPG ITVRIHVEES RNTKTSCGVL VRDGESAHSV AFGSPMDETR TPQQEGQRTI PNMPTSFTMN PARSRHGQLH RSSVRKALLP LLVSMVHAGR LPLGILLSSP SAFSEIRTNS ESALGTVRHQ LHNEAFGGSG DQTRFRYPGR KGPLCAVTKL RGGDVSDGED ESYDSDGEDT NDSDEEEEAE IAAVPESRDP QSEPEDESEA HPLSSVISME PVPITIKTSL GTKALDHIVE LTVHRSRNIA SLKLSASRQL PGRPPVSVMH MLLNGKVLSD EMLLDELIDD DDEKDDTDGT GSESQLTLTL DMLPPVDPKF VGQLESQMKD MTTAELLGTF AANEAALYQN AALLLAEQME PVYDDDQVEV GISEVATHPP PLVNVQVREQ AARIRRDLES KILASEHSQK ILADPLPPSA KLADLQRVER RGQRVRRVAG SGGVTTGLKR SIQKNLNVHW GDAIRNFCLF LFFGYFGGRT PVSRAILLLG APSVFVLQAR PVKLWIKCLM YAMLDHPPGI FLSLLPAPQQ AILSLNVGEE MKTIYGNTLT NTVVNEVDAE PEELADLYEM TDVIIDGEDD DEFYAVDEYE SSYDDDDE
|
| |
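The Start bp, End bp and Gene Length fields above are consistent with inclusive coordinates (68565 - 66437 + 1 = 2129), and both sequences are displayed in space-separated blocks of 10 characters. The following is a minimal sketch, not part of the record itself, of how those fields could be cross-checked. The function names `feature_length`, `strip_blocks` and `gc_percent` are hypothetical helpers introduced here for illustration, and the assumption that the coordinates are inclusive is inferred only from the numbers listed.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(seq.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C characters in a nucleotide sequence."""
    cleaned = strip_blocks(seq).upper()
    if not cleaned:
        raise ValueError("sequence is empty")
    gc_count = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    print(feature_length(66437, 68565))
    # First two blocks of the protein sequence, used only as a short example.
    print(len(strip_blocks("MSNFGDVGML MPNRPPTTDM")))
    # First two blocks of the gene sequence, used only as a short example.
    print(gc_percent("ATGTCAAACT TCGGTGACGT"))
```

Passing the full Gene sequence or Protein sequence field to `strip_blocks` and taking `len` of the result gives a value to compare against the Gene Length and Protein Length fields.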