Gene Information |
Locus tag | PHATRDRAFT_37283 |
Symbol | |
ID | 7201938 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 613568 |
End bp | 617034 |
Gene Length | 3467 bp |
Protein Length | 1033 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181236 |
Protein GI | 219121777 |
COG category | |
COG ID | |
TIGRFAM ID | |
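The Gene Length value follows from the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. The sketch below checks that arithmetic and compares the gene span with the coding length implied by the Protein Length field. The function names `gene_length` and `coding_length` are hypothetical helpers, not part of any database API, and the assumption that the coding sequence ends in one stop codon is ours.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue, plus an assumed stop codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


span = gene_length(613568, 617034)   # Start bp, End bp from the record
cds = coding_length(1033)            # Protein Length from the record

print(span)        # 3467, matches the Gene Length field
print(cds)         # 3102
print(span - cds)  # 365 bp not accounted for by coding sequence
```

Under these assumptions, the 365 bp difference is consistent with the record marking the gene as spliced, since the gene span would include intronic sequence that is absent from the translated product.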
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCGT TAGAATTGTT GGCCCAGGAG CGTCTTCCAA GACGTGACTT CAACCGGAAA CGGGTCCCGT ACCGGGACGC GTACGTGTTG GTAGGTTGGT CGGTCACGGA TGCGTACGGG CACTGAGGCA AAACGGCCAA ACAGGGTTGG ACACTGCGAA TGTCGGCGAT TTCCGCCGGA TCTTTTGGTC ATTTCGAACC GCCTCTCGCG CAATTGCACA CTTTTGTATT TCCGGCACAC GACCAACGAA ACAGACCCCC AAAAAAAAAG AACCGAACGG GTATCTCTAC CGGATTTGTG CTACCCAACC GTACCTCGTG GTGATTGTCT CCACTCCAGC ACCACCACTA CCACCCATTT CCTCGTAAGA CCAAAGAGAA CACGGATATC GAAGGCACCT TTATTTCAAC AAGCCGAACG AGAATAGTCG AACGTCGAGT ACAGCCTTCA TCCACAGTCC CATACATTCC CACCTACATC CTTGGATTAA CCCACCCACA CGTCCACCGA CCAGCATGCC ACCGTCCATT CTATCGGACG CCTCGGGCGA TGCCTTTAAT TCACCCGAAC AATTCGTCGT GGTGGTCAAT CCCTACTCGA CGGGCTGTTT GATTGCCAAA GAAATGCACA AGCGCGGATA CGTACTCATT GTCGTCTGGA CCAAAGGCTT CTCCGCGGAT ATGAAGACGC ACATTCCCAT GAGTGCCGGG CCGATGGAGT ACTACGCCGA AATTGACGAA CAAGACAATC TCGACGACAC CGCCGCGCTC GTCCGCAAAA CGGCCAACGA ACAGGGAATC GCGGCATGCC TCGCTGGAGG CGAAGCCGGA GTAGACCTCG CCGACGCACT TTCCGAACAT CTCAACCTAC TCACCAACGG AACCACTATT CCCAATCGAC GTGACAAAAA GATTCAACAA GAACTCATTC GACAAGTAGG TCTGCGCTCC GTTCGACAAG CCGGGGGTGA CCAGTTCTCG GACGTCGAGG ACTTTCTCCG GACGGAACCG TATCCCCTCG TCCTCAAGCC CATCGAATCC GCCGGATCGG ACGGAGTCAA GCTCTGCGAG AACTTTCCGC AAGCCAAGGA TCACTTTGAG TTATTGATGA AGTCACAAAT GGTCAACGGA GGGTCCTGTC CCGCCGTACT CTGCCAAGAA TTTCTCCGCG GCAAAGAATA CGTCGTAGAT CATGTTTCTC GCAACGGCGT CCACAAAACT GTCATGGTAT GGGTCTACGA CAAGCGTCCC GCCAACGGAT CCGCCTTTGT TTACTTTGGC TGTGTCCCGG TCGACTCCAA TTCCCCGGAA GCGCTTATCG TCATTCCCTA TATTCGCGGC GTCCTCGACG CCATGGGGGT ACAGAACGGA CCATCGCACG GGGAAGTCAT GATGACCGAA GACGGTCCTT GCTTGATCGA AATGAATTGT CGCGCCCACG GTGGAGATGG TAACTGGAGA CCCCTCTGCC GGGCCCTCAC TGGAGGCTAC TCCCAAGTCG AAGCAACCGC CGACGCCTAT CTAGACGCTT TCCAATTCAA TCGACTCCCC GATAAACCTC CCAGTCCCTT CAAAGCGTCC GGACAAGAAA TTATTCTCGT CAGTTATTCC CAAGGGACGG TCCAGAGTTG TCCAGGATAC GACGTGATCA AAAGTTTACC CTCGTTCGTC TGTTTGGAAA CGGGCGTTAA ACCGGGATCC GAGATAGACT ACACCGTCGA CCTCTTCACC GGCATCGGGA GTGTCATTCT CATGCACAAG GACCCGGCTG TGTTGGACCG CGACATTGAC TTTGTCCGGT 
ATATGGAGAC CATCAACGGC CTTTTTGTGT ACGAAACCAA GTTGGAAAAT CTCAAACGCC CCCGTGGCGA GGCCGTTACC GAAAAGGGCC ACCGTCGCGT TTTTTCTGCG GAAGGCCCTG GTCTCATTCG CCACATGTCC AACGATCGTC CCGAATTGCG CAATCCGCTG GTGAAACGTA TGACCACGGT CGATGCCTCG CGGGAAGTGG TCGTTATAAT CGACCCTTAC TCGACTGGAT GCTGCATAGC GGAAGAGATT ATCAAACGGG GCTTCAACGT GATTGCACTC TGGACCGAGG GATTTTCTGA AGAAATGAAA AAGCATGTAC CCCTCAGTGT GGGAAATGTG ACATACTTCA AAGAAGTGAA TCAGGCCGAG ACGTTGGAAA AGACGGAAGC GGTAGTTCGC AAAGCGGCGG AACTGTTCCG TATTGTTGCC TGTATCGCCG GCGGGGAAGC CGGAGTCGAC CTTGCCGACG CTCTTTCCGA ACAGCTCAAG GTTCGGACCA ATGGTACCGG CATTCCCAAC AAGCGCGACA AGAAACTCCA ACAAGAGCTT GTCAAAAAGG TGGGACTGCG GTCAGTACGC CAAGCGGGAA GCGATAAATT TGCCGATGTC GAGCCTTTCT TGCGCCGAGA ACCGTACCCG GTGGTCCTGA AGCCAGTTGA ATCGGCCGGG TCAGATGGTG TCAAACTGTG CCACAATTAC GACGAAGCAA AGCAGCATTT CGGGGTACTC ATGAAATCGC AAATGGTCAA TGGGGGGGAT TGTCCAGCGG TACTCTGTCA AGAATTTTTG CGCGGGAAGG AGTACGTCGT CGACCACGTG TCTCGCGACG GTAAGCACAA AACCGTCATG GTTTGGGTGT ATGACAAGCG CCCGGCAAAC GGATCAGCCT TTGTCTACTT TGGTTGCGTC CCAGTAGATT CCGATTCACC CGAAGCTCGT CTATTGATTC CGTACGTACG CCGAGTACTA GACGCCTTGC AAATCAAGAA CGGTCCATCG CACGGCGAAG TCATGATGAC GAACAACGGT CCCTGTTTGG TAGAAATGAA TTGTCGCGCA CACGGTGGTG ACGGGAACTG GCGTCCCCTC TGCCGCGCAC TCAATGGTGG CTACTCCCAG GTCGAATCCA CGGTCGATTC GTACTTGGAC AGTCGTCAAT TCATGATTAC CCCCGAAAAA CCACCTAGTC CCTTTAAAGC TCACGGCCAG GAAGCAATTT TGGTTTCATT TTCACGCGGT GTAGTCAAGG CCACTCCCGG TACGTTGGAT TGCTTATCTG TGTTGAGTTG ATTCCGGCGT TTTCTATCTT AAAATGTTTT ACAGTCAGAG ACTCACTCAA TCCCTTCTTT ACTATGGGCA TCCCCAGGTT TTGAGGAAAT TCAAAAGCTC GAGTCATTCG TCTATTTGGA GACGGGTGTT CGCGTCGGCA CCTTTGTTGA CTACACGGTC GACCTCTTTA CCGGAATAGG TAGTGTCATC GTCATGCACC AGGACGAAGA CGTATTGGAA CGCGACGTTC GTCGCATTCG GCAGTTGGAA TCGGAAAATT TGCTGTTTGA ATACGAAACT GGCAAGGTAG TCTTTTCCTC GCCGAGCAAC ATTCACGACA CTGGAAGCGT GACGGTGGCC TCCGCGAACC GTCCCGACTT GTATTAG
|
Protein sequence | MEPLELLAQE RLPRRDFNRK RVPYRDAYVL TPKKKEPNGY LYRICATQPY LVVIVSTPAP PLPPISSPIH SHLHPWINPP TRPPTSMPPS ILSDASGDAF NSPEQFVVVV NPYSTGCLIA KEMHKRGYVL IVVWTKGFSA DMKTHIPMSA GPMEYYAEID EQDNLDDTAA LVRKTANEQG IAACLAGGEA GVDLADALSE HLNLLTNGTT IPNRRDKKIQ QELIRQVGLR SVRQAGGDQF SDVEDFLRTE PYPLVLKPIE SAGSDGVKLC ENFPQAKDHF ELLMKSQMVN GGSCPAVLCQ EFLRGKEYVV DHVSRNGVHK TVMVWVYDKR PANGSAFVYF GCVPVDSNSP EALIVIPYIR GVLDAMGVQN GPSHGEVMMT EDGPCLIEMN CRAHGGDGNW RPLCRALTGG YSQVEATADA YLDAFQFNRL PDKPPSPFKA SGQEIILVSY SQGTVQSCPG YDVIKSLPSF VCLETGVKPG SEIDYTVDLF TGIGSVILMH KDPAVLDRDI DFVRYMETIN GLFVYETKLE NLKRPRGEAV TEKGHRRVFS AEGPGLIRHM SNDRPELRNP LVKRMTTVDA SREVVVIIDP YSTGCCIAEE IIKRGFNVIA LWTEGFSEEM KKHVPLSVGN VTYFKEVNQA ETLEKTEAVV RKAAELFRIV ACIAGGEAGV DLADALSEQL KVRTNGTGIP NKRDKKLQQE LVKKVGLRSV RQAGSDKFAD VEPFLRREPY PVVLKPVESA GSDGVKLCHN YDEAKQHFGV LMKSQMVNGG DCPAVLCQEF LRGKEYVVDH VSRDGKHKTV MVWVYDKRPA NGSAFVYFGC VPVDSDSPEA RLLIPYVRRV LDALQIKNGP SHGEVMMTNN GPCLVEMNCR AHGGDGNWRP LCRALNGGYS QVESTVDSYL DSRQFMITPE KPPSPFKAHG QEAILVSFSR GVVKATPGFE EIQKLESFVY LETGVRVGTF VDYTVDLFTG IGSVIVMHQD EDVLERDVRR IRQLESENLL FEYETGKVVF SSPSNIHDTG SVTVASANRP DLY
|
| |
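The sequences above are printed in space-separated blocks of 10 characters, so they need to be normalised before any length or composition check. The following is a minimal sketch of that step and of a GC percentage calculation, using the first two blocks of the gene sequence as input. `clean_sequence` and `gc_percent` are hypothetical helper names, and whether the database computes its GC content field over exactly this span is an assumption.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence in this record.
example = "ATGGAACCGT TAGAATTGTT"

print(clean_sequence(example))       # ATGGAACCGTTAGAATTGTT
print(len(clean_sequence(example)))  # 20
print(gc_percent(example))           # 35.0
```

Applying the same functions to the full gene and protein sequences gives a way to compare their cleaned lengths and the gene's GC percentage against the Gene Length, Protein Length, and GC content fields.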