Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47112 |
Symbol | |
ID | 7202026 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 478623 |
End bp | 480595 |
Gene Length | 1973 bp |
Protein Length | 540 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181386 |
Protein GI | 219122089 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0949637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTTGCTTCG ATGTGATTAG TGGAAGATAG CAAGATGGGC CATGCTGAGC CATTCTAGCT CCTTTAGCAG TAGGCGACTT GCTTCTCAAA TCGAAAAGCG ACCCATGATG GAATCGGTGA GGAGAAGAAG GTCGTCGCAG CTCAGTATGG AACGAAAACG CGACAAATTG GAAGATCCCA ACAAGAAACA AACGTTTACA AAATATCGAT GGTCAAGATG TTTCTGGTCC AAAATCATTC ATACTCTCCC AAGGAGGAAT TGCGCACGCA AGGGTGGCAC CATCATGCTG TTGCTGATTC TGCTGCATGT GTTATCGCGG GATTATCTTT GGGGCGGCCC ATTTGACATC AGAGGCGGAC TCTTCGGTGC TTGGGCTGAC GCTGGCTTGC GGCTCTTATT CTCATCGAAA GCAGCTACGC CGATTGCAAA AGTAGACTAT CCACATGTTG TATTGCGATC CAGTCGCAAA GAGATCCCCG GTTACAGTCT TTTTTTAGAA GAGCTGCTGT CGAGAGCAGC AGGAGAGAAT ATCGATTGGG ACGCTTCAGA AGTACGCAAC CACGACAATC CACGCCATGC GCATTCGGTG GTGAACTTTT TTGAAGGTGG AGATTACGAA TCATCTTCTA ATGCTTCCTC GGATTCTGTT ATTGACGTTT TGGCGAAACA GAACTGTCAA CCACGTTGGC TTTGTCAGAG ATGCTTAAAC GCAGCCCAAT ATGGGTCGCT GACTCAATGT CGTCAATTGT GCCCAGAGTG CCTTGAAGAT ACACTCTGCC AGCCTTCTTT GGTTCGCAAT CCACCCTTTT CTATCCTCAT GCAACGTCCA GTGACATCAA CGATTCCACG TATAGTTCAT ATGATGTGGC ACGAGCCACT GGACAGTTTG AAGTATCCGG AGCTCGTCCG GATTCAAAAC GGATGGCGGA GTACCGACTT TTCCTTTCGA TTTTACACGC CAGATACCGC ACGTCGGTAT ATTCAAAAGA GCTACCCGTT GCGTATCATA GAGGTGTATG ATTCTATCCA GTCGTTGTCA ATGCAGATTA ATTGCGCGCG AGTTCTGATT CTTTTGAGGG AAGGGGGCGT GTTTGCAAAT GGTAAGCCTT GGTGGTGGCT ACTTTTGAAT GCCCCTGTGC TTGTTCTCAA AAATCGAAAT TAATTTTTCC AAGTGGATTT ACTCTTGGAA GTCAATCTAG AGGTTCTCTT GGTTTCAGGC GTGTCCTTTT TTGCTGCGCG AGAAGACGAT ATGGAACACT GCCTTTGGAC TGGATTGGTT GGAGCAAGTC CTGGTCATGT GATTTTGGTT AAGGAAGCGG AAGAATTTTT AACACGCCTG TCTACAAAAG GAAGCTATTT TGACATCGAC CGTAGCTTAT GTACGGCGCT GGGACCGAGT GCCGAGCTGT GGAAAGCACG TGTGTATGTC GATGAAACAA CTATCGATTC CTGCGCGCTT GGCCGAGCCG TTCACACTGC TCTTGGAGAG CGGAATTCAG TGATGCATTT TTCATTGGGA AAACTTCAGA TTCCTTTAAG CAATAACAGA CTCTATGAAG GAGATGCCCT TATACTTCTC GTAAGTCTAG ACAAGATCAT CGGCAAACAC AGATACTTTG ATTTTTTTAA CGGATCTTGT CCCTCTTTTC TTACAGATGA GTAAATCTGA CACTGGTGCC ACTCGAATTT CTGACATCGA GCGAAATATA TTGATTGCCT CCACATCTAT GGTGGGACTC TCGAAAGAGA GCCTGTACGA GCGTATACAT CACGCCACAT TGCGGAACTC GAGAGCGGAA ACCAGCACTG TGAACCTAGG TATGGGAGTA ACCAAACAAA TTAGATTTAT TGATAAACGT TAACTTTCCG GATGCAAAGA GTGATCCTGA ACTGATTTGT ACACGTGTTG TCTATTTATC CGCGTGCCCT TTCTACTTTG TAGCCCATGA AGGGGACTGT CCTGGAACCT TAG
|
Protein sequence | MLSHSSSFSS RRLASQIEKR PMMESVRRRR SSQLSMERKR DKLEDPNKKQ TFTKYRWSRC FWSKIIHTLP RRNCARKGGT IMLLLILLHV LSRDYLWGGP FDIRGGLFGA WADAGLRLLF SSKAATPIAK VDYPHVVLRS SRKEIPGYSL FLEELLSRAA GENIDWDASE VRNHDNPRHA HSVVNFFEGG DYESSSNASS DSVIDVLAKQ NCQPRWLCQR CLNAAQYGSL TQCRQLCPEC LEDTLCQPSL VRNPPFSILM QRPVTSTIPR IVHMMWHEPL DSLKYPELVR IQNGWRSTDF SFRFYTPDTA RRYIQKSYPL RIIEVYDSIQ SLSMQINCAR VLILLREGGV FANEVLLVSG VSFFAAREDD MEHCLWTGLV GASPGHVILV KEAEEFLTRL STKGSYFDID RSLCTALGPS AELWKARVYV DETTIDSCAL GRAVHTALGE RNSVMHFSLG KLQIPLSNNR LYEGDALILL MSKSDTGATR ISDIERNILI ASTSMVGLSK ESLYERIHHA TLRNSRAETS TPMKGTVLEP
|
| |