Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47614 |
Symbol | |
ID | 7202664 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 276755 |
End bp | 279977 |
Gene Length | 3223 bp |
Protein Length | 744 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182040 |
Protein GI | 219123456 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.81104 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTACGAGCAG AAACAAGTTG TGGGCGCCGA GATTACAGTA ACACTCCCAT CATCGGAGAC GACACCCGAC AAAACGCATG GAAGCTAACC CAAGCAGGAA AGCGATGCTC GTGAACGTGC GATATCCTCC GATACTGCGG TATCTGCACA CGCATAGCAT TGTTGCTTTC CCGTGTCATA TTTCAGTTCA CAGAAACATC TCTCCCGTAG CTTACATCTA TATTTCATTT CCTGTACGAG GTTGCAGCCC TTTTGAAATT GTCCAGGATG CACTGGTAAG GAGTCGTAGC TAGGAGGACC TTTCTTCGTT GTTGGGAATT ATATTATATA CTTTCTACAG AAAAATTTCC TTAAAATAAT CCAGTGACAT GTTTCGATAG AGCTCACAGT CAGGCCTTTT AAAATAGTTC TGGCTGGGAT ATCATACCTA TTCAAAGAGA GAACATTCGA TCCTTCGCAG GAAACCAAAA TTTGGTTTGT CGCCTGAGTG GACGCGGCTG TGATTCGGGC TATTTTTTTG TCGGAAGACA TCATTCCTTT TATTATTGTG AAGCCGCAGT ATGGATTATA GGAACGATAG TCTTGTTGCT TCAGTCTTTG AGTGGACATT TAAAAGGCTG AAGTCAAAAC AAGTGGGTTC AAATGCCCCG TTGGTGATTA ATCTGCCATG CCATCGTTGC GTTTGTTCAA TCACAGGACA TCCGTTGCCG GCGATGATCT CCGTTGTCAC TCCACTTTTG CGATTCTTCT GAGACTTCTG CAGGTCGCTC TCTTAGTCCC AGCAATGATT TACTTGGTTA GGATCAACGC AGATCCAATG CACTGTGAAA GCTCTGGAAT TCTTCGCCAT GGCCACAAAT TGTTTTATAC CTATTGGGTG TGCTCCGTCG TATTGGTCTT TTTCTCGTTG CCAATCGATT ATCTAATATT TCGAACTTCA GGTAGAGGCA CGCCTTCCCA ACCAGCAGCA AGAGCTTGGT TGAGGCCTCT GTGCTACATC AAAATCGTTC CTCTATCGTT GTTAAGAATC ACAACGTTCG GTTTGGGGGT CGTTGTAGCT GTATTCCTCA AACAGTTTTG CACTTGCGCA ACCGATAGTA TCGTGGATCC AGCAGAAAAT CTGAACGCTC TATTGGCTGA GATACGCACA TTTCGGTGTC CAGATTATGA TTCTATGATG GTTGTTCTCC CAACGCTGAT CGCTACTCAC GCTTCAGAGG TCATGGTCGC AACCATAATC TACTTTTATT TTGTTTTCAA GACAGTTCGA CCGGGACGAT GGTTGCGACC TCTGCTTGGA ACACCAGAGT TTCGCTGGGA AGCCTTTTGT TGTTGCTGCT GCACGCTCAG TTCCGTTTTC ACGTGCTGCA TGTTTGGGGG ATGGGAGGTG AGAAACTCAA GTTTTGCCGA TATCAGCTTG GCGCTGGCTG AGTTCCTGGA CGATGGCGGA AATTTGGATG TGACGCTTTC GGATCTGGTA GCCGGAGTAA TGATGGTCGG TCGGGAGCAC AAGGAAAAGT GGGACCGAAC CCGAGGATTG CTCGTTGAAA AAGTCAAGTC TAGCAGACAT CATGTCGCTC ACAGTCAACC TTCTGCTGTC TCAAGTCAGT ACACACACAG TAACATTTAC ATGCTTCGAT TGAAAAGAAT TCATGAAGAT AGAAGCACGG CCTTCGAGGT AGACGAGCGT GAACCGCTTT GTGTTCACAA TATCAGCGAT GTAGATGTTG TTCAGGAAGG GGCCTATTAC ATGAGGTTTG CCCTCGCCAT TTACGGCTAC ATCATGTACG TAATGAATCA TCGAATGGAA GGGCTATGCT GCTTGACAGC TCAGTGCGCA GGATGCTTCC GATCATGTTG CACCGAAGAA GTCATCGGTG ATAATATGTG CGGCTGCAAT GCGAGTGCTT TCTTGAGGGA GACAGGCATG GAAGTTTCCT GTCTAGCATA TTTTAGTTGT CGATCGGGTG TAGGAAAGAT TCCGTACTGC ATTGTTGTAG ATAAGGAAAA GGGGAGCGTT GTCGTCGCGA TACGAGGCAC TTTAGCCATT GAAGACGTCG TCGCAGATTT AACTATCCAC CCGACACTCC TTGCCGCATT TGGACAACAG TACGACTTTG TGGGTGACAA TGCCTACGCT CATTCAGGTA TGTTGAAATG TGCAGAATGG ATTGCTGAAG ACATGAGAGG CCATGGAATT CTTAGGAGGC TTTTGCTGGA CGAGCGCTCA GATACCAGTG ATTTTCGGCT GGTCGTAACT GGGCATTCGT TAGGAGCAGG ATGCGCCGCG ATCTTATCCC TCTTTCTCAG GAAGGACTTC CCCTGTTTGA GGTGTTTCTG CTTTGAGCCC CCCGGCTGTG TTTTGTCCGA CCAATTGGCG GACTTTGACT GGATGATATC CTTTGTTCTT GGAGATGACA TTGTCCCGCG TCTGTCGTTT GAATCGCTCA AGAACTTACG CGACGATGTA CTGTCCGCTA TTCAACGGCT TAAAGTACCA AAGCACAAAG TTTTTGAAAT CTTCCAGCCA TTGAATTGGA AGAAGTATAG TGATCACACA AAATGGAACA GAAGAATGCT CCACCGGACG AACTCAACGC CACAATCCGA ATTTGGATCG CAATTATCTG CCTTTCGTGC TCACCAGCAT GGTAAGTTTA GTTGTTGTAT CGGCAGATTG GTGCGCGATA GGCCAAAATC TCAACAAAGT TTCTAGAAAG AGCAATAGAG CGTGGAATGG CTGGGCAAGA ACTTCATCCT CCAGGCAAGA TCATCCACTT GGTTAAAGCC TCTGACACGT CGAGCTCTAC TAGACCCTTT CTACGTTGGG AAAAGGACGC CTACGTTCCT GTCTGGGCCA AGCGTAAGGA CATATGTGAA ATCAATCTAT CTGGTTCCTT GCTCCTGGAC CACCATGCTG GAAAGGTATG CACCGCGCTT CAACGGATAG CTGCATCCTT CGGGGAAAAG AGTCCTGTCG CAGGACCAGG CAATGTATGT CAGGCATAGG CAGTGTCAGT GACTAAACTC TTTTTCTTTG CGCGGGAAAA TTCATTTGAT AGGTCTCTCC GACCATATTG TTTGTTAGAA TAAGGTGTGC AGCCAAAGGA ATTGATGTGA ATAGAAAATG TCAGGCACAA CATTACTACT GTCAGTGACC AATCTAACTT TCCATTTTCA AAATAGTACG TACGCCTATC ACTATCGCTG ATTAACGAAT TAG
|
Protein sequence | MPSLRLFNHR TSVAGDDLRC HSTFAILLRL LQVALLVPAM IYLVRINADP MHCESSGILR HGHKLFYTYW VCSVVLVFFS LPIDYLIFRT SGRGTPSQPA ARAWLRPLCY IKIVPLSLLR ITTFGLGVVV AVFLKQFCTC ATDSIVDPAE NLNALLAEIR TFRCPDYDSM MVVLPTLIAT HASEVMVATI IYFYFVFKTV RPGRWLRPLL GTPEFRWEAF CCCCCTLSSV FTCCMFGGWE VRNSSFADIS LALAEFLDDG GNLDVTLSDL VAGVMMVGRE HKEKWDRTRG LLVEKVKSSR HHVAHSQPSA VSSQYTHSNI YMLRLKRIHE DRSTAFEVDE REPLCVHNIS DVDVVQEGAY YMRFALAIYG YIMYVMNHRM EGLCCLTAQC AGCFRSCCTE EVIGDNMCGC NASAFLRETG MEVSCLAYFS CRSGVGKIPY CIVVDKEKGS VVVAIRGTLA IEDVVADLTI HPTLLAAFGQ QYDFVGDNAY AHSGHGILRR LLLDERSDTS DFRLVVTGHS LGAGCAAILS LFLRKDFPCL RCFCFEPPGC VLSDQLADFD WMISFVLGDD IVPRLSFESL KNLRDDVLSA IQRLKVPKHK VFEIFQPLNW KKYSDHTKWN RRMLHRTNST PQSEFGSQLS AFRAHQHERA IERGMAGQEL HPPGKIIHLV KASDTSSSTR PFLRWEKDAY VPVWAKRKDI CEINLSGSLL LDHHAGKVCT ALQRIAASFG EKSPVAGPGN VCQA
|
| |