Gene Information |
Locus tag | PHATRDRAFT_45855 |
Symbol | |
ID | 7200961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 478753 |
End bp | 480782 |
Gene Length | 2030 bp |
Protein Length | 650 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180055 |
Protein GI | 219118571 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
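The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a sanity check rather than part of the record. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that the unspliced open reading frame includes one stop codon (the listed gene sequence ends in TAA). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 478753
end_bp = 480782
protein_length_aa = 650

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced ORF of N residues spans 3 * (N + 1) bases,
# counting the terminal stop codon.
orf_length_bp = 3 * (protein_length_aa + 1)

# Bases within the listed gene span that fall outside the ORF.
non_coding_bp = gene_length_bp - orf_length_bp

print(gene_length_bp, orf_length_bp, non_coding_bp)  # prints: 2030 1953 77
```

Under these assumptions the computed gene length matches the listed 2030 bp, and the 77 bp remainder corresponds to the bases in the listed gene sequence that precede the first ATG whose translation begins MATMEESLLD.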
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.306932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGACATTGG TATAGTATAT ATCAACTTTG CAGTTCGTTC CGAGAAATAC GAATAGTTCA TCTGTAGAGA AATCAACATG GCGACTATGG AAGAGTCTCT CCTGGATGAG ATCACGGTAG CTGCTTCCGT CCAGGAGCCA GTCAAAGTCC AGCTGGTGGT ATCGGTGCAA CCTAACGACG ATCCCATTCC CCAACTGGCG CACGATGGCA TTACTGCCGG CGATCTAACG ACGGCGATCC GTGTCTTGAA CGCCGTCGCC TCCCTATATC CGTCTCACAA GGGACAAGAC GAACAGCGCG AACAAGGATT GGAGCGCTAC AAACAGCCCA ATCTCCGGTC GTTCCGCAAA GCGCTCGCGG CTTGTCTCGA GCTGCACCGC CGCACAATGT TCAACGGCAA AGACGAGGAA GAACACTACG AACGACGACT GAAGGACCGT TCTTTGAAAC GGCAAAAGAC AGCGGAACGG GATATGAACA AAAAGTACAT TGCGAGTACG GCGTTGCGTC AAGGTCGAGT AGAGCGTTTG CAGCAGTTAC AAGACGATGC CGCGGACGAG GAACGTCACA AACTGCTCCT CGCCCTGCAA CCCGACGGAC ACGTCGACAC CACCTTGGCA CGGACAATCC CACTGTTGGA AGACTCCTCC GCCCCTGCTG AACACGTACA ACTCCCCAAG CTCCGTTCCT GTTACGTTTG CAAAGTGCGG TTCCGCGAAT TACACGAGTT CTACGATCAA CTGTGTCCCG CGTGTGCGGC ACTTAATTGG CAAAAACGCC ACAATTCGGC AAACCTGCAC GGCCGTGTCG CTATTGTGAC GGGATCACGA GTCAAGATTG GGTACCAAAC CTGTCTCAAA TTGTTACGGG CGGGCTGCGT TGTGGTGGCC ACCACCCGAT TCCCCAACGC CGCCGCGGCA ACGTACCGAG CCGAAGCCGA CTTTGATAGC TTTCGATCGA GACTACACGT GTATGGGCTA GACTTGCGGG ACGTGACGGG CTTGGAAGCA TTTACTCGCT TTTTGAAGCA AAAGTATACG GATGGGATCG ATATTCTGAT CAACAATGCG TGTCAGACAG TCCGCCGGCC CGTGGGATAC TACCGGCCGC AAGTGGAACG CGAACAGATG TTGTGGATGC AGGCCGATCA AACGCACAAA TCCTTGCTGG ATGACTGTGC CGACTTTGAA CGAGTTCGCC GGCGATTACA ATTGGATCAT AAACAACAGG CTTCCTTGCA AGTAGAAGGA AATAAGGATA ATGTTCCTGC TATTCTAGAC GTTGAAATGG ACAATGGAGG GAATGGTTTG TCGGAATCGG AAGAGGCAAC AAGGAACGAA AGTGCGCTTG TGGCGTCAAC AAAACCGAAC AACTCAGTCG CTTCGACGCC GTTCGAAGCC ACAGGCCTAT CGCACTCCGC AGCTATGTCC CAGATGGTGA TTCTTCCCGA AGACGCGGGT GTCTCCGATT CAGTGCTGCC CCCGGGAGTG TCGGATGTCA ATGGACAGCA GCTCGATTTG CGGACAACGA ATTCTTGGCT CCTAAAGATG GAAGAAGTAT CGACACCGGA GATATTAGAG TGTTTCTTCA TCAATGCTAT CGCACCTTTT GTTCTCAACT CTCGACTCAA ACCTCTTATG ACTACTCCCA ACACTGCCAA TCGTTCTGAC CGTTATATCA TCAATGTTTC GGCTATGGAA GGGAAGTTTT ATCGCTACAA AATGCCCAAT CATCCTCATA CTAATATGGC AAAGGCGGCT CTCAACATGC TCACACGCAC ATCAGCGGAA GACCTGGCGA 
AGCAGCATCG GATCTTCATG AATTCGGTCG ACACTGGTTG GATTAACGAT GAGAATCCAA CCGTTAAAGC CAGCAAGATT GCCGAAACCA ACTTGTTTCA GACTCCAATC GATGAAATTG ACGCTGCCGC TCGTATTTTG GATCCGATCT TTGAAGGCGT CAATGGCGGA ACAGAGTTTA AGAAAGATTA CGGCAAGTTT CTGAAGGATT ATCGCGAAAG TGAATGGTAA
|
Protein sequence | MATMEESLLD EITVAASVQE PVKVQLVVSV QPNDDPIPQL AHDGITAGDL TTAIRVLNAV ASLYPSHKGQ DEQREQGLER YKQPNLRSFR KALAACLELH RRTMFNGKDE EEHYERRLKD RSLKRQKTAE RDMNKKYIAS TALRQGRVER LQQLQDDAAD EERHKLLLAL QPDGHVDTTL ARTIPLLEDS SAPAEHVQLP KLRSCYVCKV RFRELHEFYD QLCPACAALN WQKRHNSANL HGRVAIVTGS RVKIGYQTCL KLLRAGCVVV ATTRFPNAAA ATYRAEADFD SFRSRLHVYG LDLRDVTGLE AFTRFLKQKY TDGIDILINN ACQTVRRPVG YYRPQVEREQ MLWMQADQTH KSLLDDCADF ERVRRRLQLD HKQQASLQVE GNKDNVPAIL DVEMDNGGNG LSESEEATRN ESALVASTKP NNSVASTPFE ATGLSHSAAM SQMVILPEDA GVSDSVLPPG VSDVNGQQLD LRTTNSWLLK MEEVSTPEIL ECFFINAIAP FVLNSRLKPL MTTPNTANRS DRYIINVSAM EGKFYRYKMP NHPHTNMAKA ALNMLTRTSA EDLAKQHRIF MNSVDTGWIN DENPTVKASK IAETNLFQTP IDEIDAAARI LDPIFEGVNG GTEFKKDYGK FLKDYRESEW
|