Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50362 |
Symbol | |
ID | 7199187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 84279 |
End bp | 86051 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185279 |
Protein GI | 219130244 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACAAG GGATCGTTCG GAGCCCTGTG GTTCCGTCCA CCTATGAGCG ATCACCTGTA CCGATCAAGG TTGTCGGACG CCCCGCCAAC GTGTCTAGCA ACAATAGATA TACTGCTTCC CCACCGCGAC ATTCGTCGCC GAGCAGAAAC AATCTTAGTT CTGTTCCGTC GAGACACCAA AGCAAGGCTC GTGATGGTAT TCCATACGTC CCACGGCTAG CAAATCCTTC CCCGACACGC TCGCGTATAA GCGTCGCTCG GACGACAAGC TCACGAGATC GCATGCTGCG ACGTCGGCGC GAAGAATCCC GAGATCCGCC GCCGTCCGAG CAATCCCCGC CCGAACGATT ATGTGATTAC GATACGTCCG CGACCACTTT GTACGAAATG CTCGAGTCTT CCAATTGGGA TGAGGCCCGA AGTCGATGTC GATCTCACCC GGAAGAAGCC CGAACTTGGA TCGTTCGGAA AGACAAAAGC TTGCAGGTTC GATGGAAGCT TTTGCCTTTG CACGCCGCCA TTATCTTCCA GTCCCCAAAT TTTGTGGTTT CTTCCTTGCT GGAAAAATAT TCTGCGGCAG CTTCTCGCCA AGACGACCAG GGTATGTTGC CATTGCATCT CGCCTTTCGA CACAAACAAG AAGATGAAGA TCTGCTTGAA CTATTGCTGG TACAGTTCCC CAAGGCGGTA ATCATGAAGG ACAGGCGTGA TCGCGTCCCT CTGGAGCACG GTCGTGATTG CAAGTACAGC GCCAAGCTCA TGCGGTTGTA CGCGGATGCA ACGGTGGCGG GATCCCGCGC TCTGGCCGCC AAAACGCTGT CGGGATGTGG AATGAACAGC AATCATACTG CCACAACATC GACAAACACC AGCGTCCGTC AAAAGTCGGA AACAGAGCAC GAGAATCAAA TCGCGGCGTT ACGGGCAAAG TACGATTCGA ATCTTCAATA TGTGAAACAG CAGTTCGAAG AGCGCATAAA CACGTTACGT GAGAAAAATA CTGTAAACAC TCGGCAAATG CGACTCTCTG CCGCAGCGGA ACGTCAGGTA GTAGCCCAAC AACATCAAGA CGAAATGAAC GATTTACGCG ACCTGCTGAG CCAGCAGGTT GGGAAGGATA GGGACCAAGT CAAACGATTA AGAAGTCAAG TTGAAAACCT GCAACGGCAA TTGCAGCAAA ATGAGACGCA AAATGAATCA AGCGCTGCCG AAATTGCACT CGTCCAGGCA TATGCAGAGG AATTAAAGGA ACACTTGGAG CAAAGTGTTC ACGATCAACT CCAAATTCGT AATCTTGCGC TTCAGCAACA AGATGAGCTG GACTCGCAGC GTCAGCTACG AACTCAACTC GTAGAGACCT TGCTGCAACA AGAAGACTCG AATGTACAGA ACGACCGGCT ACGAGGGTCA AAGATGTTAG AAGTATCTGA CAGTATTCGA AATCGCATAA CAGATCTATT AGAGAGCGCT CCATCGCTTG AGTCTCGCGA TCGCTATGGT TACAATCGTG TCAGAAAGGC ACGAAAGGAT ACCGTTGAAC TTGTCTACAG CAGTAAAACT AAACCGTCTG CCTCAGACGA AGGGGTACGT TTGCAGATGG ACCAGAACAT GAATCGTGGC AGCCCCCAGG AACAACATTA CAGTGAAGAG CCCATCGAAG TTCAGGCCCA AGAGTTTTTT GCTGAAATAC CACAGCCCAC GGGGGATGTT TGCGGCCAAA TTAAAATTTT GGGGGACGAC ATTAGTGCAA TTACTGATCA CTCGCACTAT TAG
|
Protein sequence | MVQGIVRSPV VPSTYERSPV PIKVVGRPAN VSSNNRYTAS PPRHSSPSRN NLSSVPSRHQ SKARDGIPYV PRLANPSPTR SRISVARTTS SRDRMLRRRR EESRDPPPSE QSPPERLCDY DTSATTLYEM LESSNWDEAR SRCRSHPEEA RTWIVRKDKS LQVRWKLLPL HAAIIFQSPN FVVSSLLEKY SAAASRQDDQ GMLPLHLAFR HKQEDEDLLE LLLVQFPKAV IMKDRRDRVP LEHGRDCKYS AKLMRLYADA TVAGSRALAA KTLSGCGMNS NHTATTSTNT SVRQKSETEH ENQIAALRAK YDSNLQYVKQ QFEERINTLR EKNTVNTRQM RLSAAAERQV VAQQHQDEMN DLRDLLSQQV GKDRDQVKRL RSQVENLQRQ LQQNETQNES SAAEIALVQA YAEELKEHLE QSVHDQLQIR NLALQQQDEL DSQRQLRTQL VETLLQQEDS NVQNDRLRGS KMLEVSDSIR NRITDLLESA PSLESRDRYG YNRVRKARKD TVELVYSSKT KPSASDEGVR LQMDQNMNRG SPQEQHYSEE PIEVQAQEFF AEIPQPTGDV CGQIKILGDD ISAITDHSHY
|
| |