Gene Information |
Locus tag | PHATRDRAFT_46344 |
Symbol | |
ID | 7201896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 56932 |
End bp | 58635 |
Gene Length | 1704 bp |
Protein Length | 483 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180743 |
Protein GI | 219119989 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00912933 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTCGCTTTTT GACGGTGAAC TTCATTTCTA AATTTATTCT GCGTTTCGGA ATTGTCAAGG AGGAACCAAA AATCGCATGG CTACAAGGCG AGACCGGATT GACGATTCCG TTTTGGAAAA ACCAAATCGA CTCGCGGATT GGCATCCTTC GTCGTGAACA CGTGCTCTTT TACATCGTCA CAGAAAGTCA CATCCAAGGT CATGTCTTGA AAGAAACAAT AGCAACAGAT GGTGTACAAA AATAGGAAGC GGCAGTTTAG GATCGAGACG TCATATGGAG CAATGATTTC GGTGGCGAGG GTCTTCGTTC TTTTTATGAG TGGCAGTCGA GCTGTTACGG AAACACACGA TCGGACGCGT TGTTCCAGTA CCCTCCGGAA GGATGAGTTT GATTGGGATG CCTGGCAAAT TGGCACCGGG AAGCCGTACG CTGCTGTCGT CGAAAGCAAC GGCAAACGTG ACGCTTCCTT CCTCGATGGA ACCTGTTTTG CCGACCGTGG TTGGAACATC TTGCCGTCCA TTGCCACCTT ACGGTCTATC AAGGTCGAAT CACCCAAAGG TGAGGCAAGT GTCGTTGATG CAACAACCCT TCACGTCGAT GGACATTCCG TTCGTGGTCG CATGGCACTG CTACGCTATC TCATCGAGGA ATGGGAAGAT TCGTTGTTTT TCCCTAAGCT CAATCTTCAG CCGGACGTCG CCATTTGGGG ATCCAGTGGG TCGACCAACG AGTATCACTG GTTGTGGGCG TACGGCGCCC AACTTGACTG GCAGCAGCGC TCAGGCCGAT TGGCGATCGG TAAAGACGAA CCGGATGATG GCGATCATAT TTCGGTACGT TCCTGGTGGG GCTACATGAA CTACTGTTTC TCCGTGGCCG TCCTCCTGGG CGCCATCGAA GCCACCAACA GCATTCAAAC AATCGACGAC GTCCAAGAAA CAGCAGTAAC TGTTGAATTA GATGCGGATA GTCAACGATT GATGGAAGAA GACTATGCCG TTCGGGATTG TGTCACATAC TGGCGCGACT TTTTCCAATG CGAGTATCCC AACTATCAAA CGCGAATACG CGACGCCACG AAGCAGTTGG ACGAGAAGCC CATAACTACC AATAACATCA AGGAAGAAGA CTTCCAACGG CTACGCTTTG AATTTCAAAA AGAAGTGTGG AAGGTACATA CGAAAGTGAT CGAGCGCGCG GCTTCTTCCG CAAGGTCCAA ACAGCTTTTG AATGTTTTGC CCAAAGCAGA GCAGCAATTT GGACTTGGCT GGTCGCGTAT GGTGGATATA TTGGCAGCGT CCGTCTTTCC GACCGATTTG GTGACCTTGA TCGAGGACGG AAGCGGCTTT CTGCCTTACA ATATTACTAT GACCACGGCC CTTGGGCAAA ACACTCCTAA GGTTGTGATT GAAAACCCTG CGAGCTACCG AGATCCCCAT ACACGGAGAA ACTTAGTGGC ACAGTCGCAA CAGCGTTCGA TTGACACGAC GCATCAATTG GTGCTTTTAC CCGACTGGGC TTTGTCCACA ATGGTACACT TCTGGAGTCG CGTTGTGCGT CAGCCATGGA TCAGTCGGGA AATGCCTGCT CGAGTCAATC GTTTGGTCCA CGGATCAGTG AAGGTCAAAC TAAAAGAGCT TTTCAGGGTG TTAGTTCTTT TCGTCAAGCC AAAAATCTAA GAAATAGTTC ACACCATTTC AAGG
Protein sequence | MVYKNRKRQF RIETSYGAMI SVARVFVLFM SGSRAVTETH DRTRCSSTLR KDEFDWDAWQ IGTGKPYAAV VESNGKRDAS FLDGTCFADR GWNILPSIAT LRSIKVESPK GEASVVDATT LHVDGHSVRG RMALLRYLIE EWEDSLFFPK LNLQPDVAIW GSSGSTNEYH WLWAYGAQLD WQQRSGRLAI GKDEPDDGDH ISVRSWWGYM NYCFSVAVLL GAIEATNSIQ TIDDVQETAV TVELDADSQR LMEEDYAVRD CVTYWRDFFQ CEYPNYQTRI RDATKQLDEK PITTNNIKEE DFQRLRFEFQ KEVWKVHTKV IERAASSARS KQLLNVLPKA EQQFGLGWSR MVDILAASVF PTDLVTLIED GSGFLPYNIT MTTALGQNTP KVVIENPASY RDPHTRRNLV AQSQQRSIDT THQLVLLPDW ALSTMVHFWS RVVRQPWISR EMPARVNRLV HGSVKVKLKE LFRVLVLFVK PKI
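
The length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the original record. It assumes that the Start bp and End bp coordinates are 1-based and inclusive, and that the coding region consists of the protein's codons plus one stop codon. Under those assumptions the gene span is longer than the coding region, which is consistent with the gene sequence above beginning upstream of the first ATG of the protein.

```python
# Values copied from the record above.
start_bp = 56932
end_bp = 58635
protein_length_aa = 483

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding region = one codon per residue plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Bases in the reported gene span that fall outside the coding region.
flanking_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, flanking_bp)
```

The first value matches the Gene Length field reported in the record. The remaining bases are sequence flanking the coding region within the reported gene span.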