Gene Information |
Locus tag | PHATRDRAFT_50443 |
Symbol | |
ID | 7199255 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 77117 |
End bp | 78819 |
Gene Length | 1703 bp |
Protein Length | 517 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185374 |
Protein GI | 219130442 |
COG category | |
COG ID | |
TIGRFAM ID | |
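
The length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp coordinates are 1-based and inclusive (the record does not state this, but it is consistent with the reported 1703 bp). The function names `span_length` and `coding_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 77117
END_BP = 78819
PROTEIN_LENGTH_AA = 517


def span_length(start: int, end: int) -> int:
    """Length of a genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: 3 per residue, plus a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)

print(gene_length)               # 1703, matches "Gene Length"
print(cds_length)                # 1554
print(gene_length - cds_length)  # 149 bp of the span is not protein-coding
```

The coding portion (1554 nt including the stop codon) is shorter than the 1703 bp span, which is what one would expect for a gene marked as spliced; the record itself does not say how the remaining 149 bp divide between introns and untranslated sequence.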
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00232774 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCTTTGTGA GATCCAAATA TCGACTCACT CTTTTCAGGA TCAGCTACAA GCCATTAACC TTCTGTAGCT TGTAGCTACC CCCTTGATAT TCGTATTCAA GGCTAAAAAG TATGAAATTT CTCCATTCTG CTCTGATAGT TTTGACATCG GCTTCGGCGT CGTCTGCGTT TACCGCTACC AATGTTCCGC TGAAAAGACC TACCGTCTAC AATAAAAGCA GTTTTTCTGC TTATCGCAGT ACGGCTCTTC GGTCAGCGGT TGCCCCGAAG ATTTCATCCG TCAATGGAAA GCAGCAGGTT CCACAACAAA CAGAATCAGC AAAAAAAATG TGGCTTATGA TTGCGATGAA GACGCCAACT GCGTCATCGT CGACGCATGC GACGACGAAC AATGCCGAAC TTCGCTCGAC GTTCGCATTC ACGGAAAATG GTATGATCTC TCCGGCTGGC GCAAGGCTCA CCCTGCTGGA GCCCACTGGA TTGACTGGTA CGATGGCCGC GATGCTACCG AAGTTATGGA TGCCTTTCAT TCCGAAAAAG GACGCGCTAT GTACAAGCGC TTGCCTGCCA GCTCTACTGA AAGTGTCGCC ATGTTGGAAA CTACTATTGC TCCTGACAGT TCCACACAAA TTGCTTTTCG CCAACTACGA GACGATCTGG AAAAGGAGGG TTGGTGGAAA CGTGACATGG TGCACGAATT TACGCAGCTT GGTATTTGGG CCTCGCTCGT GGTCGGTGCC GCCGTAACTG CACATTCAGC CCCTCCACTT GCTACTTTTT TGTTGGGACT TTCAATGACG GCAGCCGGTT GGTTGGGCCA CGATTTCATT CACGGCGTTG ACTCGTTCAC CGATCGCCTC CGGAATTTTG CCGGTGTTGC CGCTGGTCTC GGGCCTACCT GGTGGTCCGA CAAGCACAAC AAGCATCACG CTTTGACCAA TGAACAAGGC GTAGATGAGG ATATTGCTAC GGATCCTTTC TTATTCACGT GGGCGCCGGA TCCTAAGGAT GATTCACCCT TGCGCAAAAT CCAGCACTTG ATTTTCTGGG TTCCATTCTC GGCACTTTTT GCGCTGTGGC GTGTCGATAC CATGCAGGTA GTAATTGAAG CCGTCGAAAA CAAGCGTGTC GGGGCAAAAG GTGAACTGTA TGGACTTTTA CTGCACTATG CTGTGTTGTT TACTGTCTTT CCGGTCACTG TCTGGCTTCC CGCGATCTTT TTGAGCGGAC TGATGTCAGC CTTGATCGTC ACGCCCACGC ACCAATCCGA AGAAATGTTT GAAACTTATC AGCCAGATTG GGTCACGGCG CAGTTCCAAT CGACCCGCAA CGCAGTAACC ACCAATCCTT TTTCCGAATG GCTCTGGGGC GGCATGCAGT ACCAACTCGA ACACCATTTG TTTCCTTCTA TGCCACGCAA TCGCTATCCG GCACTGCGCG AACGCCTAAT TCAGTTTGCC GCGGACAATA AGATTCCCGG TGGCTACCGA GAAAGCGGCG AGTTTGAAAT TCTACGCATG AATTGGAATC TTTACAAGTC GGTGGCGGAA GCAGATGCGG TCCCTGGTGC ACCTCCTACT CGTGGTCGTC TAGGACAGCA AGGTGCAATT CGCGAAACAA ACAGTCCGGC TGCTCAGCAA GAAAAGGCGA AGATCGACCA GACGGTAGCA AAGGGGAATG GCCCGGCGTT GGAATCTGTG TAG
|
Protein sequence | MKFLHSALIV LTSASASSAF TATNFFCLSQ YGSSVSGCPE DFIRQWKAAG STTNRISKKN VAYDCDEDAN CVIVDACDDE QCRTSLDVRI HGKWYDLSGW RKAHPAGAHW IDWYDGRDAT EVMDAFHSEK GRAMYKRLPA SSTESVAMLE TTIAPDSSTQ IAFRQLRDDL EKEGWWKRDM VHEFTQLGIW ASLVVGAAVT AHSAPPLATF LLGLSMTAAG WLGHDFIHGV DSFTDRLRNF AGVAAGLGPT WWSDKHNKHH ALTNEQGVDE DIATDPFLFT WAPDPKDDSP LRKIQHLIFW VPFSALFALW RVDTMQVVIE AVENKRVGAK GELYGLLLHY AVLFTVFPVT VWLPAIFLSG LMSALIVTPT HQSEEMFETY QPDWVTAQFQ STRNAVTTNP FSEWLWGGMQ YQLEHHLFPS MPRNRYPALR ERLIQFAADN KIPGGYRESG EFEILRMNWN LYKSVAEADA VPGAPPTRGR LGQQGAIRET NSPAAQQEKA KIDQTVAKGN GPALESV