Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21970 |
Symbol | |
ID | 7202974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 198131 |
End bp | 199793 |
Gene Length | 1663 bp |
Protein Length | 495 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182346 |
Protein GI | 219124093 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00490278 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTACTGTTG ATATCAGTGA AGACAAAAAA CTAATGCTGG CAGTCATTGG CTCGAACGAA ACATTGCGCT CTCTCGGTCC CGAGTCAAAT GATATAGCCC ACTTCGAGGC GACTGGACTC GATGGCGGAG ACGAGTTCAC GTTCAAGGGA TGGGTGCAAC AAAAGCTCTT CCTTGGTATT GAGCCATCTC CAGATATTAT AGCGATCGCA ACGATATACT TTGTGGAAGG CGCTTTGGGT TTGGCGAGAC TGGCACAAAC GTATTTGCTC AAGGACGAGC TTGCACTTGG ACCGGCGGAA ATGTCGGCAC TCACGGGGGC ACTGGTGTTA CCCTGGACTA TCAAGCCACT CTATGGATTT TTGAGTGATG GATTTCCATT GTTCGGGTTT CGGAGAAAAA GTTATCTGAT TGTAGCTGGT CTTAGTGGCG GGCTCTCTTA CTCTGTCCTT AGTTTGTCTG GCTTTTGGGA AAGCCTCGAC AAGGGCGTTG CCATTAGTGG TACTGTTGGT GCATTACTAC TGAGTAGCGC ATGCATAGCC ATGTCAGATG TTGTGGCCGA TGGAATAGTC GTGACTCGGA CTAGGCAGGC AAAAGATCCT GCAATAGCAG GCGGTCTTCA GTCTCTGTGC TGGGGATCGG CGGCTGTCGG AGGTTTGTTG TCGGCGTACT TTTCTGGAGC TTTGTTAGAA GTTATGTCTA TTCGGAGTAT CTTTGGTATT ACAGCTGTGC TGCCATTTAT GGTCGCATTG ATAGCGCTTC AAATGGAAGA GAAGCCTTAC GTAAAGGAAG AAGGGCACGA AGGATTGGTC ATGGGTGTCA AGGACCAAGC GAATGCTCTT TGGGAGGCAC TCAAACAGCC TTCCATTTGG AAACCTACGC TGTTTTTGTT TTTATGGCAA TCAACACCAA CATCCGATGG TGCGTTCTTT TATTTTATGA GCAATGATTT GGGTCTGGGA CCGGAATTTA TGGGACGTGT TCGACTGGTT ACATCGCTCG CCACTTTGGG CGGAGTTGTC GTATACAACC AATATCTGAA ACGAGTGCCC ATAAAATCCA TTTTGTTTTG GTCTACAATC GCATCCTTCC CGCTCGGCAT GCTGCCCGTT CTACTTCTCA CCCACGTGAA TCGCGAATTG GGTATTCCCG ATCAGGCCTT GATTTTTGGA GACGACATTG CCTTGGCGGC CCTCGGTGAA ATCGCCTTTT TGCCGACTCT TGTACTGGCC GCTCGTCTTT GTCCACCAGG GGTCGAAGCC GTATTGTTCG CCACACTCAT GTCGGTATTC AACGGTGCCG GCACGGTGGG AACCGAACTT GGCGCTCTTT TGACCAAGTT GTTTGGTGTG ACGGATAGCA ATTTTGACAA CTTGGTGTGG TTGACTGTCC TTTGTAACGT CACTTCTTTG TATCCACTTT TCTTTATCGG GTGGCTCGAC AAGATAGGGG ATGTCTCCGA AGAGGAGATG GAAAGCAAAA AGGGTGTGAT TGAAACAACG GCAAGAACTA AAGAAACGTA GAGGTAGATC AATAGGCCGC GAATGAAAGT GTTCGTAGTC GCAATTAGAA GATCCGGTGC ATCGTGTGGT GTCTGTCACC CAGCTGCAAC TCTTTACTGT TCACGAATGC CGATTTGTTG CTACCTCTAC TCGTACACTA ATT
|
Protein sequence | MLAVIGSNET LRSLGPESND IAHFEATGLD GGDEFTFKGW VQQKLFLGIE PSPDIIAIAT IYFVEGALGL ARLAQTYLLK DELALGPAEM SALTGALVLP WTIKPLYGFL SDGFPLFGFR RKSYLIVAGL SGGLSYSVLS LSGFWESLDK GVAISGTVGA LLLSSACIAM SDVVADGIVV TRTRQAKDPA IAGGLQSLCW GSAAVGGLLS AYFSGALLEV MSIRSIFGIT AVLPFMVALI ALQMEEKPYV KEEGHEGLVM GVKDQANALW EALKQPSIWK PTLFLFLWQS TPTSDGAFFY FMSNDLGLGP EFMGRVRLVT SLATLGGVVV YNQYLKRVPI KSILFWSTIA SFPLGMLPVL LLTHVNRELG IPDQALIFGD DIALAALGEI AFLPTLVLAA RLCPPGVEAV LFATLMSVFN GAGTVGTELG ALLTKLFGVT DSNFDNLVWL TVLCNVTSLY PLFFIGWLDK IGDVSEEEME SKKGVIETTA RTKET
|
| |