Gene Information |
Locus tag | PHATRDRAFT_14688 |
Symbol | |
ID | 7203550 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 635242 |
End bp | 636672 |
Gene Length | 1431 bp |
Protein Length | 402 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182727 |
Protein GI | 219124891 |
COG category | |
COG ID | |
TIGRFAM ID | |
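The length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (an assumption, though it reproduces the reported 1431 bp). The helper names `span_length` and `coding_bases` are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int) -> int:
    """Bases needed to encode a protein of the given length (3 per residue, no stop codon)."""
    return 3 * protein_aa


# Values copied from the record above.
start_bp, end_bp = 635242, 636672
protein_aa = 402

gene_len = span_length(start_bp, end_bp)   # genomic span of the gene
cds_len = coding_bases(protein_aa)         # bases accounted for by the protein
non_coding = gene_len - cds_len            # bases in the span not encoding residues

print(gene_len, cds_len, non_coding)
```

The span exceeds three times the protein length, which would be consistent with the "Is gene spliced | Yes" field, although the record itself does not list exon boundaries.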
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCCGTTTCC GGAACAATGT CTGGTACAGT TGGTTGACCA CCTGCATTGG TCAGAGTGGC GCCGCTTGGA TCAAGTGGGG ACAATGGAGC AGCACTCGTA ACGATATGTT CCCCGACGCA TTCTGCGAGC AGCTCGCAAC GTTGCACGCC GCAGCCCCCG CACACAAATG GAAATTCTCC GAACAAACCT TGGAAAGCTC TCTCGGAATC GCGCCGGGAA GCCTGTTGCA AGTATTTGAT GAAATCGATC CAGTCCCCTT GGCGTCCGGT AGTATTGCGC AAATTCATAA GGCTGTGCTG GACGGCAAAT CAATGGCGGT CAAAATAAGA CACCCTAACG TCGCCGCTCT TATCGATATG GACTTTCGCC TCATGAAGGC CGCAGCCACC CTTCTGGATG CGATTCCGGC TCTTTCTTGG TTGCGCATTC GGGAATCGGT GGAACAATTC AGCCACACCA TGGCGGCACA AGCTTACCTG CACGTCGAAG CACATCATTT GGAAGTCTTG AACTACAACT TTCGGTCGTG GCCGCACGTG CGTTTCCCCC ATCCTTTCTA CGCAAGCTCG GCCGTTATTA TGGAAACGTT TGAGCAGGGA CAAATATGCA CCGAAATTTT CGATATGTAC GACGATGTGG CCACGCACAT GAACGACGGA ACCGCTGTAT CTACGGAGGG CAAAGTGACC GTCGAAGAAG CCGCCGAGGA CGCGGTTAGC CAAACGACCA ACTCTAACAA TGTCGGCGCC AACACGAGTA CGCCCGAGTC GACGGGAAAG GGCCGCATGA TTTCCACCAA AGCCCTCCAG GTTCAAGGAC ACGAGCTCAT CCCTCTCAAA ATGGCTCAAT TCCTCGTCAC CAACGGTGTT GCGCTGTATC TGAAAATGCT CTTGGTCGAC AACCTCATGC ATGCGGATCT CCACCCTGGA AATATCATGG TCGATTGCCA CTGTGATTCG TACGAACAAG CGCACAAGCG AGAACAAATA GCCATGGTAC CAGTCATGTC GGATCCCAAT CAAAAACTTG GCAAGTTTCG CATTGCGCTG GTAGACGCGG GCATGGTGGC CCAACTGACC GACAAAGAGA GTTCCACATT TATCGGACTC TTGGCCAGTC TGGGTGAAGG CGATGGTCAA CAGGCGGCCG AATTTGCCCT CCAGTTCTCT TTGGAAAACC ACATGAACGA AACTCAACGC TCTGCGTTTA CCGAGGACAT GGTAACCATG TTTGCCGAGC GCTGTCGCGG CTACGGAACC GGCGTTGACG TTGGATACGT CTTGCGTGGC GTTCTTGGAC TCATTCGCGA CCACAAGGTC CGTATTGACG CCAACTTTGC TACGCTGGTG GTGAATTGTT TGTGCATCGA AAGTTTGGCC GCCCGCGTTT GTCCCAGCTA CAATGTTCTG GACGCGGCGC GACCCTTGTT G
|
Protein sequence | PRFRNNVWYS WLTTCIGQSG AAWIKWGQWS STRNDMFPDA FCEQLATLHA AAPAHKWKFS EQTLESSLGI APGSLLQVFD EIDPVPLASG SIAQIHKAVL DGKSMAVKIR HPNVAALIDM DFRLMKAAAT LLDAIPALSW LRIRESVEQF SHTMAAQAYL HVEAHHLEVL NYNFRSWPHV RFPHPFYASS AVIMETFEQG QICTEIFDMY DDVQGHELIP LKMAQFLVTN GVALYLKMLL VDNLMHADLH PGNIMVDCHF MSDPNQKLGK FRIALVDAGM VAQLTDKESS TFIGLLASLG EGDGQQAAEF ALQFSLENHM NETQRSAFTE DMVTMFAERC RGYGTGVDVG YVLRGVLGLI RDHKVRIDAN FATLVVNCLC IESLAARVCP SYNVLDAARP LL