Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35537 |
Symbol | |
ID | 7200779 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 146446 |
End bp | 147568 |
Gene Length | 1123 bp |
Protein Length | 352 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179983 |
Protein GI | 219118420 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.991073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTCT CTTTTCTCCC TATTGCTTTA TTCTTGGCTT TCATCGGCGG TTACGCAGAG GAACTTTCTC GGGCGCAAAG AGTATCCTTT GGCGAACAGA AGACACGAAA TGCTCAGGTG GTCGTCGCTA ATTATTCTGA AGCGGAGATG CTTGGCTTCA AGATCCAGCA CCAAGTGCAC AGCCAAGCCG ACCTATTCGT CCAGCAGCTA GAAGGAAAGT TGCAACTCGT GAAGAATGAG ATGATATCAC TTGAGAATCG CGACCTGGCC AATCTCGATA TCTTTACTTC TTACGCCAGT GTTGTCGCCG ACGGCACAGA GAAGAACGCC GCGTCCTCAT TGTTTGTGAG CAGACAGGAT CCGGCCATAA CGATTGCACT GGATTCCGAA GGCAACTTGA GGGAAGCCGT ACGCCTTAAC CCTGAAATCG GTGAAGCAAT ATCCATTTCA CGCATTGATT CCCGAAAGGC GGATCGATTT GTTACGATCA CTGCGGAAGA CTTCGATCAA GACAAACTTG CTAGTTTTGA AGTAGAAGAC AGAGTAGCTC CATTAGCACG TCAACTCAGA AGCTCACACA AGAGCGGAAC AAGCAGAGAG AGGTCTCTCC AAGCCATCGG TGCCTGCTCC GAATACGGTT TTGACGTAAT CGAAGTTGCT GTTGTGGTGG ATTCCCTTCT CTGTGCTGCT GTAGGTGGAA CTGAAGGAGC TGCTTCCACC GCTGCACAGT CTGTCATTGC AGGCGCCAGC CAGTTCTACG AGGTTGACGG ACTTTGCAAG AAACTCCGCA TTTCGTATTT GGAAATTCAC TGCAATGCTG GTACCAATCC TATCGCTCCT TTGCTTCAAC AAGCAGGAAA CTCTGACATT TGTAATACCG ACGCAAATGG TTTATTGCAG AACTTTATAT GCTACACGGT AGATCAAGGT ATTGCCGCGG ACTTGAACCT TCTGTTCCAC GGCAAGTTCT TTACTGTTAG TGGCTCTCTA TCAACTGGCT GCGCTTTCAC TGGAACACTT TGTCTTACTG ATGGTACAGA TTCTGGAGTC AATCAGATCA ACTTTACAAC CGATCCCGTG TCCCGGGCCA AATTGGTCGC TCACAAGGTG GGCCATATCC TAA
|
Protein sequence | MSFSFLPIAL FLAFIGGYAE ELSRAQRVSF GEQKTRNAQV VVANYSEAEM LGFKIQHQVH SQADLFVQQL EGKLQLVKNE MISLENRDLA NLDIFTSYAS VVADGTEKNA ASSLFVSRQD PAITIALDSE GNLREAVRLN PEIGEAISIS RIDSRKADRF VTITAEDFDQ DKLASFEVED RVAPLARQLR SSHKSGTSRE RSLQAIGACS EYGFDVIEVA VVVDSLLCAA VGGTEGAAST AAQSVIAGAS QFYEVDGLCK KLRISYLEIH CNAGTNPIAP LLQQAGNSDI CNTDANGLLQ NFICYTVDQG IAADLNLLFH GKFFTILESI RSTLQPIPCP GPNWSLTRWA IS
|
| |