Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45321 |
Symbol | |
ID | 7199964 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 834207 |
End bp | 835901 |
Gene Length | 1695 bp |
Protein Length | 438 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179518 |
Protein GI | 219117447 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.202846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATCGCCAA CAATAAATGA CCCGACAGAG AAGTTGATCG GCCACCTAAA GTCGGCGGCG ACAGAAGAAT ACAGAACGTG TTGGACCGGG AAACAAGGTC GTGTGGTCCC ACCGATTGGG AAACAGTCGG CAAAGGCAGT CGGCGTCGGC GCCACGATTA GGTTTTCCAC TCACCGGCAA CCCACAAATA CACACGTAGC GGGATCTACT CATATTTTTG AACAAAGCAC AACCACTTTA TTCACAATCA GAGCCTTTCC CTTACCTTTT TTGTTTCTCT CGTCGGCACG AAAGAAGCCA TGCGACCAGC GTTTCCCAAG AAAACGTCAC CATGGTTCTG GCTGCTCCTC AAGCTCTCCG TCGTCCTTTC GTTGACGTGC ATTGTTCTAT TCGCAAGCTC TCCCAACGCG ATGAAAATCA TTGTCTACGA CGATCCGTCG CACACTCGGT CATCCTCCAC AACACTGCCT ACTCCCGATT CCGATGAATC GGATTCACCA CACCCCGGTC ACTTTTCGAC GTTGCTCGCC AACGTCAACA ACAGTCTCCT GCACTCGGTA CAGCAACAGT GTACCGACTC GCAGCGACAG ACTATTCGTT CCCAGCTCCC ACCCCAAGCC TGCGAAGAAG ACAAAGGAAA ATACGGGGGG CGTCGGTGCT CTTTCTCACT GGCGACTCGC TGTTCGGACG CCATCTGGTT CGCCGAGTTT TGGAAAGAAG CCGCACAGCG GGGAGTCAAC CGACCCACGG CCATCTATGT CGGATGCAAC AAGGGAATGG ACGCCGTCAA CACGTTGCGC ATGATCTCGG AGAATCCCGA GTTCGACAAG TCGGTATGGC AAAAGGAAAT CCTCAGCAAC GTGCTCAGTC GGAACCAAAC GATTGTGCCA GGCGCGTGTA ATCAAGAGAA TGCCCCACAA TTCGACATTT CCACGCTGCA GGGTACAGCC CGTACAGCTA GTGACAGTAC AGCGTCGGTC TTTTGTATTG AAGCCATGCC ACAAACGGCC AATCAGCTGA ATCGCTCGTC ACATCAGCTT GGCTGGCAAA ACTCCTTCGT GGTAACCAAC GCCGCCATAT CGTCCACCGA CGGTGTCGCT TTGTTCCCCA GTGGACAAGC CGGACAAAAG ATTGGCGTGG AGCATATGGG CCTTGGCGAT TGTCTTCGCC AGAGTACAAA GCATCTGTGC GCGGAAGTTC AACAGTATAC GCTCGACACA TATTTCGACA AGTTCTTGCG CAAGGACTCG TGGATTGACT TTTTGAGTAT AGACGTGGAG GGCTATGACT GGGATGTCCT TATGAGCGCT AGCAAAGCCT TGGAACGGGT CAAGTATCTC GAATTCGAGT TTCACAGAGT CGGTAAGTGG TGAACCTTTG TTGAACGCAA GTTCCTTCTA CAGCGACGTT ATGTTTCCTA AACTCGTGTT GTCTTTTGCA GGAAGCTGGC AGCAGCATTC CTTAAAGTCG GCCATTGATC ACTTGGATAC GAAAGGGTTT GTGTGCTATT GGGCAGGTGC CCATGGCCAT ATTTGGCGCA TCACTAACTG TTGGATGGAG TACTATAATC GAAAAGTATG GTCAAACGTT GCCTGTGTGG ATCCTTGGTC GGCACCGGCA TTGGCAACTC GTATGGAACT CATGTTCCAG GAGACCCTGG CTGCTGGGGA TCAGATAAAG TATGTGCCAA AATAA
|
Protein sequence | MRPAFPKKTS PWFWLLLKLS VVLSLTCIVL FASSPNAMKI IVYDDPSHTR SSSTTLPTPD SDESDSPHPG HFSTLLANVN NSLLHSVQQQ CTDSQRQTIR SQLPPQACEE DKGKYGGRRC SFSLATRCSD AIWFAEFWKE AAQRGVNRPT AIYVGCNKGM DAVNTLRMIS ENPEFDKSVW QKEILSNVLS RNQTIVPGAC NQENAPQFDI STLQGTARTA SDSTASVFCI EAMPQTANQL NRSSHQLGWQ NSFVVTNAAI SSTDGVALFP SGQAGQKIGV EHMGLGDCLR QSTKHLCAEV QQYTLDTYFD KFLRKDSWID FLSIDVEGYD WDVLMSASKA LERVKYLEFE FHRVGSWQQH SLKSAIDHLD TKGFVCYWAG AHGHIWRITN CWMEYYNRKV WSNVACVDPW SAPALATRME LMFQETLAAG DQIKYVPK
|
| |