Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35949 |
Symbol | |
ID | 7201317 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 158939 |
End bp | 160236 |
Gene Length | 1298 bp |
Protein Length | 301 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180584 |
Protein GI | 219119658 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAGC AGCAGCAACA ACAACGACCC CTCGCGACTT CGGCCGTGCC TCCTCCGAGT CTATTGTCTC AAGTTGGCAT GGCTGGTTCC GCAGCCGTCA TTACAGTCTC TTTCATTCAT CCCATTGACG TTGTCAAGGT AAGCCTACCG CACATTCTCG TTGTCGGTGA ATCACGACGT GCGAACCACG GGGTCGAATC TCGACCGTAT TGGAACCCAC CGCACCAAGT TTGTGAACGC CACACTCTTC AGTATTCGAT CCACTGTAGC GCACGACAGT GAGATTCGAC GGGAAGATCC GGAACACTGC CGGCACGGAC ACGAAACTGT CTTCCGAACG AACTCTTTTG TCGTGCACTA TTTGAATGAA AATGGTATCC TAGTTTGTTA GTGCGCGACG TGACGTCGAT GGAGAGTGTA GGTTGACTGT GCGCGCATTC ATTCTGACTG TCCCTTTGTT ACCACTGCCA TTGCTGGGAC TGCAACTCAC CTTGACTCTT TCCATTACAC TCTTTCTGCT GCCTTTCTAG ACGCGCATTC AAATTTCTGC CGAATACGGA AACATGGGTA TGTTTGGTAC GATCAAGAGT GTGGTCGGCG AAGAAGGTGT TCTCGGTTTG TGGAAGGGAG TCAACGCGGC CTGGCTGCGG GAAGCATCCT ATACCTCGCT CCGCCTCGGT CTTTACGAAC CCATCAAGGT GGTCTTTGGA GCCGCCGACC CGGAGACGGC TACCTTTATG AAAAAATTCT TGGCCGGTAG TGCCGCGGGT GCGATTGGTT CAATAGCGGG CAATCCCTTT GATGTCCTCA AAACAAAAAT GATGGCATCC AAGGGCAAGC AAGTTCCTTC CATGGTCAAG ACGGCCAAGG ATCTCTACGC CAACCAGGGA GTTGGTGGAT TTTACCGTGG TATCGACTCG AACATTGTGC GTGCCATGGT TCTGAACGGA ACCAAGATGG GGGTTTACGA TCAATCCAAA GGCTACGTCG TTGCCGCCAC CGGTCTCGCC AAGACCTCGC TCACCACACA GTTCCTGTCC GCCGTCACGG CCGGCTTCTT CATGACCTGC ACCGTCTCTC CTTTTGATAT GATCCGAACC CGACTGATGA ACCAGCCATC CGATGCCAAG ATCTACAACA ACGCCTTGGA CTGTATGATC AAGATTGCCA AGAACGAAGG ACCCTTGACC TTCTGGCGAG GATTCATGCC CATCTGGTCG CGATTCGCCC CCACCACAAC CCTGCAGCTC GTCATTTTCG AACAGCTACG TGGCATGATG GGCATGAAGG CTCTCTAA
|
Protein sequence | MMKQQQQQRP LATSAVPPPS LLSQVGMAGS AAVITVSFIH PIDVVKTRIQ ISAEYGNMGM FGTIKSVVGE EGVLGLWKGV NAAWLREASY TSLRLGLYEP IKVVFGAADP ETATFMKKFL AGSAAGAIGS IAGNPFDVLK TKMMASKGKQ VPSMVKTAKD LYANQGVGGF YRGIDSNIVR AMVLNGTKMG VYDQSKGYVV AATGLAKTSL TTQFLSAVTA GFFMTCTVSP FDMIRTRLMN QPSDAKIYNN ALDCMIKIAK NEGPLTFWRG FMPIWSRFAP TTTLQLVIFE QLRGMMGMKA L
|
| |