Gene Information |
Locus tag | PHATRDRAFT_3559 |
Symbol | |
ID | 7198979 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 292418 |
End bp | 293539 |
Gene Length | 1122 bp |
Protein Length | 351 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185168 |
Protein GI | 219130010 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0310467 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAGAGTTTC CCAAAATAGA GCTGCACGTT CATCTCGACG GAAGTTTTGA CCCCTTATTT TTGTGGAAAT ATATGCAAAA GCATCCCGAA AGCATGTTGT GTCTGCCAAC GGAAACCGTA CCTCCTTGGC AGCCAACAAG GAAACTTGAA ATCCGCAAGC TTGTCGAAGA CTGTACCACA TCGCAAGAAT ATCACAAGCT TTGCACATGT CGTGGATACC GTTCGCTCCA AGAGATGCTA AATTGTTTTG AAATGTTTTT ACCTCTCGTT CGACGCAATT TGGACCTGCT GGAACAACTC GCGTACGATT TTTGTCAGCG CCAATGGGAA CAAAATGTTG TATATACGGA GGTGCGCTAC TCCCCCTTTT TGCTTGCTGA AAGTTTTGAA GTCGAAAATA AGAACTCACA GTCAGTGGAC GCCGAAGCGG TCTTTGCTGC CATTACCAGT GGACTACGTC GCGGATCACA CAAGTTTGGT ATTATTGTGA ATCAGATCAT TTCCGCAATC ACGTGGCGAC CCGACTGGGC GATGCCTTCA CTGGAACTCG CCCAGAAACA CCGCGAAGAC TATCCATGTG CAACCTTAGG TATCGATATT GCTGCCGGCG AGGAACATTT TGACAGGGAC CAGCACTCGG CGCTCTACGA ACCCCATTTT GCCATGATTC AAAAAGCCAA AGAGTATAAG TTGCCAGTTA CCCTGCATGC GGGAGAAGCT GCGATGGAAT CTTCCATGGA TAACGTACGC CGGGCAATTG ACGTATACGG TGCAAGCCGT ATCGGGCATG GTTATAGGAC GGTCAACGAC TTGGATCTCA TAAACTATGT GAAGGAAAAG AAGATTCACT TCGAAGTGTG TCCAACATCG AGTGACGAAA CGGGCGGTTG GATGTACAAG GAAGAAAAGA ACTGGAAGGA ACATCCATGC CTTGCCATGC TCAAGCACGG CATTCCCTTT TCGCTCAATT CGGACGATCC AGCGGTCTTC CACACCTCCT TATCGTGGCA GTACCGGATC GCTTTGGCCA AAATGGACTT GACGCGGGAG GACATTGTCA AATGCAATCT GCAAGCCATT GATGCGGCTT TCTGTTCCGA GGAGCGGAAG GTTGCACTGC GC
|
Protein sequence | QEFPKIELHV HLDGSFDPLF LWKYMQKHPE SMLYCTTSQE YHKLCTCRGY RSLQEMLNCF EMFLPLVRRN LDLLEQLAYD FCQRQWEQNV VYTEVRYSPF LLAESFEVEN KNSQSVDAEA VFAAITSGLR RGSHKFGIIV NQIISAITWR PDWAMPSLEL AQKHREDYPC ATLGIDIAAG EEHFDRDQHS ALYEPHFAMI QKAKEYKLPV TLHAGEAAME SSMDNVRRAI DVYGASRIGH GYRTVNDLDL INYVKEKKIH FEVCPTSSDE TGGWMYKEEK NWKEHPCLAM LKHGIPFSLN SDDPAVFHTS LSWQYRIALA KMDLTREDIV KCNLQAIDAA FCSEERKVAL R