Gene Information |
Locus tag | PHATRDRAFT_41220 |
Symbol | |
ID | 7199051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 236009 |
End bp | 237271 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185238 |
Protein GI | 219130156 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
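The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(236009, 237271)
print(length_bp)                  # 1263
print(protein_length(length_bp))  # 420
```

Under these assumptions the results agree with the Gene Length (1263 bp) and Protein Length (420 aa) fields.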
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.132681 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTTAC GCTGTGTGTA TGTGTGCTCT TCACTTTTGG CGCTAGGTTC CAGTTTTTCG GGCGCCTCCT TTATAGGGTC GCTCCAAGCC ACGAAGGAAA TTTCGCGCCA TCCTTTCAAA TACACCAAGA CATTTCTCCC CATGGTCGGA GGCGGTGGTT TAAGTGATGG TGCAGGATCT GAGCTTACCA ACACACTAGC TCGTTTGGAT CAGCAGTGGA AGATTCAGCA AAAGTCAAAG CCTACTTCTC GTTGGTCGAA AATTATTTTG GATCGTGACA CACAGGAAGT CTCCGAGGAA CCACCAGAAA CGTATGTTCC TCCCCTACAA GAGAGGCAAG ATTTCGTATA CTTGCTAGAA CCACCCAGTA AGTCGAACCC TTCTTGTGTA ATCTTTTTTG TTGGCGGTGC CGGCCTAGGA CAATTCCCCC AAATAGCCTA CAACGAATTC TTGTTGCGTC TTTCGGACCG GCTGAACGCT GCGGTGATTG CGGCGCCTTA CGCTGTGGGA TTGGACCACT TTGGACTGGC GAAAAGCGTC GGGGAACTTA TGCGCAAGGC AAAACTTCAC TGTGAAGAGG ACTCGTCAAA ACTGTATCCG AAAACTTTGC CAACCTATTG CATTGCGCAT TCATTGGGGT GCAAGTTGTC CAGCATCTAC ATGGCAGCGA CAGAGCAAAC GTATGATGGC ATTGGTTTTA TGAGTTTCAA CAATTTTGGA TTTAGCCAAA CCATCGGTAT GGCCAAAACA TTTGCCGATC AACTGCAAAA AAATATTGGT ATCGGCCGTG GTATTCGACC TGAAGTGCTG GATCAGGTAT TTTCATTCGC AGAAATGGCG GTGGGTTCGA TTGGGTTGGA CTTCACTCCG AACCCCATGG AGACAGAGAG GTTACTAACG TTGAAGTATG ATGAAGAACA GCAGGAACGT ACGCGCCTGT TTGTTTTCGA TGACGACATG TTGGATTCGA CGCAGAACTT TGTGCAAGCT TGCAACGGGG CAGGTCCCGA TGTGTCGGGT TTGCCAGGGT CGCATTTGAC ACCCGTCTAT TTCAAGTTGG GCCTCGATGA ACTACCTGAC GAAGTGCGAG GCGTCGCTAA GGAGGCGTCA GGCGGGTTGG AATCCGCATC ATTTGGAAAT GAGGAAGAAC TCAACGCTTT GGTGACCGAA GTCAGTGGCT GGATTTTGGG AAAAGGTCCC TCGAGAAAGC CTTTGTGGCA AACCGAGCGA CCAACAATTT CTGGTTCGGC AGAAGATCAG TGA
|
Protein sequence | MRLRCVYVCS SLLALGSSFS GASFIGSLQA TKEISRHPFK YTKTFLPMVG GGGLSDGAGS ELTNTLARLD QQWKIQQKSK PTSRWSKIIL DRDTQEVSEE PPETYVPPLQ ERQDFVYLLE PPSKSNPSCV IFFVGGAGLG QFPQIAYNEF LLRLSDRLNA AVIAAPYAVG LDHFGLAKSV GELMRKAKLH CEEDSSKLYP KTLPTYCIAH SLGCKLSSIY MAATEQTYDG IGFMSFNNFG FSQTIGMAKT FADQLQKNIG IGRGIRPEVL DQVFSFAEMA VGSIGLDFTP NPMETERLLT LKYDEEQQER TRLFVFDDDM LDSTQNFVQA CNGAGPDVSG LPGSHLTPVY FKLGLDELPD EVRGVAKEAS GGLESASFGN EEELNALVTE VSGWILGKGP SRKPLWQTER PTISGSAEDQ
|
| |
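The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned and checked follows. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty, and all function names are hypothetical. Only the first three blocks of the gene sequence are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), an assumption: the record leaves
# the "Translation table" field empty.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the whitespace that separates the ten-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence in this record.
prefix = clean("ATGCGGTTAC GCTGTGTGTA TGTGTGCTCT")
print(translate(prefix))    # MRLRCVYVCS
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first block of the protein sequence. Applying the same functions to the full Gene sequence field would be the way to compare against the reported Protein Length and GC content.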