Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35958 |
Symbol | |
ID | 7201427 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 193868 |
End bp | 195240 |
Gene Length | 1373 bp |
Protein Length | 423 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180590 |
Protein GI | 219119671 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGCAGA TCCTGTCCTT ACCATTCAAG CTGTGCAGGC ATACCGCGAC CTTCTATCGC GGCTTTGTGC ATTACTGGAT TGGACAGGGC CGAAATTCTC CTTACCAAAC GCCCGAGCAG TGCACCTTTG CGCCCCTGCG CGAAACACCG ACGGACTCGC CGACGCAAAA GCTCTTTAAG CAGCATGCTC GAGTACATCT TTACAGCCTC GCGTCCAACT TTTACCTCTA TCACAAACCG CACTATCGAA AAGGCTCGTA TCGGGATGAT TTGATCGACA ATCTACGCAA CGTGGCCATT CCAGGCACGG GCATCCCCTT GTCACTCATG GCCTCGACCC GTCTTACAGC CCTGGGTTTC TTGTTCTCGG CTTATCCTAC GGTCAGTCTG GTTGCCGCTG TGCATCAGTG GATAAAAACT CGTGGGAAGA CTAGCATTTC AGAGGAATAC GCCACCCGCT TGTTGGCCCC GAATGATTGG TTCTCCTACT GGCGCTTGAA TTGCAACATT GTTGGTCTCC ATTCCGTTCT CAACGATATG CCGGTCGATT ACGAAATGGA AAACAAATGG ACCTTTCTAG AAAATGGTAA AAAGCGGGGT GTTCCTATTT CACCCTACCT AACCACACCC GGTATTGTCG TCAAGCATCG CAACGAAGAA GGCGGGCTCG GCATTCATTT CTATAGAAAT GCCGTCGACG GTGGGGACTG GATTATTCAG GAGCGCATCC AAAATTCCGA CTGGGTGCAG TCGATGCTTC CCGCCAAGGC GCCGTTGTCC ACTTTTCGTG TCATCACTTG CAGTGCAGCG TATAATGTAT CCGAAGCACC TAACGTACGT CGGGATTTTG GAATGAACAA ATTTCGATGG CGCTGAGCTC GCCCGATATT TTCACTTTCA CGCGTTCTCA ATTTTGTTTA TTCTTGCGTT TGCAGCGCGC TGACGTGAAA GCTCTTTCCT GCGTATTCCG TGCAGGCCGA GCTGGTGCCG CCACCGATCA CGATTCCATC TTGTTTGACG TCGATGTCAA AACTGGAACC ATAAAGGGAG GGACGACCAA CGCGCACTGG TACCGACTCG GTTTGCACGA AGCCCTACCG GGACGTTGTC CTTGGCGATC ACACCATGAT TACAGCCTTC ACCCGGATGG TGACATTCCC GTGACGGGCA ACCAAGTTCC TGATATTGCC CAAATGCTCC AGTTGGTTGA GCAATCCCAT TTCGACATGT GCCCACGGGT ACCCATGGCT GGCTGGGATG TCGTCTTTTC GGCTGATCCC GAGGTACCAA TTTGCCTGCT CGAGGTTAAC TTGAGTTGTA ATTTTTTTCG GGGCTCGTTC GATCAAAAGG TATTGTCTCG GTTTTATGTT TGA
|
Protein sequence | MGQILSLPFK LCRHTATFYR GFVHYWIGQG RNSPYQTPEQ CTFAPLRETP TDSPTQKLFK QHARVHLYSL ASNFYLYHKP HYRKGSYRDD LIDNLRNVAI PGTGIPLSLM ASTRLTALGF LFSAYPTVSL VAAVHQWIKT RGKTSISEEY ATRLLAPNDW FSYWRLNCNI VGLHSVLNDM PVDYEMENKW TFLENGKKRG VPISPYLTTP GIVVKHRNEE GGLGIHFYRN AVDGGDWIIQ ERIQNSDWVQ SMLPAKAPLS TFRVITCSAA YNVSEAPNRA DVKALSCVFR AGRAGAATDH DSILFDVDVK TGTIKGGTTN AHWYRLGLHE ALPGRCPWRS HHDYSLHPDG DIPVTGNQVP DIAQMLQLVE QSHFDMCPRV PMAGWDVVFS ADPEVPICLL EVNLSCNFFR GSFDQKVLSR FYV
|
| |