Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48417 |
Symbol | |
ID | 7203668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 396782 |
End bp | 398149 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182826 |
Protein GI | 219125100 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAAGA GCAAGAAGCG CAAGCACGCT TTGTTCCATG CAGCCACGAA AGTTCTTGAT GAGGAACAAT CTAACGGAAA GGACCCTGGC GATGATCTTG GCTCTGTTGT TTCGGCTGCC GACTTACAAA CGACCATTGA CACATTGCAA ATTCTCACCG AACATGGCGA ACTCCTGCAA GGCCGAGCCT TTAAGGATCT GCGTCGAGCA CTGCATCCGA TCATTGTAGC ACAAATGCGA TCGTACGATC AAGGAACGGA TTACAAAATC AAAGCAACTC ATGCGCTGCA AAATCTCAAG TGGTCCGATG CACTGGCAGC GCTCCAAGGT TGCCGGGATT TTCAACAAAT TCCAAAACAA GGCACGATTC AACGATGGGT CCGCGACTGT GATCAAGCGC CAACCGAATG CAAAATTACC CTACTTAATG CCATTCTGCA GGTGAACGGC CATTCAAGTA ACAATAAGCA TGATGTGGGT TTGGTGTTGG CGGAGCGCTT AAAAGATGGC AACTCAAGTC ATGCAGCAAA CGGAGGTCTG CTTGTTTTGG ATAGTTGGCG CGTACCTGGA CACGCATCCA GTTCGAAGAT AGAAAAAGAG GAGGTCATCA AATTCAAAGT TCCTCTGAGC ACCAACATTA TATTTCAAGA GAAAGCGTCT GAACGTAAAC CTCCGAACGA ATATGACCTT CTTCTTCACT CTGTCACTCC GGACTCGATA CAATGGTCAG ACCCTCCTCC TACCATCAAC AAGCACATTG TTCCGTTTGT CGACGATGTC TTTTTGCTAG ATCATGTATT GACGAGATCT GAATGCAACC AATTACGATC CGTAGCCACG ATGCTGGGTT ACCGACCCGA TCATCCTGTC ACCGTGGACA AGCCAACGGG CATTGATAGC TGTGAATGGC TTGTGGACGC CTCAATCATG GATCCACTGA ACGAACGGGT CAAATCACTG CTTCCTCCAA TAATGAAGGA AAGCGCTGTC GTCCATTCAA TTAATTCAAG GTGGCGATTT TTTCGCTACA GCCAAGATAG TGTCTATCGA CCCCACATTG ACGGTTCCTG GCCGGAGAGT CGTATCAATG AAAAGGGTGA ATATGAGTAC GATGAATCAG GATCCGTCAA GTCTTACTTG ACCTTCTTGA TCTACTTGAA TGATGATTTT GAAGGCGGGG AGACCCTCTT TTACATTCCA TCCAGTCAAG GCATGAGCGC CCGTGGTGTT GTACCGAAGG CAGGAGCAGT ACTTGTCTTT CCGCAGGGCA ACACAGCTTC GCTGATTCAC GAAGGTTCGG CTGTTGCAAA CGGCACCAAA TATGTTGTCC GAACGGATGT CTTGTATCGT GTCAAGGGAG AAAGATGA
|
Protein sequence | MGKSKKRKHA LFHAATKVLD EEQSNGKDPG DDLGSVVSAA DLQTTIDTLQ ILTEHGELLQ GRAFKDLRRA LHPIIVAQMR SYDQGTDYKI KATHALQNLK WSDALAALQG CRDFQQIPKQ GTIQRWVRDC DQAPTECKIT LLNAILQVNG HSSNNKHDVG LVLAERLKDG NSSHAANGGL LVLDSWRVPG HASSSKIEKE EVIKFKVPLS TNIIFQEKAS ERKPPNEYDL LLHSVTPDSI QWSDPPPTIN KHIVPFVDDV FLLDHVLTRS ECNQLRSVAT MLGYRPDHPV TVDKPTGIDS CEWLVDASIM DPLNERVKSL LPPIMKESAV VHSINSRWRF FRYSQDSVYR PHIDGSWPES RINEKGEYEY DESGSVKSYL TFLIYLNDDF EGGETLFYIP SSQGMSARGV VPKAGAVLVF PQGNTASLIH EGSAVANGTK YVVRTDVLYR VKGER
|
| |