Gene Information |
Locus tag | PHATRDRAFT_50356 |
Symbol | SQD2 |
ID | 7199140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 64244 |
End bp | 65924 |
Gene Length | 1681 bp |
Protein Length | 507 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | glycosyl transferase, group 1 |
Protein accession | XP_002185276 |
Protein GI | 219130238 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCTCCTGTC TCACTTACAG CATTTTGGTG GTGAGTTTGG TGAACCAGTC ACAGCCCAGC ACCGTCATTT GTCACACAGT TTGACAATTC ATTGCCATAC CACACCACAG CGCTCCCTCC TAGTGGAGCT TTTCATCTTT GCTTCCAAGG ACAGGCAATG ATCGTTGTAA AATCGGCGCT CCTGATTGTC TGTTTGGTGG CTACAGCATG CATTGGGTTC GTCTCATGCT TTCAGCCATA CACATCCGCA ATCCCTCCGC CTTGTACTTC CTCTGTCAGC TCATCTTCGG TCCGGCTAGA AGCCGCCGCG AGGTCATCGG ATGCGCCACC GCTACGACGC AACCCTCCCC GTAAGATTTG TCTCATGGTG GAACCTACTC CGTTCACTCA TGTTTCTGGA TACGCGAATC GTTTCAAAGA AATGCTCAAA TTCATGGCCA AAGCCGGCGA CGAAGTTGAC ATTCTGACCG TCGACACCAA GACTCCCGCA CACGAACTGC CCACCGCATG CAGTGGATTC TCCATCAAGC ACACACAGGG CTTTACCTTT CCCCTTTACA ATCAGATTTC CCTCACCTTT GATTTGCCAG AAATGAAAGG CGCCCAAATG CTGGAAAAAT TCAAACCCGA CCTCATTCAC GTCACATCTC CCGGCTTCAT GCTGTTCGCG TCTATTTTCT ATGCGCGTGT CTTATGTATT CCTCTAGTTA TGAGCTATCA CACACACTTG CCGAGTTACG GCAAAAATTA TCTTTCCTTC GTACCTGGTA TTGAGAACTT TTGTTGGGAA TTACTGCGCT GGGCACACGC CCGAGCGGAT TTGACCCTGG TCACAAGTCC GCAAATGCAG GAAGAACTGA CTCGCAACGG GATTCCTCGA GTGGACGTCT GGCGGAAAGG CATCGATACG GATCGATTTG ATCCCAAATT TCGGTCCACT TCCATGCGGG AGAAAATGAC TAGGGGCAAT GCTGACGACT TTTTGATGGT CTATGTGGGA CGGCTAGGAG CCGAAAAGCG TCTGAAAGAC ATCAAGCCCA TGCTGGAACG AATGCCCAAC GCGAGATTAT GCATAGTTGG TAAGGGACCA CAAGAAGAGG AACTTCATGA CTATTTCAAG GGTACCAACA CGGTGTTTAC TGGTCAATTG GATGGCGATG AGCTATCGTC CGCCTTTGCG TCGGCTGATG TGTTTGTCAT GCCTAGTGAT TCCGAAACCT TGGGTTTTGT CGTTTTGGAG AGTATGGCGT CCGGTGTTCC CGTAGTTGGG GCCGCCGCAG GCGGCATTCC CGATATTATT GATGACGGTA AGACGGGATT TTTAGTTCCA CCCGGAGATA TTGCTGGCTT CGTATCGCGC CTCGAGAGTC TAAGGAATGC GAAATTCCGC ACTCAAATGG CCAAGGCAGC CCGGAAAGAA ACTGAACGGT GGGGGTGGGA GGCCGCTACC TCGTACTTGC GAAACGTTCA ATACGAAAAG GCTCTTATCA ACTTTCATTC CCGAGCGTTT GGGGGATTCG GGCGTCCACG ATCCGGTACT ATGTGGAGAT TGCTTGGCTG GCGGATGCGC CGGGTGATAC ACAAGGTTGC CAAGCCTAAG GCCGTCTTTT CGTCCATTTG GAGAAAGCTA TCGTTCGGAC GCCAAGAAAA TAGCAACAGT GGCAAGCCAG CGGCGGCATG A
|
Protein sequence | MIVVKSALLI VCLVATACIG FVSCFQPYTS AIPPPCTSSV SSSSVRLEAA ARSSDAPPLR RNPPRKICLM VEPTPFTHVS GYANRFKEML KFMAKAGDEV DILTVDTKTP AHELPTACSG FSIKHTQGFT FPLYNQISLT FDLPEMKGAQ MLEKFKPDLI HVTSPGFMLF ASIFYARVLC IPLVMSYHTH LPSYGKNYLS FVPGIENFCW ELLRWAHARA DLTLVTSPQM QEELTRNGIP RVDVWRKGID TDRFDPKFRS TSMREKMTRG NADDFLMVYV GRLGAEKRLK DIKPMLERMP NARLCIVGKG PQEEELHDYF KGTNTVFTGQ LDGDELSSAF ASADVFVMPS DSETLGFVVL ESMASGVPVV GAAAGGIPDI IDDGKTGFLV PPGDIAGFVS RLESLRNAKF RTQMAKAARK ETERWGWEAA TSYLRNVQYE KALINFHSRA FGGFGRPRSG TMWRLLGWRM RRVIHKVAKP KAVFSSIWRK LSFGRQENSN SGKPAAA
|
| |