Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_21201 |
Symbol | SQD1 |
ID | 7204758 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 857133 |
End bp | 858918 |
Gene Length | 1786 bp |
Protein Length | 456 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | UDP-sulfoquinovose synthase, plastid precursor |
Protein accession | XP_002185968 |
Protein GI | 219121490 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAACATCATA CCATCACGCT TGTCGTTGTG GTGGGGATCC TCGCTCGTCC TTGAGTCAAC CTGTGGTAGA TCGCATCTTT TTTGTTTCGC AAAATAGTTG TCAATCGTTG GCAAAGACAT TATTTCGTCT ACTTTCACAC CATGAAGCTT TTGATTTTTC TTTCTTTAGT GACCTCTGCT CATTCGTTTG TTCCAGTTCT GCATCTCTCG GCGCAGTCTA CACCTTCGCG GACGAAACTA TTTGCCGAAG GTGCGAACGG AGATTCCTCA GCGGCGAGCA AAAAGAAGGT CATCGTCTTG GGAGGAGACG GGTTCTGCGG TTGGCCAACT TCTCTGTACC TGTCGGATCA GGGGCATGAA GTTGTGATCG TCGACAATCT CAGCCGACGC AACATTGACA TCGAGCTCGG ATGCGACTCG CTGACTCCTA TCCGATCCCC CGAAGTGAGT AGTCTTGATG GCACAGCTAA AGACAGCTCG AAGGAAGTAC CCCACTTCGA ATGAAAGGTT AAATTTCATA TGAATTTACA CCCATTCAAT TGCCAAAGCA AACGCATGAT GATGATTTGC TTACCAAATT GCTTTCCTTT ACGAATTGCA GGTACGTTGC CAAGCCTGGA AGGAAGTTAG TGGCAAGGAA ATCCGCTTCG TCAACTTGGA CGTCGCCAAG GAATATGATC TTTTGGTGGA TCTCATCAAG CAAGAAAAGC CCGATTCCAT CGTGCACTTT GCGGAGCAGC GTGCTGCTCC GTACAGTATG AAATCCGCCA AGACGAAGCG ATACACTGTC GACAACAACG TTGGTGGCTC CAACAACCTT TGTTGCGCTG TTATCGACTC AGAGGTCGAC GCCCATATCA TCCATTTGGG AACCATGGGC GTGTACGGGT ACGGAACTAG CGGTGGAGAA ATTCCGGAAG GATACATCGA TGTCACTCTA CCCGGCGGCC GTGATGCCAA CATTCTGCAC CCTGCCCACC CTGGAAGCGT CTACCATGCC ACCAAGTGCT TGGATGCCCT TTTGTGGCAA TTTTACCAAA AAAACGACCA GCTACGTATT ACTGATTTGC ATCAGGGCAT TGTCTGGGGT ACCAACACAC CACAGACAGC CCTGGACGAA CGTTTGATCA ATCGGTTTGA CTATGATGGG GACTACGGTA CAGTTTTGAA CCGTTTCTTG ATGCAAGCCG CCATGGGCGT TCCGTTGACC GTCTACGGAA CTGGCGGGCA AACTCGAGCC TTTATTCATA TTTCCGACAC GGCTCGTTGC ATCGATTTAG CTATAAGCAA CCCGCCCACC GCTGGCGACC GCGTCGAAAT CTTCAATCAG GTCGCCGAAA CCCGTCGTGT GCGGGATATT GCTGAGTTGG TCGCCAGTAT GACCGATGTC GAAGTGAACT TTATCCCCAA TCCTCGTCAA GAAGCTGCGG AGAATGATTT GGATGTGGCC AATCGTAAGT TTTGCAATCT TGGTTTGGAT CCCATTACTC TCGATACGGG CCTCTTTGAT GAAGTCACCG AAATTGTCAA GAAGTACAAG TCACGCTGTG ATCCCACAAA GATTCTGCCG GCGAGTTTCT GGAACAAAAA ACGCGCAGAA GAGTGCGCCA GCATCGATCC CAAGAGCATT CAATTGAAGA AAGACGAAGC GGAAGTCGCC AAGGCGTAAG TCGACCACGA ACTGGTGATG CCCCATTGGT TTATAGGTAT AATGCGCGAG AAGTTCGTTT AACGCTTGGC CTCGGATCTG CTTGCTCCTA TCATAGTACA CATTATTGCT AGGTTC
|
Protein sequence | MKLLIFLSLV TSAHSFVPVL HLSAQSTPSR TKLFAEGANG DSSAASKKKV IVLGGDGFCG WPTSLYLSDQ GHEVVIVDNL SRRNIDIELG CDSLTPIRSP EVRCQAWKEV SGKEIRFVNL DVAKEYDLLV DLIKQEKPDS IVHFAEQRAA PYSMKSAKTK RYTVDNNVGG SNNLCCAVID SEVDAHIIHL GTMGVYGYGT SGGEIPEGYI DVTLPGGRDA NILHPAHPGS VYHATKCLDA LLWQFYQKND QLRITDLHQG IVWGTNTPQT ALDERLINRF DYDGDYGTVL NRFLMQAAMG VPLTVYGTGG QTRAFIHISD TARCIDLAIS NPPTAGDRVE IFNQVAETRR VRDIAELVAS MTDVEVNFIP NPRQEAAEND LDVANRKFCN LGLDPITLDT GLFDEVTEIV KKYKSRCDPT KILPASFWNK KRAEECASID PKSIQLKKDE AEVAKA
|
| |