Gene PHATR_21201 details


Gene Information

Locus tag: PHATR_21201
Symbol: SQD1
ID: 7204758
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011679
Strand:
Start bp: 857133
End bp: 858918
Gene Length: 1786 bp
Protein Length: 456 aa
Translation table:
GC content: 49%
IMG OID:
Product: UDP-sulfoquinovose synthase, plastid precursor
Protein accession: XP_002185968
Protein GI: 219121490
COG category:
COG ID:
TIGRFAM ID:
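
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is the convention that reproduces the listed gene length; the function name `inclusive_length` is hypothetical and chosen only for illustration.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = inclusive_length(857133, 858918)
print(gene_length)  # 1786, matching the "Gene Length" field

# 456 codons encode the protein; the gene span is longer than this,
# which is consistent with the record marking the gene as spliced.
protein_length_aa = 456
coding_bases = 3 * protein_length_aa
print(coding_bases)                # 1368
print(gene_length - coding_bases)  # 418 bases not accounted for by the codons
```

The 418-base difference is only a rough figure: this record does not list exon boundaries, so the sketch does not attempt to split it into introns, untranslated sequence, and the stop codon.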


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAACATCATA CCATCACGCT TGTCGTTGTG GTGGGGATCC TCGCTCGTCC TTGAGTCAAC 
CTGTGGTAGA TCGCATCTTT TTTGTTTCGC AAAATAGTTG TCAATCGTTG GCAAAGACAT
TATTTCGTCT ACTTTCACAC CATGAAGCTT TTGATTTTTC TTTCTTTAGT GACCTCTGCT
CATTCGTTTG TTCCAGTTCT GCATCTCTCG GCGCAGTCTA CACCTTCGCG GACGAAACTA
TTTGCCGAAG GTGCGAACGG AGATTCCTCA GCGGCGAGCA AAAAGAAGGT CATCGTCTTG
GGAGGAGACG GGTTCTGCGG TTGGCCAACT TCTCTGTACC TGTCGGATCA GGGGCATGAA
GTTGTGATCG TCGACAATCT CAGCCGACGC AACATTGACA TCGAGCTCGG ATGCGACTCG
CTGACTCCTA TCCGATCCCC CGAAGTGAGT AGTCTTGATG GCACAGCTAA AGACAGCTCG
AAGGAAGTAC CCCACTTCGA ATGAAAGGTT AAATTTCATA TGAATTTACA CCCATTCAAT
TGCCAAAGCA AACGCATGAT GATGATTTGC TTACCAAATT GCTTTCCTTT ACGAATTGCA
GGTACGTTGC CAAGCCTGGA AGGAAGTTAG TGGCAAGGAA ATCCGCTTCG TCAACTTGGA
CGTCGCCAAG GAATATGATC TTTTGGTGGA TCTCATCAAG CAAGAAAAGC CCGATTCCAT
CGTGCACTTT GCGGAGCAGC GTGCTGCTCC GTACAGTATG AAATCCGCCA AGACGAAGCG
ATACACTGTC GACAACAACG TTGGTGGCTC CAACAACCTT TGTTGCGCTG TTATCGACTC
AGAGGTCGAC GCCCATATCA TCCATTTGGG AACCATGGGC GTGTACGGGT ACGGAACTAG
CGGTGGAGAA ATTCCGGAAG GATACATCGA TGTCACTCTA CCCGGCGGCC GTGATGCCAA
CATTCTGCAC CCTGCCCACC CTGGAAGCGT CTACCATGCC ACCAAGTGCT TGGATGCCCT
TTTGTGGCAA TTTTACCAAA AAAACGACCA GCTACGTATT ACTGATTTGC ATCAGGGCAT
TGTCTGGGGT ACCAACACAC CACAGACAGC CCTGGACGAA CGTTTGATCA ATCGGTTTGA
CTATGATGGG GACTACGGTA CAGTTTTGAA CCGTTTCTTG ATGCAAGCCG CCATGGGCGT
TCCGTTGACC GTCTACGGAA CTGGCGGGCA AACTCGAGCC TTTATTCATA TTTCCGACAC
GGCTCGTTGC ATCGATTTAG CTATAAGCAA CCCGCCCACC GCTGGCGACC GCGTCGAAAT
CTTCAATCAG GTCGCCGAAA CCCGTCGTGT GCGGGATATT GCTGAGTTGG TCGCCAGTAT
GACCGATGTC GAAGTGAACT TTATCCCCAA TCCTCGTCAA GAAGCTGCGG AGAATGATTT
GGATGTGGCC AATCGTAAGT TTTGCAATCT TGGTTTGGAT CCCATTACTC TCGATACGGG
CCTCTTTGAT GAAGTCACCG AAATTGTCAA GAAGTACAAG TCACGCTGTG ATCCCACAAA
GATTCTGCCG GCGAGTTTCT GGAACAAAAA ACGCGCAGAA GAGTGCGCCA GCATCGATCC
CAAGAGCATT CAATTGAAGA AAGACGAAGC GGAAGTCGCC AAGGCGTAAG TCGACCACGA
ACTGGTGATG CCCCATTGGT TTATAGGTAT AATGCGCGAG AAGTTCGTTT AACGCTTGGC
CTCGGATCTG CTTGCTCCTA TCATAGTACA CATTATTGCT AGGTTC
 
Protein sequence
MKLLIFLSLV TSAHSFVPVL HLSAQSTPSR TKLFAEGANG DSSAASKKKV IVLGGDGFCG 
WPTSLYLSDQ GHEVVIVDNL SRRNIDIELG CDSLTPIRSP EVRCQAWKEV SGKEIRFVNL
DVAKEYDLLV DLIKQEKPDS IVHFAEQRAA PYSMKSAKTK RYTVDNNVGG SNNLCCAVID
SEVDAHIIHL GTMGVYGYGT SGGEIPEGYI DVTLPGGRDA NILHPAHPGS VYHATKCLDA
LLWQFYQKND QLRITDLHQG IVWGTNTPQT ALDERLINRF DYDGDYGTVL NRFLMQAAMG
VPLTVYGTGG QTRAFIHISD TARCIDLAIS NPPTAGDRVE IFNQVAETRR VRDIAELVAS
MTDVEVNFIP NPRQEAAEND LDVANRKFCN LGLDPITLDT GLFDEVTEIV KKYKSRCDPT
KILPASFWNK KRAEECASID PKSIQLKKDE AEVAKA
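
Both sequences are printed in blocks of 10 characters, 60 per line, so the whitespace has to be removed before counting residues or computing base composition. The following is a minimal sketch of that cleanup; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the final line of each sequence is pasted in to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    dna = dna.upper()
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Final lines of the protein and gene sequences shown above.
last_protein_line = "KILPASFWNK KRAEECASID PKSIQLKKDE AEVAKA"
last_gene_line = "CTCGGATCTG CTTGCTCCTA TCATAGTACA CATTATTGCT AGGTTC"

print(len(clean_sequence(last_protein_line)))  # 36
print(len(clean_sequence(last_gene_line)))     # 46
```

Seven full protein lines of 60 residues plus this 36-residue line give 456 aa, and 29 full gene lines of 60 bases plus this 46-base line give 1786 bp, in agreement with the length fields in the Gene Information section. Passing the complete cleaned gene sequence to `gc_fraction` would give a value to compare against the listed GC content.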