Gene PHATRDRAFT_42015 details


Gene Information

Locus tag: PHATRDRAFT_42015
Symbol: SCS-alpha
ID: 7201458
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 717119
End bp: 718204
Gene Length: 1086 bp
Protein Length: 309 aa
Translation table:
GC content: 54%
IMG OID:
Product: ligase succinate-coa ligase
Protein accession: XP_002180696
Protein GI: 219119890
COG category:
COG ID:
TIGRFAM ID:
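
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start/End values are 1-based inclusive coordinates and that the stop codon is counted as part of the coding sequence. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(717119, 718204)   # values from the record above
cds_len = coding_length(309)             # 309 aa plus an assumed stop codon
non_coding = gene_len - cds_len          # bases in the gene span that are not translated

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span comes to 1086 bp, matching the listed Gene Length, and 156 bp of it is not translated, which is consistent with the gene being flagged as spliced.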


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTCAA CCGCCAAAGT GTGGGTGGAT AAGAACAGTC GCGTGATCTG TCAAGGGTTC 
ACCGGAAAAC AGGTAGGTAC GGTATCTGAG ACGTCGGATC TGTTCAAACA TTTTCGTCGT
ACAGTTATTG CACAACCGAG ATTAGTGAAT CTAATTCGTG TCTGAAAATC AGCACACGGA
ATCAGTCCAG CAATAAGACT TACCAAGGCG CTTCTTCTAT CTCGGCAGGG AACCTTCCAT
TCGACGCAAG CCATTGATTA CGGCACCAAT ATGGTTGGTG GAGTCACGCC GAAAAAAGGC
GGGCAGGAGC ATCTCGGTCT ACCGGTATTT AACACGGTCC AAGAAGCCGT CGACGGGGTA
CAACCGGACG CCTCGGTTAT TTACGTGCCG CCCCCCTACG CGGCCCAAGC CATTCTGGAC
GCGATCGAGG CCGAAATCGG ACTCGTCGTC TGCATCACGG AGGGAATTCC TCAGCAAGAC
ATGGCGCGCG TCAAACACGC CTTGCGGCTA CAAGACAGGA CACGCTTAAT TGGACCCAAT
TGTCCCGGGA TTATTAAACC CGGTGAATGC AAGATTGGTA TCATGCCAGG GTACATTCAC
CGGCCTGGGA AAATCGGTGT AGTGTCGCGG TCGGGAACAC TAACGTACGA AGCAGTTTGG
CAAACTACGG TAACGGGACT GGGCCAGTCG ACATGTGTCG GTATTGGCGG GGACCCTTTC
AATGGAACCA ACTTTATTGA TTGCTTGGAA CGTTTCACCA ATGATCCCGA GACGGAAGGT
ATCATCATGA TCGGAGAAAT TGGTGGATCG GCCGAAGAAG AAGCCGCCGA ATGGCTCAAG
GAATACGGCG ACCCCAACAA ACCTGTGGTT GGCTTTATTG CTGGACTCAC AGCTCCTCCG
GGACGTCGTA TGGGACATGC TGGTGCCATT ATTGCTGGAG GAAAGGGCGG TGCGGAAGAA
AAGTTCCAGG CCCTGGAGTC TGCTGGAGTT CACGTGTCGC GCTCTCCAGC TCGTCTCGGA
GCCACCATGC TCGAAGCCAT GGGACTGGAA GCGCCGGAAA CTCCGCTTGA GCCGCAAAGT
GCTTGA
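
The gene sequence is displayed in blocks of ten bases, so the spacing and line breaks need to be removed before any analysis. The following is a minimal sketch, assuming the text has been copied into a string. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demo string is only the first line of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a demo input.
first_line = "ATGTCGTCAA CCGCCAAAGT GTGGGTGGAT AAGAACAGTC GCGTGATCTG TCAAGGGTTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full gene sequence, the cleaned length can be compared against the listed Gene Length of 1086 bp and the GC fraction against the listed 54%.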
 
Protein sequence
MSSTAKVWVD KNSRVICQGF TGKQGTFHST QAIDYGTNMV GGVTPKKGGQ EHLGLPVFNT 
VQEAVDGVQP DASVIYVPPP YAAQAILDAI EAEIGLVVCI TEGIPQQDMA RVKHALRLQD
RTRLIGPNCP GIIKPGECKI GIMPGYIHRP GKIGVVSRSG TLTYEAVWQT TVTGLGQSTC
VGIGGDPFNG TNFIDCLERF TNDPETEGII MIGEIGGSAE EEAAEWLKEY GDPNKPVVGF
IAGLTAPPGR RMGHAGAIIA GGKGGAEEKF QALESAGVHV SRSPARLGAT MLEAMGLEAP
ETPLEPQSA