Gene PHATRDRAFT_54038 details


Gene Information

Locus tag: PHATRDRAFT_54038
Symbol: SAS1
ID: 7196720
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 2003996
End bp: 2006187
Gene Length: 2192 bp
Protein Length: 624 aa
Translation table:
GC content: 48%
IMG OID:
Product: regulator of chromatin silencing
Protein accession: XP_002177422
Protein GI: 219111341
COG category:
COG ID:
TIGRFAM ID:
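
The Gene Length value agrees with the Start bp and End bp values when both coordinates are counted inclusively (2006187 - 2003996 + 1 = 2192). The following is a minimal Python sketch of that check. The function name span_length is hypothetical, and the inclusive coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp and End bp fields of this record.
gene_length = span_length(2003996, 2006187)
print(gene_length)  # 2192, matching the Gene Length field
```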


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAATGGTTAC AGTAATCCAA GTTCTCCCTC GCCATTTCCA TTCCGCTGTC TACGAGCGAG 
GCTCGCAATT TGAACCATTG CAAACTCTCA TATTTACTTA CGAACGATCG GGCGATGGGA
AAACGACGCA GCACCGCAAG TAATGGTGAT AAAGGCCTTC ACAAAGGTCA GGCACCCGAA
AACGAAAAGC AGCAGCCGGC AAAGGCCGAC AGCGACGATG ATGCAATGTA CGGCAAGGTC
GATCGGTACC ATAACGAACG GGACGAAAAT TTTATGCGAC TCGACCAAGA ATCGAATGAG
AGTAGTGATG ATGACGAGAA AGAAGAAGCC GTCATGGATC TTGGCTTAGC GGGAGCTAGT
TCTGAGGAAA GTAACAGCGA TAGTGAATCC GGAGAGGAAG GCGATCGACA GATTGGCAAT
TCATCGGAGG AGGACGATTA TTTGGACTCC TCGGATGATG AAAATGACGA AGAGATACAA
ATTGAGGGTG TACGCGACTG GGGGAAGAAA AAATCAAACT ATTACAAGGG TGACACCGCC
GACTTGGAGA TTGGACAAGA CGAGGAGGAT GCGATTGTGG AAGAAGAGGC CGCGAAAGAA
CTACTAGCTG CTCGATATGA AGGCATGTCC GAGGACGATT TTGTGCTTCC TGACATTAGA
GAAGAGAAGC AAGCATCGCC AGGAGAGCTT TTGTCGGCTG CTCGGGATAT TTCAAGGCTA
TCGAGAAAGG AAAAGCAAAA ACTTCTCGAC AAGCAACACC CGGAGCTCCT TCCACTGCTG
TCTCACTTTT CTGAGATTGC GAACGATCTT GAAGAGCGTA CGATCGTCGG AACTCGCGCA
ATTTTTGATG GAGAGGATGA AACTGCTGCG GTAAGTGAAC CAAGGAAGTA CTCATCTCGC
ATCCATGATA GACTGGAACG GAAGCTCTCG TGCTGGTGAC GTTCTGGTAG AAAAGTGGTG
CAACAGGAAA CTTTTTGGGA CCATTTTAAT CCTAAGAGTT GGTCAACTCT ATGACTAAGT
CTCTCTTGGT TGGCGCCCAA CATCTTCACT CACAATGAAT ATTTATTACA CAGGCCGTTG
GATGTACCAA GTCGGGTCAA CAGTACCTTC TGGTAAAGTC AATGTTGCAG ACCACAGCGT
CACTAAATCT CGCTATGTAT TTGCTGTTGA AAAAAGAGCA ATCCTCTTTA GAAGACGTGG
ATTCTGGTCT GATTCAAAGC CACCCTATCA TGGCTCGTAT GCAAAAATGG AATTCTATGC
TACAAAAGCT GGAGGAACAA GTCGAAGATC GAGCAGATGG TCTGGAAACA CAGTTGAATA
ATCTCGTCAA AGCCGCTGCT TTAATGTCAG AAGGGCTAAG TGACGAAGAA ATTGAAGAAG
AGGAAGATGA AGACCACGCC GAGGATAGAG AAGGAAACGA TGAGAGTCTC CGCATTGATA
TACCGTCGCT GGATTCGACT GACGAAGACT CAGTCGACAC TGAAGAAGGG AGGCGCCATG
CCCTGAACGA AGCACGATTT GGGCTTCGGA CGAGCGAGAT TAAATCCGGA GCGTCTAGCA
GTCCTCAGCG CAAAGCTGTG GAGTCCGATC TAGGTGACGA ATTTGAAAAC GATGTTAATA
ATGCCGCCTC TAGGGCTCTG GCGTCTACCC TTAACTCGAT CGAACAGCGG TCGCTTTCGC
GCAAACGAAA AGCAGCTCCA AATGTGGAAG CTTTGGACGA ACCACAGCAA GATAATTCTG
AAATACGACA GGCATTGGAA ATGATGGAAA CCGAGCTTGG TAAAGACCCT GAAACAGAAT
ATGAATCCAA CGATGACGAT GCAACTGACC CGGAGGTTGA CGAGGACGAT GGGGAAGACG
AGTTATACAA AGAAGTTAAG AGAAAAAGCA AGTCAAAGAA AGAGCTCAAA AAGAGTTTGT
ACGCAGTGGC TCCTAAATAT CCTCGATTGG AGAAAGAGAT TGAGGGAGAG CGCGCTGTCA
GTCGCACCAT TCTCAAGAAT CGTGGCTTGG TTGCCCACAA GAATAAGCTC AATCGCAATC
CACGCGTCAA AAAACGAGAG CAGTATCGGA AACGACTTAT TCAGCGCAAG GGAACTGTTC
GGGAGGTCCG CACAGACGAA GGACACAAGT ATTCTGGAGA AGCGACCGGT ATCAAGAGTG
GTCTTACTCG TAGCCGCAAG TTGGCTCGTT AG
 
Protein sequence
MGKRRSTASN GDKGLHKGQA PENEKQQPAK ADSDDDAMYG KVDRYHNERD ENFMRLDQES 
NESSDDDEKE EAVMDLGLAG ASSEESNSDS ESGEEGDRQI GNSSEEDDYL DSSDDENDEE
IQIEGVRDWG KKKSNYYKGD TADLEIGQDE EDAIVEEEAA KELLAARYEG MSEDDFVLPD
IREEKQASPG ELLSAARDIS RLSRKEKQKL LDKQHPELLP LLSHFSEIAN DLEERTIVGT
RAIFDGEDET AAAVGCTKSG QQYLLVKSML QTTASLNLAM YLLLKKEQSS LEDVDSGLIQ
SHPIMARMQK WNSMLQKLEE QVEDRADGLE TQLNNLVKAA ALMSEGLSDE EIEEEEDEDH
AEDREGNDES LRIDIPSLDS TDEDSVDTEE GRRHALNEAR FGLRTSEIKS GASSSPQRKA
VESDLGDEFE NDVNNAASRA LASTLNSIEQ RSLSRKRKAA PNVEALDEPQ QDNSEIRQAL
EMMETELGKD PETEYESNDD DATDPEVDED DGEDELYKEV KRKSKSKKEL KKSLYAVAPK
YPRLEKEIEG ERAVSRTILK NRGLVAHKNK LNRNPRVKKR EQYRKRLIQR KGTVREVRTD
EGHKYSGEAT GIKSGLTRSR KLAR
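
Both sequences above are printed in blocks of 10 characters, 60 per line. The sketch below, in plain Python with no external libraries, shows one way such a listing could be joined into a single string and its length and GC fraction computed. The function names join_blocks and gc_fraction are hypothetical, and the example input is only the first line of the gene sequence, so its GC fraction is not expected to equal the GC content reported for the whole gene.

```python
def join_blocks(listing: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C characters in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "AAATGGTTAC AGTAATCCAA GTTCTCCCTC GCCATTTCCA TTCCGCTGTC TACGAGCGAG"
seq = join_blocks(first_line)
print(len(seq))                     # number of bases on that line
print(round(gc_fraction(seq), 2))   # GC fraction of that line only
```

Applying join_blocks to the full gene or protein listing would give a string whose length can be compared with the Gene Length and Protein Length fields.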