Gene PHATRDRAFT_47314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47314 
SymbolSpc97 
ID7202386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp267568 
End bp271417 
Gene Length3850 bp 
Protein Length1274 aa 
Translation table 
GC content51% 
IMG OID 
ProductSpc97 
Protein accessionXP_002181520 
Protein GI219122372 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.477788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATCCT CCTTGCGCAA TCCGGATTCG GCGGTCGCCC GTGCCGCCAT GGCGCGGCGT 
CGGCAGCGAA TTCCGCACCA CGGGGCCGTT GCGACATCGT CCGGTACTAC CGATCAGCCC
CGCGATCCAA CTGCTAGACG AAACTCGGCC GTTGCGGAGA CTTCTCGCCA GGTTTACGAT
CACGAACGGC TACTAGAAAG AAGTTTTGAA GTCGCAGACG AAGCGGCGTC CGTGTTGGCG
TCGGTACGTC AACGTCGTCA GCGGGATGGC TCCACACGAA TGAATATCGA GCCCGAGTCG
GACTCCAACC TTTCGACGGC ATCACACGTA CCACAACAAC ACCAAGGGCG GACACCGACA
GTCAAGTTTG CAACGCAAAG CTCGTCTACG AGAACGAGCG TCCGTAATGT GCCATCCAGC
ATTACTCCAC TACTGTCTGC CGCCGTCACC GTTTCGAGTG CCCCTCCGCC CACATCCTCA
CGAACGCCCT CAACGGTGAC TAAAGTTTCC CCCGTACCTT TGCCTCCGCC ACCGTCAACC
TCCGTCGACG GAACGCGCCC GGAACGTGCC GTGTCTTCCC ACGGACCCAT GTCCCGTACC
GGATCCGCGC CATTGTCCAC TTCGTCCGCC AAAGCAACCT CGGCGGTACC TCGCACGTTT
GTTGCTTCGC GCAGCACGTC TCCCATGCCT CAGCAAACTT CCGACAGTGA GGAGGATAAC
GATCAAGTCT CGGGAGAAAC GAAGCTGCCG CCCACCGTGG AAGAGCTGGA ACCCGTCCTA
GTTGAAGATG ATGACGACGA AGGCCACAAA GGTTGGACCG CTAACGATAG TCATGCGGCC
GCAACGACCC ATTTACACTA CGGTGACCGT ATTCGTATAC TTTGCCACAC CGAAACACCC
TGGAGTATTG TACGGGAAAG CTCCCCCGAA AGCAAGGTAA CAAAGGCGAT AGCAGTCCCC
GTTACGTCAG CCACCATGGC CGCCTTTACG GTACAGCGAC CCCGCCCCAC AGCACAATAT
GATCCTTGCC TACGCTATGG AGATGTCGTA TCTTTGCACC ATACCGACGG ATCAATTATG
AGTGTCCATC GTCAGCAATC CGAGAAAGGC CATAACATTG CTTACGTGCC CGCTTGGATC
GATCCGGTGG AAACCGACGA CGAACGTATG GAAGTTTGGA AACTTTTGCG GGCGGTCCGG
AACGGTCTCG TGCGTGTCGG ACACTCTGCG ATAGAAAAGG TCACACCACG GTCTGGTCGG
ACGGCTCCAA TAAATTCGGG TGACGCCATT CTCTTTCGGC ACGAGAAAAC CGGCGGGATT
TTACGGTTGG ACGTGTCCGG TCACTTGGAA GTAGCCACGG ATTCATATGT TCCCAGCAAC
AGTAATATCC GCCAACGTCG AACTTTGCTG ACGCGATTGC AAACGCACGA TATCTTAGAA
ATATCCAAGC ACGAAACTTT TCACGTGATT ACCGGAACCG TTCCACCGAC ACCTCTGTGG
ACTAACGGTC CGGCAGATAC GCAACGTTCA TTCTCCAACG GCACTCATAT CCTGGATGCC
CAACGACACG CTTCTACACA AGTAGAGGAA GCACTGTTTG TTGATGTTGA TAAGAAACGC
AAGATTGAGC TGGCAGACCA CATCGGAATA TTGCAAGCGA ATCGGTCGAA ACAGATGGAC
ACCCCTTTGG GACAAGAAAT GGTTTTGATT GACGAGCTAC TAGGTTCCTT CGTGGGATTA
GAAGGAAATT CAATCCGCGT TCATGGGGCA AAATCCCTAG ACGACGACAG TTTTCGCTTC
GAAGTTGGTA CATGTACGAA TCTGGACGAA GGCTTACGGT CCATGGTGGA GAGTTTATTG
CCTCTATCGA CGGCATTTGT TCGCGTAAAT CACTTTGTAG TTAGCACGCT TCCACGATAC
GAAATGGGGC TCGTCATGCA CGCTTTCTGC GAAGCGCTGG ACGAGTTGTT GCAAAACTAC
GTTGCTTTGA CAGCAACCTT GGAACGCGAA TATCGAAGTC TCATAGACTA TTCGTTTGGG
ATCTCAAAAT TACAGACACA CATTTTGTCT GCACTACATA CAATGTCGGT GTTACACCAT
GCCGTAGAGA CTGTTCGTCA CAGCAAGGGC GGTGCTCTGA TAAATTCGCT TCAAGAATAT
AAAGACAATC GTTGTGACGG TGATCCAGCC GCCGAGTCTA TGATGCATAA CCTTGTGGAA
CGGGCAAGCA TACCATATAT GAACATGCTT CTGCAATGGC TCAACAACGG TATCCTTGAC
GATCCGTACA GCGAGTTCAT GGTAATGTGG GACAAAAGCA AGCTATGGAA TGAGCAGTAC
GCTGTCGTGG AAGCGCATGT TTTGAGCAGA CACTTTGGAT CACGACAATT GATCGAAAGA
ACGGTCTCTA CGGGACAATA TTGGAACGCT GTTCGCCGAT GCCAGGGTCA CGTTCAAGAA
TCTGCTGTGC TCGTGGATAC TCTTCCTTTG CGATATAGCG ACCCCATTGT TTCTTTTGCT
TCCAGCGTCC AGCTGCAGTA TCACAAAGCC TCCCGTGTTC TTGTTAGTCT ACTGCTGAAC
GAATACGATC TGCTTGGCTC TTTGCGTCTT ATGAAACGAT ACTTTCTGCT CGATCATGGT
GACTTCTTCG TGCACTTTCT TGACGTCGCA GAACGGGAAT TGCGTAAATC CCTTTCGAGT
GTATCACCTG GAAGGATACA GCATTGGCTG AAAATGTCGA TCCAGCTTTC GGAGAGTCAC
ACGGAGGATC AAGCAGCTAG CCCCTTCTTT CAAACACGAG GTAATCGATC TCTGAATGGA
AACTCGATAC GCTGTAATTT CGCCCCGGAA AGTCTTGCCG ATCAATTAGA CCAGCTGCAC
GCCGCCACAG GAGGTATCGA TACGCACGAA CCCAATACAC CTCAACGACA TGCGTATGGG
GCATCACTAG ACGAAGGGCT GACGGGTTTG GACGCATTCC TTATCGAGTT GAGCTTTGTT
CCATTCCCTG TCTCTGTGGT TCTTTCGCGT CGTGCGTTGA CCAGCTATCA GCTCCTCTTT
CGCCATTTGT TTCTCACGAA GCACGCCGAA CGTAGACTGG TAGGAATATG GAAAGATCAT
CAAACAATGA AAGAACTGCA ACAGATTCGT GGGTCTATGG GACCTGCTTT TCTTTTGAGA
CAACGAATGT TACATTTTTT GCAAAATTTG ATGTACTATA TGATGTTTGA AGTTATCGAG
CCGAACTGGT CCGAATTGGA AAATGAGATT GATTCACTCA AACAGCAACG GGAGTACACA
GTTGACGATA TGTTGCAGGT TCATTCTGAG TTTCTCCAGT CAACAATCCA AGCATGCCTA
CTGACCAGTC GCGAGTTAAT ACGAGCCTTG ACGAGACTTT TGAAAACGTG TCTGCTTTTT
AGCGATCAAA TGGGTCACTT CATTAGAGCG ACCCAGATTA ATGAAGACCG TGACGCTGTT
GCGACAGAGA AGCAAAAAGT AGTAGAAAGG AGCCTTAACG GCAGAGATCG CGTTGGAGTT
TCAATTTCTG AGAACAGACT CCGTCACACT CTTGAAGAAG CGCGTCGGGA ACGTGCCGAG
CGAGTGAATC GTCAAACCCT TCGTGTGAAA AGAGAAGTGG CGAATGAGTC GTATAGATGG
ATGATTGTCC GCTTCGAAGA GGTTTTCTCT GAGCACTTGA AAGAGTTCAT GGTCCGTCTA
TCGTCAGCAG ATGATTCATA TCCAACCAAT GCACAACTAG CAAATTTGTG CGTACGTCTT
GATTACAATG GTTACGTCTC GAAATCAATT GTCCGGTCTC CCTAAAGATC CACCCTTTGC
GCCAATTTCC
 
Protein sequence
MTSSLRNPDS AVARAAMARR RQRIPHHGAV ATSSGTTDQP RDPTARRNSA VAETSRQVYD 
HERLLERSFE VADEAASVLA SVRQRRQRDG STRMNIEPES DSNLSTASHV PQQHQGRTPT
VKFATQSSST RTSVRNVPSS ITPLLSAAVT VSSAPPPTSS RTPSTVTKVS PVPLPPPPST
SVDGTRPERA VSSHGPMSRT GSAPLSTSSA KATSAVPRTF VASRSTSPMP QQTSDSEEDN
DQVSGETKLP PTVEELEPVL VEDDDDEGHK GWTANDSHAA ATTHLHYGDR IRILCHTETP
WSIVRESSPE SKVTKAIAVP VTSATMAAFT VQRPRPTAQY DPCLRYGDVV SLHHTDGSIM
SVHRQQSEKG HNIAYVPAWI DPVETDDERM EVWKLLRAVR NGLVRVGHSA IEKVTPRSGR
TAPINSGDAI LFRHEKTGGI LRLDVSGHLE VATDSYVPSN SNIRQRRTLL TRLQTHDILE
ISKHETFHVI TGTVPPTPLW TNGPADTQRS FSNGTHILDA QRHASTQVEE ALFVDVDKKR
KIELADHIGI LQANRSKQMD TPLGQEMVLI DELLGSFVGL EGNSIRVHGA KSLDDDSFRF
EVGTCTNLDE GLRSMVESLL PLSTAFVRVN HFVVSTLPRY EMGLVMHAFC EALDELLQNY
VALTATLERE YRSLIDYSFG ISKLQTHILS ALHTMSVLHH AVETVRHSKG GALINSLQEY
KDNRCDGDPA AESMMHNLVE RASIPYMNML LQWLNNGILD DPYSEFMVMW DKSKLWNEQY
AVVEAHVLSR HFGSRQLIER TVSTGQYWNA VRRCQGHVQE SAVLVDTLPL RYSDPIVSFA
SSVQLQYHKA SRVLVSLLLN EYDLLGSLRL MKRYFLLDHG DFFVHFLDVA ERELRKSLSS
VSPGRIQHWL KMSIQLSESH TEDQAASPFF QTRGNRSLNG NSIRCNFAPE SLADQLDQLH
AATGGIDTHE PNTPQRHAYG ASLDEGLTGL DAFLIELSFV PFPVSVVLSR RALTSYQLLF
RHLFLTKHAE RRLVGIWKDH QTMKELQQIR GSMGPAFLLR QRMLHFLQNL MYYMMFEVIE
PNWSELENEI DSLKQQREYT VDDMLQVHSE FLQSTIQACL LTSRELIRAL TRLLKTCLLF
SDQMGHFIRA TQINEDRDAV ATEKQKVVER SLNGRDRVGV SISENRLRHT LEEARRERAE
RVNRQTLRVK REVANESYRW MIVRFEEVFS EHLKEFMVRL SSADDSYPTN AQLANLCVRL
DYNGYVSKSI VRSP