Gene PHATRDRAFT_55198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_55198 
Symbol 
ID7199249 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp50287 
End bp53582 
Gene Length3296 bp 
Protein Length894 aa 
Translation table 
GC content52% 
IMG OID 
ProductSTT3 subunit-like protein 
Protein accessionXP_002185420 
Protein GI219130538 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.835655 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCTTCGCAT CCGTATTGCC TTCGTCGTTG GTCACGCTTT TCTCTCGATC GATATCTCTT 
TCTCTCGATC GATACCTGCT TGTTTGCTTG TAGCGATTGT GAGGATTAAT TGGGCTTGTC
AGTGTTGGTC ATACTGGGGT TACACTGTCC GCACAGGATG TCCCCGTCGA CGATTTCCTC
CACGCGTCCG CCGGAACCCT CGTCCGACCT TTTGTCGTGG GTTTTGCTCG GAGGGACGGC
TTACGCCGTG TATGTGATCC TGCACACGGC GTATCGCATC CGTATGGGTG CCATTGACGA
TTTTGGCCTC GTCATTCACG AATTTGACCC CTGGGTACGT ACGGGTCAAT GGGCCTCGAG
TTGCGCTCCA GACATGGCGC GACGGACTGT CCAAGCGCTA TTACCGACCG GCTTTGTTAC
GGTGACGCTA ATCATGAATG GCACCGCATA GCTCACTCTC TTCTACTGCT TCTTTGCGTT
TTTGTGGTCG AACAGTTCAA TTACCGCGCG ACGGAATATT TGTACTACAA CGGCATCAAG
GACTTTTTCC AATGGTTCGA TTACATGAGT TGGTATCCAT TGGGACGTCC CGTTGGTACC
ACCATTTATC CCGGTATGCA GTTTACCGCC GTCGCTATTA AGCGGTACTT GCTGGATAGT
GTTATGAGTT TGAACGACAT TTGTTGTTAC ATACCCGTTT GGTTCGGCGT GATGGCCTCG
TTTGTTACCG GCTCCATCGC CTACGAAGTT TCTATTCCCC ACAATACCCA TTCCTCCCTG
CTCGGATTCA TTTGGGACAT TTACAAGGGT CAAAAGAGGA CGTACCAAGG TACCGACGTC
GATCGCAGCG GCTCGACACC CCTACTCTTG GGATTCTGGT CTCCCGCTAT TGGCTGCTCG
ATTTTCGCCA TGGCCATGAT GGCCATCGTT CCTGCCCATT TGATGCGCTC CATGGGCGGT
GGATACGACA ACGAAAGCAT CGCCGTATTT GCCATGGTAC TCACCTTTTA TTGCTGGGTA
CGCTCCCTGC GCAGTACTTC CGGTAGTGCC CACGCACAAA CCTACGTCGC ATGGAGCGTC
GCCACGGGGT TAGCTTACTT TTACATGGTC GCCGCCTGGG GAGGCTACGT CTTTGTTCTC
AACCTAATTG GCGTACACGC GGCCTTTCTC GTCCTCGCCG GACGCTTTTC TACCCAGACC
TACGTCGCGT ACACGTTGTT TTACAGTATC GGTACCGCCT TGGCCGTTCA AATTCCGGTG
GTGGGATGGG CACCACTCAA GTCACTCGAG CAACTCGGAC CGGGTGCCGT CTTTTTGGGA
TACCAGCTTT TGTATGTTTG TGAAGTCCTC CGGAAACGGC AAAATTTGAC CCGGGCACAG
GCGTGGAAAC TGCGTGTACA AATGTGTGCC ATCGCTGGGG CGCTTGTCAT GTTTGCTGCC
TTTTTTCTAG CCCCCAAGGG ATACTTTGGT CCCTTGTCAT CGCGGGTTCG TGGATTGTTC
GTTGCGCATA CCAAGACGGG CAATCCGCTG GTGGATTCTG TCGCTGAACA CCAGGCCGCT
TCGTCGCGGG CCTATTTTCA ATACTTGCAT CACGTTTGTT CCCTGGCACC CGTGGGTTAC
ATACTCGTGT TTTTCAACTT GAGCGACGCC AGTTCCTTTC TGATCGTTTG GGCGACGGCA
GCCTATTTCT TTTCCCACAA AATGGTTCGT CTCATTCTAC TGACGGCGCC CATTGGGTCG
ATTCTTGGTG GTATTACCGC TGGTCGTCTC TTTACCTGGT GCCTGCACCA GTGGTGGGAC
ATCGTGGATG ATGATGAGGC CAATAAAGGT GAGTCACCCG CCGTGGTGAC TAAAACAACC
AAGAAGGACG GCAAGGTCAC TATGAAAGCA ACCAAGGAGA AAAAGAAAAA GATTCCCAAA
ACTGTCGGCA AATCCGATAA GTATACTTCG GAAAGCTTTT CTTCGTTTGA AGGATTAGCC
GCCATTCAAG AAACAGCCAG ATTAGCTTTG AATACTACGG AAGGCATCCT TGTCCGTCGA
AGCGTGGCGC TTGTCCTTTT GCTCATCGGA TACTTTCTGG GTGGAAGCTT TAATAATTAC
AGTTGGCGGC TGAGTCAAGA CCTTTCCAAC CCGACCATCA TTATGCGAGC GCGCCTCCGT
GATGGTCAAC TAGTCATGAT TGACGATTAC CGTGAAGCCT ATTGGTGGTT GAAAGACAAC
ACTCCGGAAG ACTCCCGCAT CATGGCGTGG TGGGACTATG GTTACCAGAT TGCCGGTATT
GCGAATCGGA CAAGCATCGC GGATGGAAAT ACGTGGAATC ATGAGCACAT TGCGCTACTC
GGAAAAGCGT TGACGACCGG TGTCGAGGAG GGATACGAGA TTGCTCGTCA TTGGGCCGAT
TACGTTCTGT TGTGGACCGG CGGTGGCGGT GACGACTTGG CCAAATCTCC GCATTTGGCT
CGCATCGCCA ACTCGGTTTA TCGTGATCAC TGTCCCGACG ATCCTACCTG TCGGGCCTTT
GGCTTTGTGG ATCGCGAGGG AACCCCGTCG GCCATGATGA AGCGCAGCTT TTTGTTCAAT
CTTCACGGCC ATCAAATCAA GCCGGAAGCC AATGCACCCG CGGACAAGTT CCAGGAAGTT
TTCCGGTCCA AGTACGGCAA GGTGCGGATC TTCAAGATCC TCGGAGTCTC GCAGGAATCC
AAGGAATGGG TTGCCGATCC ATCCAATCGT ATCTGTGACG CTCCTGGTTC ATGGTTTTGT
CGCGGTCAGT ACCCACCGGG ATTGAGTCGA GTTCTGGAAG GCAAAAAAGA CTTTTCACAA
TTAGAAGATT TCAACCGGGG CGATCGGGAT GAAGAATACA CGCGTCGGTA TTTCGAAGAT
CTGAAGGACC CGGACAGCGC CCGGAGAAAG GCCATGGCCA AAGAAATTGA ACGCAACAAA
GAACAGGTCG ACGCCGAAGT ACAGGAGAAG AAACATGTCT CGGTTGACGA TATATATAAC
ACTTGGGAAA ACACCGACGA CACAACACGC ATGTGGAACT TAATCAATTC AAACGCCGTT
GAGGAATTAA AGGCATGGCT CGAAGCGGAG CCGCATAAAG CATATGTGAG GTCAGAGGAT
GGGAGAGGAC CTATGTGGTG GGCTTTTGAG AAGCGCAACG AGGATGTAAC CAAGCTTCTC
ATGAAGGCAG GAGTTCCTTA TACGGATCGT GACGGCAGCG GAAAAACCCC ATTGGATTTG
CAACAGGGAG GCTAGATTAA ATTTTGCTTC GTTCGCTTTA AAATCGATTT TACAAC
 
Protein sequence
MSPSTISSTR PPEPSSDLLS WVLLGGTAYA VYVILHTAYR IRMGAIDDFG LVIHEFDPWF 
NYRATEYLYY NGIKDFFQWF DYMSWYPLGR PVGTTIYPGM QFTAVAIKRY LLDSVMSLND
ICCYIPVWFG VMASGSTPLL LGFWSPAIGC SIFAMAMMAI VPAHLMRSMG GGYDNESIAV
FAMVLTFYCW VRSLRSTSGS AHAQTYVAWS VATGLAYFYM VAAWGGYVFV LNLIGVHAAF
LVLAGRFSTQ TYVAYTLFYS IGTALAVQIP VVGWAPLKSL EQLGPGAVFL GYQLLYVCEV
LRKRQNLTRA QAWKLRVQMC AIAGALVMFA AFFLAPKGYF GPLSSRVRGL FVAHTKTGNP
LVDSVAEHQA ASSRAYFQYL HHVCSLAPVG YILVFFNLSD ASSFLIVWAT AAYFFSHKMV
RLILLTAPIG SILGGITAGP TKEKKKKIPK TVGKSDKYTS ESFSSFEGLA AIQETARLAL
NTTEGILVRR SVALVLLLIG YFLGGSFNNY SWRLSQDLSN PTIIMRARLR DGQLVMIDDY
REAYWWLKDN TPEDSRIMAW WDYGYQIAGI ANRTSIADGN TWNHEHIALL GKALTTGVEE
GYEIARHWAD YVLLWTGGGG DDLAKSPHLA RIANSVYRDH CPDDPTCRAF GFVDREGTPS
AMMKRSFLFN LHGHQIKPEA NAPADKFQEV FRSKYGKVRI FKILGVSQES KEWVADPSNR
ICDAPGSWFC RGQYPPGLSR VLEGKKDFSQ LEDFNRGDRD EEYTRRYFED LKDPDSARRK
AMAKEIERNK EQVDAEVQEK KHVSVDDIYN TWENTDDTTR MWNLINSNAV EELKAWLEAE
PHKAYVRSED GRGPMWWAFE KRNEDVTKLL MKAGVPYTDR DGSGKTPLDL QQGG