Gene PHATR_33116 details

Gene Information

Locus tag: PHATR_33116
Symbol:
ID: 7204251
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 81317
End bp: 85372
Gene Length: 4056 bp
Protein Length: 1057 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002185995
Protein GI: 219112823
COG category:
COG ID:
TIGRFAM ID:
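
The Gene Length field can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values listed in this record; the function name `gene_length` is illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates.

    Assumption: both endpoints are counted, so the length is end - start + 1.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information fields above.
print(gene_length(81317, 85372))  # 4056, matching the Gene Length field
```

The 1057 aa protein corresponds to far fewer than 4056 coding bases, which is what one would expect for a record marked as spliced.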


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.51252
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACTCGG ACGCACACAA ACCGCAGTCT GTGAGCGAAA AAAAGGGAAA CAAGGACCCG 
ATGCCGCAGC GGCAACGAAA AGCGCTTGGA TGCTGCCGGA GCGAGAACGC AATCCAAGGA
AGTGGTGTCA CTGTCACGTC CATCCCCCCC AATTGACCTT GACACCACGA GTTCGCACGC
TCCATACAAG TTTCCAGAGT CCAGTGGTGA CGCTGTGCCC TTTCCCGGCA CTCGATGCAG
TCGTGTGTGT GTGTGCGTAT TTACAGTCAA TACTACAAGC GTTCCCCCGC CATTCCCATA
TCCCCTAGCC CTACAGAAAC GCCAGTCCCG GTCCCGTTTC CCCAACGAGG ATTCTCCCTC
CTCGAAGAGC AGCTCAAGCC CCCTCTAGTT AGACATGAAT CTACCCACGC ACACGATCAG
TGCGATTCCA ATGGAATATG GTATGTCCGA TTCTCACAGT CAACGACAAA ACTCTGCGTC
GGCCTTTGCA GAAGACGAAA AGTACCCGGC GGGCGTTTCC CAGACATGTT GGTGGATGTG
AACGTCCCCA GAGGTACACA ATTCACGGTC AGTCCATAGT CTGTAGGTCC CACGCTCTGA
TGTCCTCTAC AAAGAACTGT GTACGGAATC TAATAGCAAT GATATTCATT TACATTGCTT
TTCTGTTATT ATCCCCCCAA AAAAGAACAC TGCAGTCCTG CCAAATTAAT TACACATTTA
GCATTCTCAA CGAAATACGC CATTTTGACA CGTATTCGGT CCTCTCGAAT TTGGCAACAA
ACAACATTCG CCGTGTACCA TAGGAAGAAG TGTACTGCTG CTCCCGACAT CCATCGTTGG
ATTAGGGCAC TTATCGATCG CTTCGAAAAC CTCTCTTGCA TTGCAGTAAG AAAAGAACGA
CTCATGGTGC ACGAAAGACT TTGGCGTTCG AAACGGACCG ATCCTCGGAA AACGGATATC
CGACTAGTGA TGACAACGTG CGCATTGCTA GCTTTTGGGA GTCTGTACAC GTATTACATG
AGTTTGTTGA ACAATTTCGA CTACTCCATT CTGTACAACG ATATCGGAGT TTCCGTGCCT
TTGCCTGTGC TACCCAACTT TACCTCTCTA TTGATCGCGT CGGGATCGGA CAAGTACAAT
AGGCATCACT ACGAGCGATA TTACGAGCGC TGGCTCGAAC CGTATCGCGA TGTGCCGGGC
GTGAAAGTAC TCGAAATTGG CGCTAATCAA GGACACTCCT TGAAGCTATG GGAAGACTAC
TTTGCGGATC CAGACATTAT TCTGGGATTG AAGTATGGGA ACGCCGCCAA CGGTATTGAG
AACAAGATTG TGAACCTCAC CAAAGTCTCC CTCTATACTG GTGACCAGTC CTCCAAGCCC
ACCATGGACT ACTTGAATGA GCGCGGACCT TGGCACATTA TTATTGATGA TGGCTCGCAT
GTCCCACAGC ACGTGATATA TTCGTTGGTG CATTTATGGG ACTCGGTGGC GCCAGGAGGC
ATGTATATTG TGGAAGACCT GGAAACAAGC TACTGGCGCA ACGGTTCCAA CGTCTACGAC
TATCCTCTTG CTAATACTGG AGTACTCGCG GACGCCAATC ATTCCGCAGC GGCCAAGATT
ATGCAACTCC AGCATATCTT GGTCCGCCAC CAAATTGGGG CTCGAGACAT GTCGATCTTT
CCCGGAGATG ACACCATTTG CTCAATCGAA TGGGGCATGA ACCTGCTGGC CATTCGTAAG
TGTGGGCTAC CAACGGATGG TGTGGGACCT AAGTATTTCA GAGAGAGATT TGACGCCTCT
GAAATGCGTA CCTGGCTCAA GCAATCCAGG TCCTCCAATC CGAAAGATTA ACGAATGTCT
GTGTCGTAAA GTGCATAGAG ATAAGAATAA TTGGACCTTG CAATCAATAC TGTAAGTATA
GGAATAAAGG GACAGGGGCT ACAGGACTAT GTATGGTCGA AAGAGCGTTT TGTCTCACAG
TCAGAGGAAG TTCAGGATGC CGAGCATTCG AAATGTCAAT AGTTTTACAA GTCGTCACAT
CAGACATCCG ATGGGTTATC GCTTGTAATA TCTAATTTTC GTCTCACTGT CAATCCCATA
TGTCATTCCT GGACATCGAC CGTCGATCAT GAACATAATA AGACTTCATT TTTTTCGTTT
GCTTTACAGT TCATCCCATG CCTTGCCTAC TTTCCTCACC ACAGTCTGCT CTAAAACGAG
ACGCCTTTTG AACAATTGCA CAGTAGATAA GCCAGTCAAA GCTATGCCGC GGCAAAACCT
AGTCGTTGAG ATTCAAGCGA TAGTTCCGGA GGAACCGAAA GGGTTTTTGG ACGACCCAGA
CGTGGACGCT GCTGCACCGG CAATGGATGC TGGCGCCCCG CAACACTATT CGCTTCCTCA
GCAGAACCGC ACCACCGACT CCCCGGTCGT TCTTCATACA ACCATGTTTG CGTCTACCGC
ATCCCTCTGC CTTTGCATGT TGACGCATAG TTTCTTACTG ATTTCGGTAT TTCCGTATTC
CGGTTTCATG GCCGTAGAAC TGATCGAATC GGTAGACGAA GAAACAGCCG GTGCCTACGC
TGGATTGCTC GCCTCGTGTT TCATGTGGGG TCGGGCGACG ACGGCGTACG GCTGGGGTCA
AGTGGCCGAC GTATACGGAC GTACCACGGT ACTGTATTGG TCTTTCGCAC TTTCGGGGAT
CCTCTCGATC GCCTTTGGAC TGTCACCAAC GTTTGGAAGT GCCTTGTTCC TCAGATTCGC
TCTCGGTTGT GCGAATGGGA TCATGGGAAG TATTAAGACG ATTGTTTCCG AGATATCGGC
GGGGAATGAG GCCTTGGAAA CTAAAACCAT GACAATGGTG ATTGGCATGT GGGGGTGGGG
CTTTCTCGTG TCGCCCGCCT TGTCGGGAAT ATTAGCAGAA CCAGTCAAAC AGTACCCCGG
CGTAGAATGG CTACAGCGCG AGGGAATATG GAACGCAGTG TTAGCCAAAC ATCCCTTCTT
GTTACCGAAT CTACTCGCGG CGATTTTTTG TTTGATAGGC GTCCTGGTAA TTCGAATGTT
TGTTCCCGAA ACCTTACCAT TCGGACAACG ACGCGACCCG CGACTCTTGC TATACGATAT
TGGAGCTTGG TGCCAACGGT CGGCCGGGTA TGCAAAAGTG CCGTTGAATG TGACGCGATA
CCAGCTTGTA CCGACCCTCA AAACGCACCC ATCCGACTTG GATCTGAGCA GCAGAAACAC
AAGGTTTTCA GTTTCTTGTC ACAACGCAAT CGACGAGGAT GATCTTGACG CTGTCCAAAC
ACTAGAGTCC AATGAACAAG TTGTATCCTT ATCGACAAAC ATTCCTGAAA AGGCGACCAT
TTTGTCCTTG CTCTCCCGCA AACCAACACG CACTTGCTTA CTCATATATT GGGCCTATTC
GTTTGTCGGT CTGACCGTAG ATGAATCGTT TCCACTCTTT TGTATTTCCA AACAGGCAGG
GTTTGGACTG TCTGAATATC AAATTGGCCA GATCTTGTCG CTTTGTGGAT TGTTCTTTGC
CGTCTCTCAG TACAGTGTTT ACACGACCAT CTACAACCGC TTTGGTCTGT ACGGATCGAT
ACGCTTTGGA AGCTGCTTTA GTGCACCCGT AATGTTCCTA ATGCCCTTAT CGGTACTGCT
GAATCGAGGC GCGCCAACCG GTCATCTCCG CACTTCCGCC TTGGTGTTTT TGTCCACTTG
CATGGCGGCC TACCGGGTGT TTGGGCTCGT GTTTTTCTCG AGCGTTTCCG TGACTATGAA
CCGAACCGTG CCTCGTTCGC ACCGGGCTAC CATGAACGGC TTATCCGTCT TGGGAGGGAG
CGTCGCAAAA GGCTTGGGGC CCATTTTTGC CGGCTTTCTC GTTTCGGGGT CCGTGGCGCT
TTGGGGAAGC CTGGGAGGAT TGCTCATTTT CGGCACCATT GGATTGATTG GATGTGCCGT
GGCCGCGACG ACTTTTTTCT ACCTTCAAGC CAGCGATTGT GAAGGTTCTA CTGACGATTT
AGAGCAGAGT GTGGTCGGGG ACACCGACCA AGTTAG
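
The gene sequence above is printed in space-separated groups of 10 bases, 60 per line, so the whitespace has to be removed before the sequence can be measured or compared with the GC content field. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as displayed in this record.
first_line = "ATGTACTCGG ACGCACACAA ACCGCAGTCT GTGAGCGAAA AAAAGGGAAA CAAGGACCCG"
seq = clean_sequence(first_line)
print(len(seq))                     # number of bases on this line
print(round(gc_fraction(seq), 3))   # GC fraction of this line only
```

Applying the same two functions to the full sequence block should give a length equal to the Gene Length field and a GC fraction comparable to the reported GC content, assuming that field was computed over the same span.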
 
Protein sequence
MYSDAHKPQS VSEKKGNKDP MPQRQRKALG CCRSENAIQG SGVTVTQYYK RSPAIPISPS 
PTETPVPVPF PQRGFSLLEE QLKPPLVRHE STHAHDQCDS NGICQRQNSA SAFAEDEKYP
AGVSQTLRKE RLMVHERLWR SKRTDPRKTD IRLVMTTCAL LAFGSLYTYY MSLLNNFDYS
ILYNDIGVSV PLPVLPNFTS LLIASGSDKY NRHHYERYYE RWLEPYRDVP GVKVLEIGAN
QGHSLKLWED YFADPDIILG LKYGNAANGI ENKIVNLTKV SLYTGDQSSK PTMDYLNERG
PWHIIIDDGS HVPQHVIYSL VHLWDSVAPG GMYIVEDLET SYWRNGSNVY DYPLANTGVL
ADANHSAAAK IMQLQHILVR HQIGARDMSI FPGDDTICSI EWGMNLLAIR PPIRKINECL
CRKVHRDKNN WTLQSILSSH ALPTFLTTVC SKTRRLLNNC TVDKPVKAMP RQNLVVEIQA
IVPEEPKGFL DDPDVDAAAP AMDAGAPQHY SLPQQNRTTD SPVVLHTTMF ASTASLCLCM
LTHSFLLISV FPYSGFMAVE LIESVDEETA GAYAGLLASC FMWGRATTAY GWGQVADVYG
RTTVLYWSFA LSGILSIAFG LSPTFGSALF LRFALGCANG IMGSIKTIVS EISAGNEALE
TKTMTMVIGM WGWGFLVSPA LSGILAEPVK QYPGVEWLQR EGIWNAVLAK HPFLLPNLLA
AIFCLIGVLV IRMFVPETLP FGQRRDPRLL LYDIGAWCQR SAGYAKVPLN VTRYQLVPTL
KTHPSDLDLS SRNTRFSVSC HNAIDEDDLD AVQTLESNEQ VVSLSTNIPE KATILSLLSR
KPTRTCLLIY WAYSFVGLTV DESFPLFCIS KQAGFGLSEY QIGQILSLCG LFFAVSQYSV
YTTIYNRFGL YGSIRFGSCF SAPVMFLMPL SVLLNRGAPT GHLRTSALVF LSTCMAAYRV
FGLVFFSSVS VTMNRTVPRS HRATMNGLSV LGGSVAKGLG PIFAGFLVSG SVALWGSLGG
LLIFGTIGLI GCAVAATTFF YLQASDCEEC GRGHRPS