Gene PHATRDRAFT_45073 details


Gene Information

Locus tag: PHATRDRAFT_45073
Symbol:
ID: 7200161
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011674
Strand:
Start bp: 121314
End bp: 124769
Gene Length: 3456 bp
Protein Length: 1085 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002179365
Protein GI: 219117141
COG category:
COG ID:
TIGRFAM ID:
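
The start, end, and length fields above are related by simple arithmetic. The sketch below is illustrative only and assumes the coordinates are 1-based and inclusive, which is consistent with the listed gene length but is not stated in the record. The function names span_length and min_coding_length are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: three per residue, plus a stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
gene_len = span_length(121314, 124769)
cds_len = min_coding_length(1085)

print("gene length (bp):", gene_len)
print("coding bases incl. stop (bp):", cds_len)
print("remaining bases (bp):", gene_len - cds_len)
```

The gene span is longer than the bases needed to encode the protein. For a gene flagged as spliced this is expected, although the record does not say how the remaining bases divide between introns and flanking untranslated sequence.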


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TGGTGCCGCG CTCCGTTTCT TTGTAGTAAA ATACCGGCAA GAGCCCCTCG TGAGCATCTT 
TATTCACAGC ATGCAATCTT CCTTGCGCGC ACGACGTGGT GTGGCTCCCC GGAGAAAAGC
TGCCGTCGGT CGGTATCAAC TGGTCCTCGC TATTACGTTG TCCGTCGCCT TGACCGGTCT
CGTGACAACT CAATTTGTGC TGTCGACTTC GTTGTATTCC ACTCAAGAAA AGCAGCAACG
ACAGCAGCAC AAGAGCCACG CAAAGGGACC TAATCCATCC AATCTAAGGA CCGATTCTTT
CCCGAGAGAG CAGTTGCACG TAGTTGTGCC GGAAGAACAG AAACAATCAG AGCGGCGATT
GAACGAAAGC AGTCGGGAAC ACGAATCTCG AGAGAATGAC CATGAAGACG AGCAAAAGAC
CCGGCCCGAT TCAGAGAAGA AAGATGATCT TCTTACGCAG CAGGACCAAA AGACGACGAA
ACGAAACGAA CATGGTCCTG CTCAGGAAGG CATCCGCAAG CACAAGACTG ACCACGAGGC
TCCCGAAAAG GTGGAATCGC CAGTGGATGA GGACCATGAA GTCCAAAAGG CACACAGAGA
AACAGTGCAA AAATTCGTCG ACGACCGAAG AGTTCGCCTC AAGCACAGAA TCTCTCGACC
CAAAGTGGCC AGGGCTGCAT CACCGCCTGA GGTGGAACCG CAAATTGAGG TTGCTCCGCC
ACCCGAAAAG CGACCTTATG ACATTCTGGA TGATCCCTTG CAAAATCCAG ATTTCAACAA
ACCCTCGAAG CCGTTGAACT TCACGGCCGC TGTACCATAT TTGGGTGTAC TGATTGACGG
GGGGCGTCAT TTCTTTCCAA TGGACTGGAT GAAACGAGCC GTTGATCGCC TTTCTGATTT
GCGCTATAAT TTGATTCACT TGCGTCTTAC GGACGACCAA GCCTTCAACG TTCTATTGGA
TTCTCATCCC GAGCTCGCTT ATCCCGCCGC CGTCAACAAC CCACACCAGC AAGTTTGGAC
GGCCAGCGAA TTGCGTGACT TGACCGCTTA CGCGAAATCC AAAGGAGTAA GCATCATGCC
CGAAGTCAAC GTTCCCGGAC ACGCCGGTGC CTGGGCCGGT ATTCCTCACC TAGTCGTGCA
CTGTCCCGAA TTCATCTGCC AAAGAGGCTA CGGATTGCCA CTAAATGTAA CCCATCACGA
TCTCAAACCT ATCTTGACGA GTATCTTGAA GGAAGTCGTC GACATTTTTG ATGATCCGCC
CTTTCTACAT TTGGGTGGGG ATGAAGTCAA CGTACGTTTG TTTTCCGTGT GGAACTTTTC
TGTGCGTTGT CGGAAGGTAT AAGGCGTTTC TAAACTTTTT TTTCTCTACC AGATGGCCGG
CCCCTGTTTT AACGAAGTTC GCAGTCCCGT CTTTAATTAC ACGGCTTTCG AAGTTGTTCT
TAAAGAAATC ATTGCCGATG TAGGCTACCC CGAAAAGCAA GTGGTTCGTT GGGAAATGAC
CGGGCAGGCT AATTTGGAAC GCGCTGGCGG TGTGGAACAA TTTTGGGAGT CGTATCCGGG
AGAACGGCAC AAGGCTGCGG GACCTTTTTT CATTTCGAAC CGTTTGTATT TTGATACAAA
CCAGGATCAA AATGCGTACG AAGTTTGGCA GAATACCCGA CGGTTTTATG TAAATGATTA
CCAGCCCGAG GCAGTTCCAA CCGCCATTAT TGCCGGCACC TTTGAGTTGT CGACGACTTG
GTGGTACGAT CGCAATATTT TGGGACGTTT GTTGGCTGTT GCGTTGGGTG CCCGAAACGA
AACTCTACCA AAGACGATGA AGGACCAAGA CCATGAGAAA ATGGTGCTTG ATCAATACCA
AGTTTTCTGC GACCAGTTAG GATATAGCCA GGCAATTTGC GAAACCAACG GTGGCCCGAT
CATCCCTACC CCGGAGTACA AAAAGAAATG GGGTGACGGT TGGGTAGTTT GGAAGGCGCA
CATCTGTGAA CGCATGACGA CGACTGAGGT AACCAAGGCA ATGCGACCCC GCTCTAGCGA
CCGGGTCGCC ACGCAGGCGA ACAGCTACTT TTGGAACGTA TTTGGATTTC CTGCGCACAC
ACACACGCGA GTGGGCCAGC ATCCAACGTT GCCAGACGAT CTCCAGGCCC TCCAGCGACA
TTTAATTCCT CATTGTGGCG TTATGTTGGA TACGACCAGA TCCCTGGTTC CAGCGGATCG
GTTGGGAACG ATTTTGACCG ACACCGTTGC AAAATTGGGT TTCAACATAG CCCAGCTGCG
TTTGGTCAGC AACAAGGGCT TCACGTTTGC TCCGAGTAGT CTACCGCATA CTGTAGGCCA
TTCGTTACTA GCAACGAAGG AGATCAAGGT ATACACTAGG AGTGACTTGA TGGGTACCGT
TGCTAAAGCG AGTGCGGTGG GAATCCAAAT GATCCCCGAA ATCAGCATGA CGACAGGAAG
TGCTGGTTGG TACGAGTCGG GCTATCTAGC GAATTGTCCA AACCGTCTGT GTGAAATTGG
TGACGCGTCG ATTGACGTGA CGAACCCGTT CTTACCACCC ACCGTGTACT CGTTGATCTA
CGAGTTGCGT TCCATTTTCA GCAGCAGTCC CTATATTCAT CTCGGTTCGG ACGAGCGTCA
AGACGCGGCA GCTTGCTACC AAGAAGCCAA TCCCACGTTC CACGCGGACG TGGGAGCGTT
CGAGCGCAAA ATGGTCAAAG TCTTGGAGGC GAGCGGGATT GCGAACGATT CTGTACTGCG
GTACGCCAAT TCGCAAGGCG AGGTGTACAG CGACCGGACG GGTGGCGTCA CACACTACGG
TCCGGACCAT GCGACGGAGA TTCCTGCGGA CGCACCAATA TTTGTGAGTG TGGATTTGTT
GCGGGACGAT GGGTGGACAT TGTACCAGCG GGTGAAGGAA CTCGTATCGA AAAAGCCGTT
GGGCATCTTG GCGGAAATCC GTACGTTGAC GGCTCCCCGT TGGGAAGGCC TGGAGATTCC
GGAACGTTTG CTGGTGTACG CCATGGCCGT ATCGGAATTG CCCACGTACG CGAACGCGGC
GGCACTGGGC GAGCGGTACG GGGAGCTTTG CCGGGCGTTA TCGGATCGAT TGCCGGGATT
GGGTCGTCGG CACGATTGCG CGTTGCCGGG TGTCGTCTCG GGGAAGGTGA CGTTTTTGGC
CGACACGAGT ACCTTTGTGC AGCAGCAGTG TCAAATGGCG ACGTATCCCG TGACGCAACA
CCACGCCAAG TTGGTAGCAC CCCGGTACAA CGCGACCGAG TGGGAGCAAC TGCGCGGGGC
GCCCCGCGTG TTTCCGGCGG CGGGTCGGGA TCCCCAGCGG CATCACCCCG TCGTCATCGG
GCATGGAAAA TCGGGGGACG AGTCCCCGGT CGAGTCCGTA GCTAGCTAGC TAGCTAACTA
ACTAGGAGTA AACGATGAGT GCATGGATGA GAGAGA
 
Protein sequence
MQSSLRARRG VAPRRKAAVG RYQLVLAITL SVALTGLVTT QFVLSTSLYS TQEKQQRQQH 
KSHAKGPNPS NLRTDSFPRE QLHVVVPEEQ KQSERRLNES SREHESREND HEDEQKTRPD
SEKKDDLLTQ QDQKTTKRNE HGPAQEGIRK HKTDHEAPEK VESPVDEDHE VQKAHRETVQ
KFVDDRRVRL KHRISRPKVA RAASPPEVEP QIEVAPPPEK RPYDILDDPL QNPDFNKPSK
PLNFTAAVPY LGVLIDGGRH FFPMDWMKRA VDRLSDLRYN LIHLRLTDDQ AFNVLLDSHP
ELAYPAAVNN PHQQVWTASE LRDLTAYAKS KGVSIMPEVN VPGHAGAWAG IPHLVVHCPE
FICQRGYGLP LNVTHHDLKP ILTSILKEVV DIFDDPPFLH LGGDEVNMAG PCFNEVRSPV
FNYTAFEVVL KEIIADVGYP EKQVVRWEMT GQANLERAGG VEQFWESYPG ERHKAAGPFF
ISNRLYFDTN QDQNAYEVWQ NTRRFYVNDY QPEAVPTAII AGTFELSTTW WYDRNILGRL
LAVALGARNE TLPKTMKDQD HEKMVLDQYQ VFCDQLGYSQ AICETNGGPI IPTPEYKKKW
GDGWVVWKAH ICERMTTTEV TKAMRPRSSD RVATQANSYF WNVFGFPAHT HTRVGQHPTL
PDDLQALQRH LIPHCGVMLD TTRSLVPADR LGTILTDTVA KLGFNIAQLR LVSNKGFTFA
PSSLPHTVGH SLLATKEIKV YTRSDLMGTV AKASAVGIQM IPEISMTTGS AGWYESGYLA
NCPNRLCEIG DASIDVTNPF LPPTVYSLIY ELRSIFSSSP YIHLGSDERQ DAAACYQEAN
PTFHADVGAF ERKMVKVLEA SGIANDSVLR YANSQGEVYS DRTGGVTHYG PDHATEIPAD
APIFVSVDLL RDDGWTLYQR VKELVSKKPL GILAEIRTLT APRWEGLEIP ERLLVYAMAV
SELPTYANAA ALGERYGELC RALSDRLPGL GRRHDCALPG VVSGKVTFLA DTSTFVQQQC
QMATYPVTQH HAKLVAPRYN ATEWEQLRGA PRVFPAAGRD PQRHHPVVIG HGKSGDESPV
ESVAS
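
The sequences above are laid out in space-separated blocks of ten characters, six blocks per line. To check them against the Gene Length, Protein Length, and GC content fields, the whitespace has to be stripped first. The sketch below shows one way to do that. The helper names clean_sequence and gc_percent are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces, newlines) from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, as formatted on this page.
example = "TGGTGCCGCG CTCCGTTTCT \n"
seq = clean_sequence(example)

print("length:", len(seq))
print("GC %:", round(gc_percent(seq), 1))
```

Passing the full gene sequence text through clean_sequence and comparing len(seq) with the listed 3456 bp is a quick way to confirm that no lines were lost when copying the record.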