Gene PHATRDRAFT_23306 details

Gene Information

Locus tag: PHATRDRAFT_23306
Symbol:
ID: 7195748
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011690
Strand:
Start bp: 547167
End bp: 550091
Gene Length: 2925 bp
Protein Length: 908 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002184157
Protein GI: 219127886
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.103886
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGATA TTGCCAGCGC TCCTTCCCTA TACAGTCGCC CCGCATTTTT CAGACAAACC 
CGGACCGGAA AGATTCTCAA AACCGTCAAC GAGCGATATT ATCGAGACGA TTTGGGCTTT
GGCTGTCACT TTGTCGATGA TGCAAGTGTC AAGCGACATA AACAGGTTGA CGAGGTTGTA
GGAAAACCCG AAACGATAGA ATCAGCATCA GAATTGCTCG CATTGCTTCA GCCCTGTCAA
CCCCATGCGT TAATAGTCTG TGATGCCAAT GTTTTGTTGC ACAATATAGA TGTTTTGGAA
CAAGCGGACT CTCTCATGCC GAATATTGTC ATTCCTCAGA CGGCTTTAAT GGAATGTCGG
GCCAATCGAA TGGTAGCTTA TGACCGCACG GTCGAGCTGT TGCGAGCAGT AGGGGGCGGG
AAGACCAAAA CTACCAAACG TTGCGGCATC TTCTTTGCTG ATCCGCATCA TGCTGCAACT
CAGCTCGAAC ATGATGAGAC CAAAATTGAG CGCAAGGGAA ATTCCATCAA CGATGAAAAC
GATGCCCGCA TTCGAAAAGT AGCCGACTAT TTTGGCACAG CCTTAAAGAA CACGGGTGTG
CGAGTGATCT TGCTGACCGA CGACGCCGGC TCTCGCACAC TCGCGGCAGA AGAATCCTCA
ACGTACCAAG CCAAGTCGGT GCGAGAATGG GTAAAAGAAT TAGAAAGGTT CAATCCAGGT
CTATCTCTAC TCGATCAAGT CGCACAATTT AATAATACTA GCCCCACGGG TGGTATCAAC
GAAAAGGATT ACTTTGAAGC TCATCTAGAA GCCAAGCTTT TGTTACGAGG AGTACAAGCT
GGAATGTACC ACCGGGGAGT GCTACGATCC GCAGGAAGCC ATTCCGCTAT GATTACGATT
AGACAAGGCG ATGAACGAGT AGCTGTGACG ATACCAAGCT TTACGGATCG GAATCGTGCC
GTCGACGGCG ATGTCGTTGC TGTGGCTTTG CATCCTTTGG ACAAGTGGAT TACTGCGAGC
GTCGATCTCA AAGCCAGTAA GGCTGAGGCA AACAGAGCTA TTGCGCCAGG TATCGCTAAT
GAAACAGCTG AACCAACTAT AAGTGAAATG AACAATGTCG CCGACACCTT TGCTTTGGAG
GATGACGCTG AATCACTGCG TCCCACTGGT AAGGTTGTTG GAATCATTCG GCGCAACTTT
TCCACTTACA GCGGCTCCAT TTACGCCATC AAAAGTGACT CTACAGAGCT GACGGATCGA
GAGCGGACTG CATCAGATTA TGAACGCGAG CATCCGGATG GGTCAATCAC CTGCGTATTC
TTTCCCGTCG ACAAGAAGAT TCCTCCCATT TTAATTCGGA CAACGCAACG GGATCGCCTT
TTCGGTCAAC GCATAGTTGT GGCTATGGAT TCCTGGCCCT CCACATCTAT TTACCCCCTA
GGACATTACG TACGGGTGAT TGGGCCAGCC GGATCTAAAG ACGTTGAAAC CGAAGTGCTG
CTTCAAGAGC ATGACATACC TCACGAACCT TTCCCTGCTG CTGTTCTTGC TTGCTTGCCA
CCTGAAGATT ACCGCATCGA TGTAGACAAT AGCCCCGGAC GCCAGGATAT CCGGCACATT
CCTGTTTTGT CAATCGATCC GCCCAATTGC AAAGATATCG ACGACGCACT ACACTGCACT
GTGCTACCCA ACGGAAACTT TCAGGTTGGT GTGCACATTG CGGACGGTAC GTTAGGGAGA
AGGATTGTCT ACAGTTAGCC CCTATGTTCG ATCTGACAAC ACACTATGGA TTCTTTTGCA
GTGACACACT ACGTCCAGGC AGGTACCGCG ATTGATCTAG AAGCAGCGAA TCGTTCGACG
TCGACGTATT TAGTAAATAA GCGACTCGAC ATGCTTCCCA GCCTCTTAAC AACAGACCTT
TGCAGTCTGA AAGGAAATGT AGATCGGTAC GCCTTTTCGG TACTGTGGGA GGTCACACCA
GAGGCCGAGA TTCTCAACGT TGAATTTCAA AAGACCATCA TCCACTCGAT TGCCGCCCTT
ACTTATCAGC AAGCGCAGAC AATGATTGAC CAACCCGACG ACCCGAACGA TATTCAGTCG
AATGCCGTGA AGCGCCTCGC ATCTCTTGCA CGTAAGTTTC GAAAACGTCG AATTGATGCT
GGGGCGTTGA CTTTGGCATC ACCAGAAGTT AAGTTCGTGT TGGACAGCGA GTCCTTGAAT
CCAACAGACG TCCAAGCGTA CGCACTGTTG GAAGCAAATG CTGTCGTAGA GGAATTCATG
CTACTGGCCA ACGTTACCGT ATCGAAAAAA ATTCTTCGGC ACTTTCCGAC TTTGTCAGTA
CTTCGACGGC ATCCTGCTCC TAACCGCGCT ATGTTTGATA GCCTTATCAG TAAAGCAAAG
AGCAAGGATT TGGATATCAA TATCGACGAC TCGAAGCGTC TAGCGGATTC GCTGGATGCC
GCTGTTGTAG AGTCTGACCC TTACGTGAAC AAACTTCTTC GTATTTTGTC GACCCGATGC
ATGAGCCCCG CGCAGTACTT TTGCTCGGGA GAGTTTCGCC CAATGGAGTG GCATCACTAC
GGTTTGGCGG CGCCTGTCTA TACACACTTT ACGTCCCCAA TTCGACGTTA CGCGGATGTT
TGCGTCCATC GATTGTTAGC TGCTGCTGTA GGGGTGGCCC CTTTACCACC TCACCTCTCA
TCGAAATCTT ACCTGCATGA TCTATGTGCC AACATGAATA GACGCCATCG TGCGGCGCAG
CTTGCAGGTC GAGCCAGTGT GCAGCTTCAT ACACTCATTT TCTTTGCCGG TGATGGGGCC
AAAGAAGAAC AAGCTTACAT ATTGGACGTA GAAACTGCAG AAGGAGTCGA GCCTTCCTTT
ACTGTGATTG TTCCTAGATA CGGAATCGAA GGGAGAGTGA AGCTA
 
Protein sequence
MGDIASAPSL YSRPAFFRQT RTGKILKTVN ERYYRDDLGF GCHFVDDASP CQPHALIVCD 
ANVLLHNIDV LEQADSLMPN IVIPQTALME CRANRMVAYD RTVELLRAVG GGKTKTTKRC
GIFFADPHHA ATQLEHDETK IERKGNSIND ENDARIRKVA DYFGTALKNT GVRVILLTDD
AGSRTLAAEE SSTYQAKSVR EWVKELERFN PGLSLLDQVA QFNNTSPTGG INEKDYFEAH
LEAKLLLRGV QAGMYHRGVL RSAGSHSAMI TIRQGDERVA VTIPSFTDRN RAVDGDVVAV
ALHPLDKWIT ASVDLKASKA EANRAIAPGI ANETAEPTIS EMNNVADTFA LEDDAESLRP
TGKVVGIIRR NFSTYSGSIY AIKNYEREHP DGSITCVFFP VDKKIPPILI RTTQRDRLFG
QRIVVAMDSW PSTSIYPLGH YVRVIGPAGS KDVETEVLLQ EHDIPHEPFP AAVLACLPPE
DYRIDVDNSP GRQDIRHIPV LSIDPPNCKD IDDALHCTVL PNGNFQVGVH IADVTHYVQA
GTAIDLEAAN RSTSTYLVNK RLDMLPSLLT TDLCSLKGNV DRYAFSVLWE VTPEAEILNV
EFQKTIIHSI AALTYQQAQT MIDQPDDPND IQSNAVKRLA SLARKFRKRR IDAGALTLAS
PEVKFVLDSE SLNPTDVQAY ALLEANAVVE EFMLLANVTV SKKILRHFPT LSVLRRHPAP
NRAMFDSLIS KAKSKDLDIN IDDSKRLADS LDAAVVESDP YVNKLLRILS TRCMSPAQYF
CSGEFRPMEW HHYGLAAPVY THFTSPIRRY ADVCVHRLLA AAVGVAPLPP HLSSKSYLHD
LCANMNRRHR AAQLAGRASV QLHTLIFFAG DGAKEEQAYI LDVETAEGVE PSFTVIVPRY
GIEGRVKL