Gene PHATRDRAFT_32455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_32455 
Symbol 
ID7196611 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2515319 
End bp2518686 
Gene Length3368 bp 
Protein Length1107 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176994 
Protein GI219110485 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCCCG CCACCCGGCA AATGACGAGT GCAGCCGTCT ATGCCCACCT TTTGGACAAC 
GTACTTCTTC TTCCCCAAGG GCATCCTATC CGCCTCAGTT TTGAGCAACA AGGATATGAA
TCGGCTGATG ATCTTCTGTG TATTTTTGAG AATGAACTTG AGTCTCTTGG ATACACTCCT
TCTGTCCTTC CCGACGGCCT GGAAAACCCG CCAACTATAC CCCTTCTCAT GGCGCACCGA
CAGATCATAC GTCATTTCTT GCGCTGGCAG GCATCTTTGG AACGACAAAA GGGGACACCT
TTGAAGAACT CCGAGCTTGT TGCACTTAAC AATGAAGATT TTGTCCTTTA CCGTCGCTCA
GCCCTTGGTC AAGTCTCGAC AGCAACTGCA CCGGTTAATG CTTCCCCAAC TGTCCAGAGC
CCCATAGGAA AGACACGTTC GGCTGTCGAG GACTTCAAGC GTGGGATCAA ACGTGACAAA
ACTCACTATC CCGTGCTTAA AGATGATCGG TACTGGGACA ACTTTTATCG GTCGTTTGTT
GTTACTGCCG TAACACATAA CGTTGACAAA GTTCTAGATC CGACGTACAT CCCTACCGAT
CCCTTGGAGA AATCCCTTTT TGAAGAGCAG AACAAGTTTG TATATTCTGC TCTAGAGCAT
ACTCTCCAGA CGGACATGGG CAAGAACATT GTACGAGAGC ATAGTTTCGA CTTCAATGCC
CAGGAAGTTT TCCGTAAGGT TGTGAAACAC TACACAGAGT CCGCTAGCGC GAAGATTAGT
TCGTCTACTA CCCTGGGATA CCTTACAACT GCAAAGTACG GATCGTCATG GACTGGCACA
GCAGAAGGTT TTATTCTTCA CTGGAAAAAT CACTTGCGCA TCTACAATGA CACTGTTCCT
GCTGGTGAAC AGCTTCCTCA GCAACTATGC CTTAGTCTTT TGGAGAATGC TGTTCATGAT
GTACCTGAGC TTCGACAGGT AAAAATCACT GCAACTCTTG ACTTAGCAAA GGGAGGTAAT
CCTATTAGCT ATGGTGGTTA TCTCAGTCTA CTACTCGCAT CGGCATCGCT CTACGACAAC
GGCAATAATC TATCTAATTC TCGTAGTGGC AAGAACAAGC GCAACATCTA TGCTAATGAA
CTAGAGTACA ATCCGATGGA TTTTGAGAGT AAACCGGATG TAGACTATGA TATAGATGTG
TCGCCTACCG CAATCTACGA AGCCAATGCT CATGCCCGTA ACAGCAGTTT CCGGAATCGT
AGTCCGGCAA CTAATCGCGA GCGACCTTAC ATCCCTCGTG AAATGTGGAA CCTACTCTCC
GACGATGCCA AAGCCATCCT CCAAGGCTTA ATAGCCCCCG GGAAGCAGGC CCCGTTGAAT
AATAGTTCGC CACACCAATC GTTGCAGGCC AATACGCACG ATACCATTGG CGCGGAACAA
ATCACAACGG ACACCTTCCA TGATTGCGCA CCCGAAACTG AATTGCTTGC CCACCTGACT
GAGCGTGTTA GTCACATGAG CGACGGCGAC ATACGTAAGG TACTTGCCGC ATCTCGTGAT
GGTCCCGCCT ATGATGAGCC CACACCACTG CAATCTAACG TACTTCAATA TCAAGTGTCT
CGTCACAACG TCATTGAAAC TACGGCAGCC CTCGTCGACC GTGGAGCCAA TGGAGGTCTT
GCCGGCAGTG ATGTCATGGT CTTGCATAAA ACAGGTCGTT CTGCAACCAT CACAGGTATC
AATGATCATA CCTTGTCCGA TTTGGACATT GTCACCGCTG CTGGCTACAC TGAATCCCAA
AATGGCCCCA TCATTCTCAT TATGAACCAA TACGCCCATT TGGGACAGGG TAAAACTATC
CACTCCAGTG CACAGCTTGA ACACTATCGC AACCATGTCG AAGACCGTTC CCGTACTGTA
GGAGGTAACC AGCGAATTGT AACATTGGAT GACTACATCA TCCCATTGCA CATTCGACAA
GGACTCGCGT ACATGGATAT GCGGCGTCCT ACCGACAAGG AACTTGCGAC CCTTCCACAC
GTTGTCCTAA CCTCCGACGT CGACTGGGAT CCCTCCGTAC TTGACCACGA AATTGATCTC
GCAACCTCTT GGTATGATGA CAAATATGAT TTGCCTCAAT CACCTTACGT TGAACCACGT
TTTGACCATA CAGGCAAATA CCTCCATTGT CACATTTCCC TTTGCAACCA TCGCGATGAC
GTTGTTGACC GTGTATTATA TTGCCAACAG CACCTCGTCA CGAAAAATGT GCAAGATTAT
GAGGCCCTTC GTCCGTGTTT TGGATGGGTC TCTGCTGAAA CCGTTCGCAA GACCATCATG
GCGACCACGC AGCATGCACG CGAAGTATAT AACGCTCCGT TACGCAAACA TTTTAAGTCT
CGCTTTCCCG CTCTAAATGT ACACCGTCGT AATGAACCAG TTGCTACCGA TACCATTTGG
TCCGACACCC CTGCTGTCGA TAATGGTGCT AAATTTGCAC AACTTTTCGT TGGTCGACGC
TCCCTTGTCA CCGACGCTTA CCCCATGAAA ACTGACAAAG AATTCGTCAA TACCCTTGAG
GACCATATCC GTTACCGGGG TGCCATGGAC AAATTGATTA GCGATCGTGC CCAGGTTGAA
ATCAGCAAAA AGGTCACCGA TATTACACGC GCATATAATA TCGACCAGTG GCAAAGTGAA
CCAAACCATC AACACCAAAA CTTTGCCGAA CGTCGTATTG CCACTATCGA GGCTAATACC
AACAACATTC TCAATCTTTC CGGTGCCCCT GATTCCGCCT GGTTACTTTG CGTGACATAT
GTTTGTTATG TTTTCAACCA TTTGGCACAT GAATCCCTAG ATAACCGCAC TCCCCTTGAA
GTCCTCACCG GCTCCACGCC TGATATCAGT GTTCTCCTTC AGTTTCATTT TTGGGAACCG
GTCTATTATA AGCTCGAAAA TGCGACATTT CCTTCTGGTG GTACCGAACA ACAAGGACGT
TTTGTTGGCA TCGCCGACTC CGTCGGCGAC GCTCTCACTT ATAAGATCCT TACCCACACC
ACCAACCGCA TTCTTCATCG CTCTAGTGTC CGTTCTGCGA CCATTCCCGG ACAAACCAAC
CTACGCCTTA CGCCACAGGA TGGGGAGAGT GGTCCTAAAC CCATCAACTT TATCAAGTCG
CGTAGAACCG AAAACAAAAA TTCCTATGCC ATTAAGGAGT TGCCTGGTTT CACACCTGAT
GACCTTATAG GTCGTACGTT CCTCACCGAC ACTCGGGATG ATGGGGAGCG TTTGAAGGCA
CGAATCACGC GGAAAATATT GGACCCAGAC AAGCCCTCGG ATGTAAAGTT CCTTGTCGAA
ATCAATGA
 
Protein sequence
MVPATRQMTS AAVYAHLLDN VLLLPQGHPI RLSFEQQGYE SADDLLCIFE NELESLGYTP 
SVLPDGLENP PTIPLLMAHR QIIRHFLRWQ ASLERQKGTP LKNSELVALN NEDFVLYRRS
ALGQVSTATA PVNASPTVQS PIGKTRSAVE DFKRGIKRDK THYPVLKDDR YWDNFYRSFV
VTAVTHNVDK VLDPTYIPTD PLEKSLFEEQ NKFVYSALEH TLQTDMGKNI VREHSFDFNA
QEVFRKVVKH YTESASAKIS SSTTLGYLTT AKYGSSWTGT AEGFILHWKN HLRIYNDTVP
AGEQLPQQLC LSLLENAVHD VPELRQVKIT ATLDLAKGGN PISYGGYLSL LLASASLYDN
GNNLSNSRSG KNKRNIYANE LEYNPMDFES KPDVDYDIDV SPTAIYEANA HARNSSFRNR
SPATNRERPY IPREMWNLLS DDAKAILQGL IAPGKQAPLN NSSPHQSLQA NTHDTIGAEQ
ITTDTFHDCA PETELLAHLT ERVSHMSDGD IRKVLAASRD GPAYDEPTPL QSNVLQYQVS
RHNVIETTAA LVDRGANGGL AGSDVMVLHK TGRSATITGI NDHTLSDLDI VTAAGYTESQ
NGPIILIMNQ YAHLGQGKTI HSSAQLEHYR NHVEDRSRTV GGNQRIVTLD DYIIPLHIRQ
GLAYMDMRRP TDKELATLPH VVLTSDVDWD PSVLDHEIDL ATSWYDDKYD LPQSPYVEPR
FDHTGKYLHC HISLCNHRDD VVDRVLYCQQ HLVTKNVQDY EALRPCFGWV SAETVRKTIM
ATTQHAREVY NAPLRKHFKS RFPALNVHRR NEPVATDTIW SDTPAVDNGA KFAQLFVGRR
SLVTDAYPMK TDKEFVNTLE DHIRYRGAMD KLISDRAQVE ISKKVTDITR AYNIDQWQSE
PNHQHQNFAE RRIATIEANT NNILNLSGAP DSAWLLCVTY VCYVFNHLAH ESLDNRTPLE
VLTGSTPDIS VLLQFHFWEP VYYKLENATF PSGGTEQQGR FVGIADSVGD ALTYKILTHT
TNRILHRSSV RSATIPGQTN LRLTPQDGES GPKPINFIKS RRTENKNSYA IKELPGFTPD
DLIGRTNHAE NIGPRQALGC KVPCRNQ