Gene PHATRDRAFT_39223 details


Gene Information

Locus tag: PHATRDRAFT_39223
Symbol:
ID: 7194925
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 652780
End bp: 656377
Gene Length: 3598 bp
Protein Length: 1161 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002183133
Protein GI: 219125743
COG category:
COG ID:
TIGRFAM ID:
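
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record: the function names `span_length` and `coding_length` are hypothetical, and it assumes 1-based inclusive coordinates and a span that runs from the start codon through the stop codon with no untranslated region.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


# Values copied from the record above.
gene_len = span_length(652780, 656377)
cds_len = coding_length(1161)

# Assumption: the span holds only coding sequence and introns (no UTR),
# so the remainder is an estimate of total intron length.
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)
```

With the record's values this prints 3598, 3486 and 112. Under the stated assumptions, roughly 112 bp of the span would be intronic, which is consistent with the gene being flagged as spliced.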


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCCTC TTGAACATGT TCTTGTGAAC CTTTTGGGAG CGACGACACC GGATTCGTCG 
TACCGTCGGT TCTTTGAAGA GTACGGTATT ACTCAGACCA GCGAGTTGGC CTCAATCACC
GAAAATTGTC TTGCAACGGT GTCATATGGT GTTTTGACCC CTTCTGTGGG AGATACCCCT
GCCACCATTG TTCGTATGTT TCTTCCGCCT GCCCAGCAGG ATCGGATCTT GAAGATTGTC
AAATGGTTCC TCTCGAAAGG TACCAACGTG ACAAACGAAA CCTGGTTTGA ACTTACCCCT
GAAGTCCTTG AGTATTGGCA ACCAGCCTCT GCTATTGTTG CCCCTGCTAC TCCTGTTGGA
TCGGATGCTC GGAGTTCCTT TGTCAAAAGT GCTGCCGCAA AGTTTCGGAA GACAATCAAG
AATCACTCCG TTCCGTACCC AAAGTTCAGT GAAGACCGTT TTTGGGTCAC TTGGAATACG
AATATTCGTA TCAAGCTCCG TATCCATGGT GTCCAGTTGG TTCTTGACCC GGATTATTTG
CCCGAGACCG TCGACGAGAC GGATACATTT GTCGAAATGC AGAACTTTGT CTTTGGCGTG
TTCAACGATA TATTGTTGAC CCCTCGTGCG CGTGGAATCC TCCACAAGCA TGTGGATGAA
TTGGATGCTC AGGCTGTTTA CCGCGACCTT GTTGCCTCGT ACGGTAAAGG TATCAATGCG
CAGATCACTG CCACATCCAT TGAAACGAAG CTCACTTTGT ACTCATTTGC GACTTCAAAG
AGCAAGACCT GTGTTGCTTT TTTGACGACT TGGCGCAATT TGATTTACGA TCTTGAACGG
ATCAACGAGT TCCCCTTGCC GGATCACCAG AAGAGCGTAC GACTCAAGTC AGCTGTCCGT
TCCCATCCGC AATTGAAGCT TTTCCTCGGA AATGTTCAGC TTTACTCTCG TACCCATGTG
GGTAAGAGTG CCGACGATTC CGATTTTGAG TATGTTTATG ATTTGATGCT CGAACATGCG
ACTGATATTG ATCAGACCGA TTTGGAAGAC CGCGGTAACA ACCGCGGTGG ATGCTCAGCA
AACAATGCAA AGTCTCAGTC TTCTTCCAAG AAGAAAACTA ACAAACCAAT TGGTAAGAAG
CACAAGAATT ATGTGCCTCC TGAAAAGTGA AAAGCTCTCT CTCCCAAAGA GAAGCGGACC
ATTATGGACC AACGAGGACC TCGCCCTGCT CCAGCTCCTG CCCCTGCCTT ATCGGTGAAC
GCCGCTGCCA CTCAGCCTCC TCCTACGGTG TATGTCAGCG ACTCGACGGT TGTTGACAAC
CAAAGCCTCG CTTCGACTCA CGTCCCGCCT GCTGCCGGAC CTGGTCAACT GCTTCGTTCG
CTCATTTCGA ATTCAGCTGC CCGCCAGCAC CCTGCTCCAT CGAATGGAGC CACGTCTGAC
TCTTTTTCGG TCAATGGGAC CACCTATCGC AGCGAAGTGA ACCGTTCTTC TGTGCAGTAC
CGTCTTTCCA CTCACGATGT TTCGTTGAAC AAGGACTCTT TGATCGATGG TGGTGCGAAC
GGTGGCCTTA GCGGCTCCGA CGTAACCGTT ATTTCGCAAT CCCTGTTGGA GGCCACTGTC
TCTGGAATTG GAAATTCGGA ATTGACCAAC CTCCGTTTGT CAACGGTGGC CGGACTCATT
CACACGACGG ATGGTCCCAT TATTGGTGTG TTTCACCAGT ATGCTCACCT TGGTACTGGC
AATACCATTC ATTCGTGCAA CCAAATGCGC TCCTGGGGAG TCACGGTTGA CGACGTCCCT
CGTACTTTTG GTGGCAAACA GCGTATTGTC ACGTCCGATG GTCGTTTTGT CATCCCGCTT
TCCGTTTCTG GCGGACTCAC TTACTTGTCT ATGCAGGCCC CTACCGAGGA GGACCTGGAC
ACTTTCGAAT GGGTGCCTTT TACCGCTGAC AACGAGTGGG ACCCAAATGG TGTCTCTTCT
CCTGCCGCTG CCGACAATGA CCTCAGTTTG CAGCTTCCTG CCGGCCATGT CCCGTTCCGT
GATGAACGCA TCAATAACTT TGGTCTCCTT GCGCATTCCG CGGCTGTCAG TCGATCCCCT
TTGAATGCCG ATGCTTTGCA ACCCAATTTT GGATGGGTTC CCAGTGCTCG TATCTCTCGC
ACGTTCGAGA ATACCACACA ATTCGCTCGT GCCGATGTCC GTTTGCCCCT GCGCAAACAT
TTCAAGTCGC GTTTCCCTGC TGCCAATGTT TCTCGTTTGA ACGAAATTGT GGCAACTGAT
ACCTTTTTCT CGGATACCCC TGCGGCCGAT GACGGCATTC TTAACCATGG TGGGGCTACG
ATGGCCCAAC TTTTCGTTGG AAAAAGTTCG CAAATCACCT CTGTCTTCCC GATGAAGCGT
GAATCCCAGT TTGCCCATAC TTTCGAGGAC TTTATCCGTA CCCATGGTGC TCCCGATGCC
CTCCTCAGCG ACAATGCTCG TGCTCAGATC GGTCAGCAGG CACTTCAGAT TTTGCGTATG
TATGCAATCG ACGATATGCA GTGCGAGCCG CATCATCAAC ACCAAAATTA CGCGGAACGC
CGCATTCAAG AGGTGAAAAA GATGGTGAAC ACAATCATGG ATCGTACAAA CACCCCTCCG
GAATATTGGT TGCTCTGCTT ATTTTATGTG ACCTACTTGC TCAATCGCCT TGCTGTTGAA
AGCTTGAATT GGCGTACCCC GCTTCAAGTT GCCCATGGAC AGCGTCCTGA TATTTCTGCT
TTGCTCCTTT TCCGTTGGTT TGAACCCGTT TATTATTACG ACCCTGACCA TGCGTCTTTC
CCATCGGCTT CTCGCGAGAA AACTGGTCGT TGGATTGGTG TTGCTGAACA CAAAGGTGAT
GCGCTGACTT ACTGGATTTT AACCGACAAT ACTCACCAAG CCATTGCTCG TTCTGTTGTT
CGTTCAGCCA ACGTCGATAA TGGTTTGAAA AACCATCGTG CTGCGAATTC CTCTCCCGAT
GGTGGGGAGC CTTCGAATCC TAAGCCCATT GTCTTGGCTA CGAGTGACCT ACGCCATGAT
GCTACGGTCG ATCCATCTTT TGAGAAATCC CCTGCATTCT CTCCTGACGA ATTGATCGGC
AGGTATTTGA TCCGTGAAGC CCCTGACGGC CAGAGCCATC GAGCCCTTGT TGCTCGTAAA
ATTATTGATG CCGACTCCGA TAACCATCAG GCGATTTGCT TCTTGTTGCA AATTGATGAA
AAGGATGCTG ACGAGATCAT TTCGTACAAT GAACTTTCCG ATTTGATGGA AGCCCAACAA
TCAGAGCCCG CTACGAACGG AAATATCGAA GATCATTTCA AGTTTACTAG TATTATTGGA
CACCAAGGCC CTTTGCAACC GACCGATGCT GGTTACAAGG GATCCTCTTG GAATGTTTTG
GTTCAATGGG AAGATGGTTC CCAGTCGTAC GAACCTCTAA TTGAAATGGC TAAGGACGAT
CCAGTCACAC TCGCGATGTA CGCGTCTGAC AACGATCTCC TTAACGTGCC CGGGTGGCGC
CGCTTCAATC GTTTGCTTCG CAACCGTGAT GACTTCAATC GATCTGTTTC GTTAGTGA
 
Protein sequence
MDPLEHVLVN LLGATTPDSS YRRFFEEYGI TQTSELASIT ENCLATVSYG VLTPSVGDTP 
ATIVRMFLPP AQQDRILKIV KWFLSKGTNV TNETWFELTP EVLEYWQPAS AIVAPATPVG
SDARSSFVKS AAAKFRKTIK NHSVPYPKFS EDRFWVTWNT NIRIKLRIHG VQLVLDPDYL
PETVDETDTF VEMQNFVFGV FNDILLTPRA RGILHKHVDE LDAQAVYRDL VASYGKGINA
QITATSIETK LTLYSFATSK SKTCVAFLTT WRNLIYDLER INEFPLPDHQ KSVRLKSAVR
SHPQLKLFLG NVQLYSRTHV GKSADDSDFE YVYDLMLEHA TDIDQTDLED RGNNRGGCSA
NNAKSQSSSK KKTNKPIALS PKEKRTIMDQ RGPRPAPAPA PALSVNAAAT QPPPTVYVSD
STVVDNQSLA STHVPPAAGP GQLLRSLISN SAARQHPAPS NGATSDSFSV NGTTYRSEVN
RSSVQYRLST HDVSLNKDSL IDGGANGGLS GSDVTVISQS LLEATVSGIG NSELTNLRLS
TVAGLIHTTD GPIIGVFHQY AHLGTGNTIH SCNQMRSWGV TVDDVPRTFG GKQRIVTSDG
RFVIPLSVSG GLTYLSMQAP TEEDLDTFEW VPFTADNEWD PNGVSSPAAA DNDLSLQLPA
GHVPFRDERI NNFGLLAHSA AVSRSPLNAD ALQPNFGWVP SARISRTFEN TTQFARADVR
LPLRKHFKSR FPAANVSRLN EIVATDTFFS DTPAADDGIL NHGGATMAQL FVGKSSQITS
VFPMKRESQF AHTFEDFIRT HGAPDALLSD NARAQIGQQA LQILRMYAID DMQCEPHHQH
QNYAERRIQE VKKMVNTIMD RTNTPPEYWL LCLFYVTYLL NRLAVESLNW RTPLQVAHGQ
RPDISALLLF RWFEPVYYYD PDHASFPSAS REKTGRWIGV AEHKGDALTY WILTDNTHQA
IARSVVRSAN VDNGLKNHRA ANSSPDGGEP SNPKPIVLAT SDLRHDATVD PSFEKSPAFS
PDELIGRYLI REAPDGQSHR ALVARKIIDA DSDNHQAICF LLQIDEKDAD EIISYNELSD
LMEAQQSEPA TNGNIEDHFK FTSIIGHQGP LQPTDAGYKG SSWNVLVQWE DGSQSYEPLI
EMAKDDPVTL AMYASDNDLL N
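
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names `normalize` and `gc_fraction` are hypothetical, and the record does not say how its own GC value was calculated or rounded.

```python
def normalize(seq_text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the bases of a normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a small example.
first_line = "ATGGACCCTC TTGAACATGT TCTTGTGAAC CTTTTGGGAG CGACGACACC GGATTCGTCG"
bases = normalize(first_line)
print(len(bases), round(gc_fraction(bases), 2))
```

Passing the full gene sequence text to `normalize` in the same way should give a string whose length matches the Gene Length field, which is a quick check that no blocks were lost in copying.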