Gene PHATRDRAFT_38979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38979 
Symbol 
ID7194694 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp21444 
End bp25049 
Gene Length3606 bp 
Protein Length1165 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183014 
Protein GI219125495 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCCTC TTGAACATGT TCTTGTGAAC CTTTTGGGAG CGACGACACC GGATTCGTCG 
TACCGTCGGT TCTTTGAAGA GTACGGTATT ACTCAGGCCA GCGAGTTGGC CTCAATCACC
GAGAATCGTC TTGCAACGGT GTCATATGGT GTTTTGACCC CTTCTGTGGG AGATACCCCT
GCCACCATTG TTCGTATGTT TCTTCCGCCT GCCCAGCAGG ATCGGATCTT GAAGATTGTC
AAATGGTTCC TCTCGAAAGG TACCGACGTG ACAAACGAAA CCTGGTTTGA ACTTACCCCT
GAAGTCCTTG AGTATTGGCA ACCAGCCTCT GCTATTGTTG CCCCTGCTAC CCCTGTTGGA
TCGGATGCTC GGAGTTCCTT TGTCGAAAGT GCTGCCGCAA AGTTTCGGAA GACAATCAAG
AATCACTCCG TTCCGTACCC AAAGTTCAGT GAAGACCGTT TTTGGGTCAC TTGGAATACG
AATATTCGTA TCAAGCTCCG TATCCATGGC GTCCAGTTGG TTCTTGACCC GGATTATTTG
CCTGAGACCG TCGACGAGAC GGATACATTT GTCGAAATGC AGAACTTTGT CTTTGGCGTG
TTCAACGATA TATTGTTGAC CCCTCGTGCG CGTGGAATCC TCCACAAGCA TGTGGATGAA
TTGGATGCTC AGGCTGTTTA CCGCGACCTT GTTGCCTCGT ACGGTAAAGG TATCAATGCG
CAGATCACTG CCACATCCAT TGAAACAAAG CTCACTTTGT ACTCATTTGC GACTTCAAAG
AGCAAGACCT GTGTTGCTTT TTTGACGACT TGGCGCAATT TTATTTACGA TCTTGAACGG
ATCAACGAGT TCCCCTTGCC GGATCACCAG AAGAGCGTAC GACTCAAGTC AGCTGTCCGT
TCCCATCCGC AATTGAAACT TTTCCTCGGA AATGTTCAGC TTTACTCTCG GACCCATGTG
GGTAAGAGTG CCGACGATTC CGATTTTGAG TATGTTTATG ATTTGATGCT CAAACATGCG
ACTGATATTG ATCAGACCGA TTTGGAAGAC CGCGGTAACA ACCGCGGTGG CCGCTCAGCA
AACAACGCGA AGTCTCAGTC TTCTTCCAAG TCTTCTTCCA AGAAGAAAAC TAACAAACTG
ATTGGTAAGA AGCACAAGAA TTATGTGCCT CCTGAAAAGT GGAATGCTCT CTCTCCCGAA
GAGAAGCGGA CCATTATGGA CCAACGAGGA CCTCGCCCTG CTCCAGCTCC TGCCCCTGCC
TTATCGGTGA ACGCCGCTGC CACTCAGCCT CCTCCTACGG TGTATGTCAG CGACTCGACG
GTTGTTGACA ACCAAAGCTC GCTTCGACTC ACGTCCCGCC TGCTGCCGGA CCTGGTCAAC
TGCTTCGTTC GCTCATTTCG AATTCAGCTG CCCGCCAGCA CCCTGCTCCA TCGAATGGAG
CCACGTCTGA CTCTTTTTCG GTCAATGGGA CCACCTATCG CCGCGAAGTG AACCGTGCTT
CTGTGCAGTA CCGCCTTTCC ACTCACGATG TTTCGTTGAA CAAGGACTCT TTGATCGATG
GTGGTGCCAA CGGTGGCCTT AGCGGCTCCG ACGTAACCGT TATTTCGCAA TCCCTGTTGG
AGGCCACAGT CTCTGGAATT GGAAATTCGG AATTGACCAA CCTCCGTTTG TCAATGGTGG
CCGGACTCAT TCACACGACG GATGGTCCCA TTATTGGTGT GTTTCACCAG TATGCTCACC
TTGGTACTGG CAATACCATT CATTCGTGCA ACCAAATGCG CTCCTGGGGA GTCACGGTTG
ACGACGTCCC TCGTACTTTT GGTGGCAAAC AGCGTATTGT CACGTCCGAT GGTCGTTTTG
TCATCTCGCT TTCGGTTTCT GGCGGACTCA CTTACTTGTC TATGCAGGCT CCTACCGAGG
AGGACCTGGA CACTTTCGAA TGGGTGCCTT TTACCGCTGA CAACGAGTGG GACCCAAATG
GGGTCTCTTC TCCTGCCGCT GCCGACGATG ACCTCAGTTT GCAGCTTCCT GCCGGCCATG
TCCCGTTCCG TGATGAACGC ATCAATAACT TTGGTCTCCT TGCACATTCC GCGGCTGTTA
GTCGATCCCC TTTGAATGCC GATGCTTTGC AACCCAATTT TGGATGGGTT CCCAGTGCTT
GTATCTCTCG CACGTTTGAG AATACCACTC AATTCGCTCG TGCCGATGCC CGTTTGCCCC
TGCGCAAACA CTTCAAGTCG CGTTTCCCTG CTGCCAATGT TTCTCGTTTG AACGAAATTG
TGGCAACCGA TACCTTTTTC TCGGATACCC CTGCGGCCGA TGACGGCATT TTTAACCATG
GTGGGGCTAC GATGGCCCAA CTTTTCGTTG GCAAAAGTTC GCAAATCACA TCTGTCTTCC
CGATGAAGCG TGAATCCCAA TTTGCCCATA CTTTCGAGGA CTTTATTTGT ACCCATGGCG
CTCCCGATGC CCTCCTCAGC GACAACGCTC GTGCTCAGAT CGGTCAGCAA GCACTTGAGA
TTTTGCGTAT GTATGCAATC GACGATATGC AGTGCGAGCC GCATCATCAA CACCAAAATT
ACGCGGAACG CCGCATTCAA GAGGTGAAAA AGATGGTGAA CACGATCATG GATTGTACAA
ACACTCCTCC GGAATATTGG TTGCTCTGCT TATTTTATGT GACCTATTTG CTCAATCGCC
TTGCTGTTGA AAGCTTGAAT TGGCGTACCC CGCTTCAGGT TGCCCATGGA CAGCGTCCTG
ATATTTCTGC TTTGCTCCTT TTCCGTTGGT TTGAAACCGT TTATTATTAC AATCCTGACC
ATGCGTCTTT CCCATCGGCT TCTCGCGAGA AAACTGGTCG TTGGATTGGT GTTGCTGAAC
ACAAAGGTGA TGCGCTGACT TATTGGATTT TAACCGACAA TACTCACCAA GCCATTGCTG
GTTCTGTTGT TCGTTCAGCC AATGTCGATA ATGGTTTGAA AAACCATCGT GCTGCGAATT
CCTCTCCCGA TGGTGGGGAG CCTTCGAATC CTAAGCCCAT TGTCTTGGCT ACGAGTGACC
TACGCCATGA TGCTACGGTC GATCCATCTT TTGAGAAATC CCCTGCATTC TCTCCTGACG
AATTGATCGG CAGGTATTTG ATCCGTGAAG CCCCTGACGG CCAGAGCCAT CGAGCCCTTG
TTGCTCGTAA AATTATTGAT GCCGACTCCG ATAACCATCA GACGATTTGC TTCTTGTTGC
AAATTGATGA AAAGGATGCT GACGAGATCA TTTCGTACAA TGAACTTTCC GATTTGATGG
AAGCCCAACA ATCAGAGCCC GCTACGAACG GAAATATCGA AGATCATTTC AAGTTTACTA
GTATTATTGG ACACCAAGGC CCTTTGCAAC CGACCGATGC TGGTTACAAG GGATCCTCTT
GGAATGTTTT GGTTCAATGG GAAGATGGTT CCCAGTCGTA CGAACCTCTA ATTGAAATGG
CTAAGGACGA TCCAGTCACA CTCGCGATGT ACGCGTCTGA CAACGATCTC CTTAACGTGC
CCGGGTGGCG CCGCTTCAAT CGTTTGCTTC GCAACCGTGA TGACTTCAAT CGATCTGTTT
CGTTAG
 
Protein sequence
MDPLEHVLVN LLGATTPDSS YRRFFEEYGI TQASELASIT ENRLATVSYG VLTPSVGDTP 
ATIVRMFLPP AQQDRILKIV KWFLSKGTDV TNETWFELTP EVLEYWQPAS AIVAPATPVG
SDARSSFVES AAAKFRKTIK NHSVPYPKFS EDRFWVTWNT NIRIKLRIHG VQLVLDPDYL
PETVDETDTF VEMQNFVFGV FNDILLTPRA RGILHKHVDE LDAQAVYRDL VASYGKGINA
QITATSIETK LTLYSFATSK SKTCVAFLTT WRNFIYDLER INEFPLPDHQ KSVRLKSAVR
SHPQLKLFLG NVQLYSRTHV GKSADDSDFE YVYDLMLKHA TDIDQTDLED RGNNRGGRSA
NNAKSQSSSK SSSKKKTNKL IGKKHKNYVP PEKWNALSPE EKRTIMDQRG PRPAPAPAPA
LSVNAAATQP PPTLASTHVP PAAGPGQLLR SLISNSAARQ HPAPSNGATS DSFSVNGTTY
RREVNRASVQ YRLSTHDVSL NKDSLIDGGA NGGLSGSDVT VISQSLLEAT VSGIGNSELT
NLRLSMVAGL IHTTDGPIIG VFHQYAHLGT GNTIHSCNQM RSWGVTVDDV PRTFGGKQRI
VTSDGRFVIS LSVSGGLTYL SMQAPTEEDL DTFEWVPFTA DNEWDPNGVS SPAAADDDLS
LQLPAGHVPF RDERINNFGL LAHSAAVSRS PLNADALQPN FGWVPSACIS RTFENTTQFA
RADARLPLRK HFKSRFPAAN VSRLNEIVAT DTFFSDTPAA DDGIFNHGGA TMAQLFVGKS
SQITSVFPMK RESQFAHTFE DFICTHGAPD ALLSDNARAQ IGQQALEILR MYAIDDMQCE
PHHQHQNYAE RRIQEVKKMV NTIMDCTNTP PEYWLLCLFY VTYLLNRLAV ESLNWRTPLQ
VAHGQRPDIS ALLLFRWFET VYYYNPDHAS FPSASREKTG RWIGVAEHKG DALTYWILTD
NTHQAIAGSV VRSANVDNGL KNHRAANSSP DGGEPSNPKP IVLATSDLRH DATVDPSFEK
SPAFSPDELI GRYLIREAPD GQSHRALVAR KIIDADSDNH QTICFLLQID EKDADEIISY
NELSDLMEAQ QSEPATNGNI EDHFKFTSII GHQGPLQPTD AGYKGSSWNS HSRCTRLTTI
SLTCPGGAAS IVCFATVMTS IDLFR