Gene PHATRDRAFT_47730 details

Gene Information

Locus tag: PHATRDRAFT_47730
Symbol:
ID: 7202908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 664210
End bp: 667744
Gene Length: 3535 bp
Protein Length: 1085 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002181948
Protein GI: 219123265
COG category:
COG ID:
TIGRFAM ID:
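
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are inclusive, which is consistent with the stated gene length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 664210
end_bp = 667744
protein_length_aa = 1085

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A protein of N residues needs N codons plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)                  # 3535
print(cds_length_bp)                   # 3258
print(gene_length_bp - cds_length_bp)  # 277
```

Under these assumptions the listed gene span is 277 bp longer than the coding region needed for a 1085 aa protein, which suggests that the gene sequence extends beyond the stop codon.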


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGATAT CCAGTAAGAC GATTCTGGCA ATGCTCGGTG CTTCGTTAGG GTGGACGACG 
GCCTTGGTCG TGGACGTACC TCACCGTCCC TGGTCGACGA GCTCCCGGAC GCGAAGTACC
AGTGCGATCG GCAACGGCAA CGGCGACATG GACTACGATT CGTCGGCGAG GGCGGGAGCG
GACGGATACT CGGTACTGCG ACAACCTGCC TCCCGTTCCA ATTGGGACCC CAACTTGGAT
CCGGAGTTCG AGGTTCCTCT TTCCTTGGAT CAAGCCCAAT CGTCCTTTCA AACACAAGAC
GACTATTGGT GGAACAATGA AGTTCAAAAG GCCAAACCGA AACAATCAAT CCAACGATCG
TCGTCAGTCA CCGCTTCCTC TGTCACGACT GCGAGCGATC GAGAGCCTGC CAACAATGCC
ATTGCCAAAC CCGAGGATCT CGATTTGTTT CAACGATCTC TCGACACGCT CGATTATCCC
CGTGTCTTAC AGGCCCTGGA GGAGCAATGC ACGACGGTTC CAGCGCGACT CATGGTTCGG
CAAGCATCCC ACGATTTGAC CACCAATGGT ACCAGTACAA TCCAGTCGAA GAAAAGTAAA
CGCATACCCA AAGGCTCCGA ACGTGCCTTC CAACCGCTGA CGGCCGACAC GGTACTGGGC
ACACAAGAAC GATACCGAGC AGTGCAAGAA CTGGAATGGA TCCTCCAAGG TGGATCGGGA
CAGATTAATT TGGCCGACTA CAGTTACCGC AATCGCAAAA GCTACAAGGA AACCCTGGCG
GGCAAACCAC CACCGCTCGG CGGGAACGCG TTCGATTTAC TGGCCATTCT CGCCGTGGCT
GAACAGGGCA AAGTACTCGA AGGGGAAGAA ATATTTGACG TGTCCCAAAT GCTTGATCGT
ATGCAAGATG TACGGTTGTG GAGCGACGAC GGTCTCCTGA ACGTGAATCG ACTGCAACAG
GACATTGAAT TTGTTGAATT GCCCAAACTA GCGTCCTGCA TCCAAGTCAA TACGACTCTC
CAAGATTTGC TGCACAACGC CTTTGACAAG GACGATCGAT TGAGCGGCAC GACATTTCCA
GTACTGGGTC GTTTGCGTGC CCGGGTACGA TCCTTAAAAG CCGACATTAT GGGAACGCTC
GATAGCTTGT TGGCCTTGCC CTCCATCAAG AACAAACTGG CGTTGGAAAG CGGCGGTCCG
ATCTATTCCG AAGTCAACGG TGGTCGCCTA GTACTGCCCG TGGCACAAAA GTATGCCTCC
TCCGTAGGCA TCGTCCACGA TACGTCTCGC TCGGGCAAAA CCGTATACGT GGAACCGACG
GAGCTCGTGG GACCAACCAA CGAATTGCGA CAAGCCGAAG GTGAACTGCG GGCCGAAGAA
GCCCGTGTGT GGCGGTCCTT GACGGAGCAA ATATTGAAAA ATCAAATCGT GTTGGAAACC
TCGGTTCGAG CGATCGGACA GCTGGATCTT GTCATGGCCC GACTCTTGCT GGGACGCAAA
CTGTCCGGCA CCATTCCTGT TGTACAAGAC GAAGGAGTTA TTCAACTGCG TAACGCCAAG
CATCCCGTAT TGTTACTGCG GCAAGTCAAG AATGTGGTCG GTAGTGACGT GGATCTGGGG
GCCGACGGCA ACCAAGGTTT GGTGTTGACG GGGCCCAACT CGGGTGGAAA AACGGTGATT
CTCAAACTGC TCGGTCTCAT GGCACTCATG TCTCGCGGTG GTATACCCGT GCCGGCCGAT
CGGCCACGGG TCGCAGTCGG AGCCAAGTCC TACGGCGACG AGTACGATAG CAACAACGAC
GAATTCCAAC CCCGCATTGA CTTTTTCAAT CCTGTCCTTG CCGATATTGG TGACATTCAA
AGCGTCGGCG GCGACCTGTC AACCTTTTCC GGCCACATGC TCGTTTGCCG TGAAGTCCTG
GCCAATTCGG GCCGCAACGC TCTGGTTCTC ATGGATGAGC TGGGGAGCGG CACAGATCCG
GCTCAGGGTG TTGCGATTGC GCAGGCTTTG CTGGAAGCTA TTTTGGAGAC GGGCGCTCGC
GTGGCCATTA CGACGCATTA TATGCAATTG AAGCAGCTGG CCGCGTCCGA CGACCGTTTT
TCCGTCGCGG GGATGCAGTT TGTCCAGGGT CGGCCCACGT ACAAGCTGCT TCCCGGCACC
GTGGGTGAAT CGTTCGCCTT GGCCGTCGCG GAACGCCTCA ACCTGCCCCA AAGTGTCATT
GACCGAGCGG AGGCCTTGAT GGACTCAGAA ACCAGACAAT TGGGCGACTT GATTCGCGAA
CTGGAAGACC AAAAGGGTTT GGTAGATCAG CAAGTGTTGG AGCTGGAGGA GAAACGCCAA
GAAATCGGCA AGATGCGGTT TGAACTGAAG GAACAAGGAC TCCGACTCGA AAAGAAGCAG
CTTACGGTAC GGCGCGAAGA AGCACGCAAG TTTGCGAAAA AGCTGGAAGA AAAGGAACAA
GTATTGGAAA ATGTTCTAGA GAAACTCAAA GCGGATCCCA CCCGTCGGGT CTTGGCCAAG
AGCTGGGACG ATATCAAGTT CGTCAAACGA GACGCCTTGA ACGAAGCTGA GAATATTCCC
AGCATCGTTG CACGTAAAAA GAAAGCCAAC GCCGTGCTCG CAGCGGAACA AGGCGAGCTG
ATTCCCATTG CGGAACTTCG CGAGCGCCCC GAGCTCAAGG AGGGTGACAA GGTAATTGTT
TGCAAACAGG GTCCCGTCTT TGGCCGAGAA GCAACGATCG TTAAGTCTCT CGGTAGTCGA
GTAGAAGTGT TGGTGAACAA TATGAATGTA GGCCTCAAAC TGACACAAGT CGCGCTGCCC
ACCGCATCCT TTCGATCCAC CTCGGGTCCC GCAAACACAT GGGGCGACGG CCGCCTGTCC
ATTGGCCGAG CGGCGGAACG AGCGCTGGCA ACGGAACGCT GTGCGGGACC ATCAACGTCA
TCATCGTCGT CCTCCGATAC CGTTGCCGTG TCGGCTCCGT CTAAATCCCG AGGAGTCACG
ATGCGCACCA CATCCAACAC TGTCGACGTG CGCGGTTGCA ATTTGGAAGA AGCCAAGGAC
CGCATCCGGT CCGCGTTCAG CGCGAGCTTA CTGGCGGGCC GATCCGTGGT TTACGTACTG
CACGGCCACG GAACGGGTGG GGTTTTAAAA AGCAAACTGC GGCAGTGGTT GCCCAAGGAG
AAGACACTGG TGGATTCCTT CCAAGGAGCC GATGCGGCGG ACGGTGGCGA CGCCTTTACC
CGCGTGCAGT TGCGGTAGTC CGTGCATAGC GCCGCCAGCT GGGACTATCT CGCCGTCCAT
GAACCGCTTG GTCGACACTC TGCTAGGCAA CGACTAACAC GAGTTGAAGT TTGTGGTGAT
TGATATAGTT ATTCTGCGCA CAGAAAGCAT TGTTGGTCCC GGAACGGGCC CGACGGCTGG
TACAGGAGCG CCTCGAGCAG CACGGTGTGG CAGGCACCAT CCGTTGTCGC CATGGGTGCG
GTATCGCGAC CAACCGACAA ACTGGCTTAC TGTAAATCAT GGAAATATGC AGAGC
 
Protein sequence
MGISSKTILA MLGASLGWTT ALVVDVPHRP WSTSSRTRST SAIGNGNGDM DYDSSARAGA 
DGYSVLRQPA SRSNWDPNLD PEFEVPLSLD QAQSSFQTQD DYWWNNEVQK AKPKQSIQRS
SSVTASSVTT ASDREPANNA IAKPEDLDLF QRSLDTLDYP RVLQALEEQC TTVPARLMVR
QASHDLTTNG TSTIQSKKSK RIPKGSERAF QPLTADTVLG TQERYRAVQE LEWILQGGSG
QINLADYSYR NRKSYKETLA GKPPPLGGNA FDLLAILAVA EQGKVLEGEE IFDVSQMLDR
MQDVRLWSDD GLLNVNRLQQ DIEFVELPKL ASCIQVNTTL QDLLHNAFDK DDRLSGTTFP
VLGRLRARVR SLKADIMGTL DSLLALPSIK NKLALESGGP IYSEVNGGRL VLPVAQKYAS
SVGIVHDTSR SGKTVYVEPT ELVGPTNELR QAEGELRAEE ARVWRSLTEQ ILKNQIVLET
SVRAIGQLDL VMARLLLGRK LSGTIPVVQD EGVIQLRNAK HPVLLLRQVK NVVGSDVDLG
ADGNQGLVLT GPNSGGKTVI LKLLGLMALM SRGGIPVPAD RPRVAVGAKS YGDEYDSNND
EFQPRIDFFN PVLADIGDIQ SVGGDLSTFS GHMLVCREVL ANSGRNALVL MDELGSGTDP
AQGVAIAQAL LEAILETGAR VAITTHYMQL KQLAASDDRF SVAGMQFVQG RPTYKLLPGT
VGESFALAVA ERLNLPQSVI DRAEALMDSE TRQLGDLIRE LEDQKGLVDQ QVLELEEKRQ
EIGKMRFELK EQGLRLEKKQ LTVRREEARK FAKKLEEKEQ VLENVLEKLK ADPTRRVLAK
SWDDIKFVKR DALNEAENIP SIVARKKKAN AVLAAEQGEL IPIAELRERP ELKEGDKVIV
CKQGPVFGRE ATIVKSLGSR VEVLVNNMNV GLKLTQVALP TASFRSTSGP ANTWGDGRLS
IGRAAERALA TERCAGPSTS SSSSSDTVAV SAPSKSRGVT MRTTSNTVDV RGCNLEEAKD
RIRSAFSASL LAGRSVVYVL HGHGTGGVLK SKLRQWLPKE KTLVDSFQGA DAADGGDAFT
RVQLR
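
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The record leaves the translation table field blank, so the sketch below assumes the standard genetic code. The `translate` helper and `CODON_TABLE` are illustrative names, not part of the record. The two test strings are copied from the first gene-sequence line and from the start of the line that contains the end of the coding region.

```python
from itertools import product

# Assumption: standard genetic code (the record's "Translation table" is blank).
# Amino acids are listed in the conventional TCAG codon order; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing used above
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGGGATAT CCAGTAAGAC GATTCTGGCA"))  # MGISSKTILA

# Bases 3241-3260, which include the last five codons and the stop codon.
print(translate("CGCGTGCAGT TGCGGTAGTC"))  # RVQLR
```

Both results match the corresponding ends of the protein sequence listed above, and the second one stops at a TAG codon, with further bases following it in the gene sequence.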