Gene Information |
Locus tag | PHATRDRAFT_47730 |
Symbol | |
ID | 7202908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 664210 |
End bp | 667744 |
Gene Length | 3535 bp |
Protein Length | 1085 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181948 |
Protein GI | 219123265 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATAT CCAGTAAGAC GATTCTGGCA ATGCTCGGTG CTTCGTTAGG GTGGACGACG GCCTTGGTCG TGGACGTACC TCACCGTCCC TGGTCGACGA GCTCCCGGAC GCGAAGTACC AGTGCGATCG GCAACGGCAA CGGCGACATG GACTACGATT CGTCGGCGAG GGCGGGAGCG GACGGATACT CGGTACTGCG ACAACCTGCC TCCCGTTCCA ATTGGGACCC CAACTTGGAT CCGGAGTTCG AGGTTCCTCT TTCCTTGGAT CAAGCCCAAT CGTCCTTTCA AACACAAGAC GACTATTGGT GGAACAATGA AGTTCAAAAG GCCAAACCGA AACAATCAAT CCAACGATCG TCGTCAGTCA CCGCTTCCTC TGTCACGACT GCGAGCGATC GAGAGCCTGC CAACAATGCC ATTGCCAAAC CCGAGGATCT CGATTTGTTT CAACGATCTC TCGACACGCT CGATTATCCC CGTGTCTTAC AGGCCCTGGA GGAGCAATGC ACGACGGTTC CAGCGCGACT CATGGTTCGG CAAGCATCCC ACGATTTGAC CACCAATGGT ACCAGTACAA TCCAGTCGAA GAAAAGTAAA CGCATACCCA AAGGCTCCGA ACGTGCCTTC CAACCGCTGA CGGCCGACAC GGTACTGGGC ACACAAGAAC GATACCGAGC AGTGCAAGAA CTGGAATGGA TCCTCCAAGG TGGATCGGGA CAGATTAATT TGGCCGACTA CAGTTACCGC AATCGCAAAA GCTACAAGGA AACCCTGGCG GGCAAACCAC CACCGCTCGG CGGGAACGCG TTCGATTTAC TGGCCATTCT CGCCGTGGCT GAACAGGGCA AAGTACTCGA AGGGGAAGAA ATATTTGACG TGTCCCAAAT GCTTGATCGT ATGCAAGATG TACGGTTGTG GAGCGACGAC GGTCTCCTGA ACGTGAATCG ACTGCAACAG GACATTGAAT TTGTTGAATT GCCCAAACTA GCGTCCTGCA TCCAAGTCAA TACGACTCTC CAAGATTTGC TGCACAACGC CTTTGACAAG GACGATCGAT TGAGCGGCAC GACATTTCCA GTACTGGGTC GTTTGCGTGC CCGGGTACGA TCCTTAAAAG CCGACATTAT GGGAACGCTC GATAGCTTGT TGGCCTTGCC CTCCATCAAG AACAAACTGG CGTTGGAAAG CGGCGGTCCG ATCTATTCCG AAGTCAACGG TGGTCGCCTA GTACTGCCCG TGGCACAAAA GTATGCCTCC TCCGTAGGCA TCGTCCACGA TACGTCTCGC TCGGGCAAAA CCGTATACGT GGAACCGACG GAGCTCGTGG GACCAACCAA CGAATTGCGA CAAGCCGAAG GTGAACTGCG GGCCGAAGAA GCCCGTGTGT GGCGGTCCTT GACGGAGCAA ATATTGAAAA ATCAAATCGT GTTGGAAACC TCGGTTCGAG CGATCGGACA GCTGGATCTT GTCATGGCCC GACTCTTGCT GGGACGCAAA CTGTCCGGCA CCATTCCTGT TGTACAAGAC GAAGGAGTTA TTCAACTGCG TAACGCCAAG CATCCCGTAT TGTTACTGCG GCAAGTCAAG AATGTGGTCG GTAGTGACGT GGATCTGGGG GCCGACGGCA ACCAAGGTTT GGTGTTGACG GGGCCCAACT CGGGTGGAAA AACGGTGATT CTCAAACTGC TCGGTCTCAT GGCACTCATG TCTCGCGGTG GTATACCCGT GCCGGCCGAT CGGCCACGGG TCGCAGTCGG AGCCAAGTCC TACGGCGACG AGTACGATAG CAACAACGAC 
GAATTCCAAC CCCGCATTGA CTTTTTCAAT CCTGTCCTTG CCGATATTGG TGACATTCAA AGCGTCGGCG GCGACCTGTC AACCTTTTCC GGCCACATGC TCGTTTGCCG TGAAGTCCTG GCCAATTCGG GCCGCAACGC TCTGGTTCTC ATGGATGAGC TGGGGAGCGG CACAGATCCG GCTCAGGGTG TTGCGATTGC GCAGGCTTTG CTGGAAGCTA TTTTGGAGAC GGGCGCTCGC GTGGCCATTA CGACGCATTA TATGCAATTG AAGCAGCTGG CCGCGTCCGA CGACCGTTTT TCCGTCGCGG GGATGCAGTT TGTCCAGGGT CGGCCCACGT ACAAGCTGCT TCCCGGCACC GTGGGTGAAT CGTTCGCCTT GGCCGTCGCG GAACGCCTCA ACCTGCCCCA AAGTGTCATT GACCGAGCGG AGGCCTTGAT GGACTCAGAA ACCAGACAAT TGGGCGACTT GATTCGCGAA CTGGAAGACC AAAAGGGTTT GGTAGATCAG CAAGTGTTGG AGCTGGAGGA GAAACGCCAA GAAATCGGCA AGATGCGGTT TGAACTGAAG GAACAAGGAC TCCGACTCGA AAAGAAGCAG CTTACGGTAC GGCGCGAAGA AGCACGCAAG TTTGCGAAAA AGCTGGAAGA AAAGGAACAA GTATTGGAAA ATGTTCTAGA GAAACTCAAA GCGGATCCCA CCCGTCGGGT CTTGGCCAAG AGCTGGGACG ATATCAAGTT CGTCAAACGA GACGCCTTGA ACGAAGCTGA GAATATTCCC AGCATCGTTG CACGTAAAAA GAAAGCCAAC GCCGTGCTCG CAGCGGAACA AGGCGAGCTG ATTCCCATTG CGGAACTTCG CGAGCGCCCC GAGCTCAAGG AGGGTGACAA GGTAATTGTT TGCAAACAGG GTCCCGTCTT TGGCCGAGAA GCAACGATCG TTAAGTCTCT CGGTAGTCGA GTAGAAGTGT TGGTGAACAA TATGAATGTA GGCCTCAAAC TGACACAAGT CGCGCTGCCC ACCGCATCCT TTCGATCCAC CTCGGGTCCC GCAAACACAT GGGGCGACGG CCGCCTGTCC ATTGGCCGAG CGGCGGAACG AGCGCTGGCA ACGGAACGCT GTGCGGGACC ATCAACGTCA TCATCGTCGT CCTCCGATAC CGTTGCCGTG TCGGCTCCGT CTAAATCCCG AGGAGTCACG ATGCGCACCA CATCCAACAC TGTCGACGTG CGCGGTTGCA ATTTGGAAGA AGCCAAGGAC CGCATCCGGT CCGCGTTCAG CGCGAGCTTA CTGGCGGGCC GATCCGTGGT TTACGTACTG CACGGCCACG GAACGGGTGG GGTTTTAAAA AGCAAACTGC GGCAGTGGTT GCCCAAGGAG AAGACACTGG TGGATTCCTT CCAAGGAGCC GATGCGGCGG ACGGTGGCGA CGCCTTTACC CGCGTGCAGT TGCGGTAGTC CGTGCATAGC GCCGCCAGCT GGGACTATCT CGCCGTCCAT GAACCGCTTG GTCGACACTC TGCTAGGCAA CGACTAACAC GAGTTGAAGT TTGTGGTGAT TGATATAGTT ATTCTGCGCA CAGAAAGCAT TGTTGGTCCC GGAACGGGCC CGACGGCTGG TACAGGAGCG CCTCGAGCAG CACGGTGTGG CAGGCACCAT CCGTTGTCGC CATGGGTGCG GTATCGCGAC CAACCGACAA ACTGGCTTAC TGTAAATCAT GGAAATATGC AGAGC
|
Protein sequence | MGISSKTILA MLGASLGWTT ALVVDVPHRP WSTSSRTRST SAIGNGNGDM DYDSSARAGA DGYSVLRQPA SRSNWDPNLD PEFEVPLSLD QAQSSFQTQD DYWWNNEVQK AKPKQSIQRS SSVTASSVTT ASDREPANNA IAKPEDLDLF QRSLDTLDYP RVLQALEEQC TTVPARLMVR QASHDLTTNG TSTIQSKKSK RIPKGSERAF QPLTADTVLG TQERYRAVQE LEWILQGGSG QINLADYSYR NRKSYKETLA GKPPPLGGNA FDLLAILAVA EQGKVLEGEE IFDVSQMLDR MQDVRLWSDD GLLNVNRLQQ DIEFVELPKL ASCIQVNTTL QDLLHNAFDK DDRLSGTTFP VLGRLRARVR SLKADIMGTL DSLLALPSIK NKLALESGGP IYSEVNGGRL VLPVAQKYAS SVGIVHDTSR SGKTVYVEPT ELVGPTNELR QAEGELRAEE ARVWRSLTEQ ILKNQIVLET SVRAIGQLDL VMARLLLGRK LSGTIPVVQD EGVIQLRNAK HPVLLLRQVK NVVGSDVDLG ADGNQGLVLT GPNSGGKTVI LKLLGLMALM SRGGIPVPAD RPRVAVGAKS YGDEYDSNND EFQPRIDFFN PVLADIGDIQ SVGGDLSTFS GHMLVCREVL ANSGRNALVL MDELGSGTDP AQGVAIAQAL LEAILETGAR VAITTHYMQL KQLAASDDRF SVAGMQFVQG RPTYKLLPGT VGESFALAVA ERLNLPQSVI DRAEALMDSE TRQLGDLIRE LEDQKGLVDQ QVLELEEKRQ EIGKMRFELK EQGLRLEKKQ LTVRREEARK FAKKLEEKEQ VLENVLEKLK ADPTRRVLAK SWDDIKFVKR DALNEAENIP SIVARKKKAN AVLAAEQGEL IPIAELRERP ELKEGDKVIV CKQGPVFGRE ATIVKSLGSR VEVLVNNMNV GLKLTQVALP TASFRSTSGP ANTWGDGRLS IGRAAERALA TERCAGPSTS SSSSSDTVAV SAPSKSRGVT MRTTSNTVDV RGCNLEEAKD RIRSAFSASL LAGRSVVYVL HGHGTGGVLK SKLRQWLPKE KTLVDSFQGA DAADGGDAFT RVQLR
|
| |
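The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes that Start bp and End bp are 1-based and inclusive, and that the protein is translated contiguously from the first base of the gene sequence (the record lists the gene as unspliced). The helper names `span_length` and `coding_length` are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 664210
end_bp = 667744
protein_length_aa = 1085


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_len: int, with_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    codons = protein_len + (1 if with_stop else 0)
    return 3 * codons


gene_length = span_length(start_bp, end_bp)
cds_length = coding_length(protein_length_aa)
# Bases in the gene span that are not accounted for by the coding region.
extra = gene_length - cds_length

print(gene_length, cds_length, extra)
```

Under these assumptions the computed span matches the listed Gene Length of 3535 bp, and the coding region plus its stop codon accounts for 3258 of those bases. The remaining 277 bases are consistent with the listed gene sequence continuing past the TAG codon that follows the final arginine of the protein.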