Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50598 |
Symbol | |
ID | 7199439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011700 |
Strand | + |
Start bp | 31411 |
End bp | 34514 |
Gene Length | 3104 bp |
Protein Length | 1008 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185554 |
Protein GI | 219130822 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.732398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGCGTGTACA GGTTAACAAC AGTAACTAAG CTTTGCAACA TTGTCACTGT AGCGTTTCAC TGTCACTGTC ACGGATCATG TCTGATTCTC CGTCAACCAG GAACGCCGAC GATCTGGAAG TTCCTTTGGT AGACGATCGT TCCTATCGCT ACTGGACTCT TCCCGCGACG TCCACTGCCG ACGTCGTCAC AACGGATCGT CCCGGACTCC GCGTCCTGTT GGTTCACGAT GATTCCGTCG ACAAGGGTGC TGCTGCGGTA GACGTGGCGG TCGGACAGTT CCAGGACGGT GACTTGCCGG GCCTCGCACA CTTGACGGAA CACATGCTCT TCCTCGGCAC GCAACGCTTT CCGCAAGAAA ACGCTCTGGA CAGTTTCCTC GCCGCACACG GGGGACACTC CAACGCCTAC ACGGATCTGG AACACACCGT GTACTACATG GATGTGCAAG CGGCACAGTT GGAACCCGCA CTCGATCGAT TCGGTTCCTG CTTCGAAGCA CCGCTCCTAC TCGAGAACTG CGTCGCCCGT GAATTGCAAG CCGTCGACAG TGAACACGGC AAAAACAAAC AGTCCGATTT CTGGCGGTAC CATCAACTCA CCAAAACACT TCTGGGACAG CACAATAGTC ACGTCTATCA ACAATTCGGG ACGGGCAATC TAGAGAGCTT GCAACCCCAA GGAACGGCCG TTTTGCGGCA AGCCGTACAC GACTTTTATC AGCGTTACTA CCACACCGCT CGTATGACCC TCTGTGTCCT TGGCAATCAG GATCTTGACG TGCTGCAAGG ATGGGTGGAA AAGTATTTTG GCAGCTTGCC CAGTCAGCCG AGTGACACCT TGGTGGAACC ACCCGTGCCG CCGTTGACAC CGGTCCTCCC ACAACGCGTC CACGTCGTTC CGACACGAGA AACCAACGTA CTCGAATTGC AATGGTGCCT CCGGGAAATA CAATCTCTCT ACCGGTCCAA GCCTACCCGA ATACTATCGC ACTTGTTAGG GCATGAAGGC CCCGGCAGTT TATTGGCCGT CCTACGGGAA CGACTCTGGG TGCAGGAATT GTACGCCGAT GACTCCAGCA AAACTACCTC CGCCTTTAGT ATATTCTGCG TACAACTCGA ACTCACCGTG CTAGGATGGG AACACGTTAA CGACGTCGTG GCCACGGTGT ATCGGTATAT TGGACTGTTG CAGAACGAGA TTCCCGCCTG GGTTGCGGAC GAATTGCAAA CCACCGCGTC TACGCAGTTT CGATTCTTGT CCAAAAGCTC ACCCTCCGAC ACCGTGTCCA GAGTTGCACA CCAAATGCAA GAGTTTGCGA TAGCGCACGT ACTGTCGGGA CCGTACCTAG TCTACGAGCA CGACATGGCT GCCGTCCAAT CCTGCCTCGC CAGTTTGCAC GTCGACAATA TGCTCGTACT TGTGGCCTCC AAGGAGTATA CTGGACAGAC CACCGCGACC GATCCGTGGT ACGGTACCCA GTATGCCACG GTTGCGCTGG AACCAGACGC GTTGGAAGCG TGGCGTCAAG CGCGCAGCGC TGCGACGGAT GGTAGCGGTG TCGATTTCAT CGGTCTACAT CTCCCTGATC GCAACGACAT GCTTGCTACC GATTTTGAGC TCAAAACGTC TCCCTACGCC GTCTTTGCCA AAACGAACAC GAACGACAGC AATGGCGACA ACGGCAACGT TCCACCCCCG CCCCGTTGCT TATTGGACAC AGATACGTGT CGCCTTTGGT ACAAACCTGA TACAGAATTC CGCATGCCCA AGGTCAACAT CATGTGTGTC TTGCGTAGTG CTACGGCCTA CGAAAGCGTG ACACAGTCTG TCTTGGCATC GTTGTGGTCG GAAACTGCAG ACGAACTTTG CAACGTGTTT TCGTACGCCG CGTCCATGGC TGGCCTGCAT TGCAACTTTT CCAATACGCG GAATGGTATG GAACTCCACC TGTCCGGCTA TCACGACAAA GCTCACGTTT TGCTGCAACG AATTGTGGAC ACGGTTCGGG ACTTTCGGGT AACGCCGGAT TTGTTTGAAC GTATTCAATC AAAATTGGAA CAGCAGTTTC AGGAATTCTT GGTAGCACAG CCGTATCAAC ACGCCATTTA CGCTGGCGAT TTGTGTTTGG AAACACCCAA ATGGGACATT CACGACCGGT TGCAGTGTCT CGCTTCGCTG ACTTTAAATG ACCTTCAGCA CTTTGGTCGT CACATTCTGG CTCGGTTTCA ACTCGAAATG CTGGTCCACG GGAACGTGAC CGCGTCCGAA GCGGTTCAAC TATCGGATAT TGTTTTGCTC GGTTGGCGAC CTCAAGCACC ACTCAATCAA ATCGATGTCC GAGTAGTCCA GCTCCCTGCA CAAGGTTCCG AGGGTACGTC GACTGTGCAT CGATTTTCCG GCTGGAACGA AGACGATGAA AATAGCTCGG TGTGCAACAT TTATCAGGTA GGAACCATGG ACACCAAGAT GAATGCAACT CTGGGCCTTT TGCATCATTT GATTCGCGAG CCGGCTTTTG GTCAATTGCG CACGCAAGAA CAATTGGGAT ATATTGTTCA CACACAGGTC AAAACGAGCG GGGACAAAGT AAAGTCGTTG CTATTCTTGA TTCAGAGTGA CTCCTTCGAT CCGATCCACA TGGACCAACG GATCGAAGCG TTTTTGGTAG ATTTTCGTCA TAAACTGGTG CAAATGTCGG AGCCTGACTT TGCCGCCAAT GTTGGCGCCT TGTGCCAAAG CTTTTTGGAG AAAAACAAGA ACTTGAGTGA AGAATCGTCC CGATATTGGC ACGTGATCAC CAACCAAACC TATCGATTCT ACCGGATGTC CGAATTGGCG GCTGCTGCCC AAACCGTAAC AAAATTGGAT GTTTTGCGTT TCTTGGACCG TCACGTCCTG GCAACGTCCC CGTACCGCCG TAAGCTGTCT GTGCAAGTGT TTGGACAAAA TCATATTGCG GATCTCTTAG ACAAGACGGA TGTTGCTGGG GATGGTATTG TTCTTGTCGA GAGCGCCAAC GACTTCCGTC GGTCACAGGC GCTCTTTCCT ATGCAAGCGT CCGCTTCGAT TGAGGATTGG CGATTAGACG CGAAAGACGA CTAA
|
Protein sequence | MSDSPSTRNA DDLEVPLVDD RSYRYWTLPA TSTADVVTTD RPGLRVLLVH DDSVDKGAAA VDVAVGQFQD GDLPGLAHLT EHMLFLGTQR FPQENALDSF LAAHGGHSNA YTDLEHTVYY MDVQAAQLEP ALDRFGSCFE APLLLENCVA RELQAVDSEH GKNKQSDFWR YHQLTKTLLG QHNSHVYQQF GTGNLESLQP QGTAVLRQAV HDFYQRYYHT ARMTLCVLGN QDLDVLQGWV EKYFGSLPSQ PSDTLVEPPV PPLTPVLPQR VHVVPTRETN VLELQWCLRE IQSLYRSKPT RILSHLLGHE GPGSLLAVLR ERLWVQELYA DDSSKTTSAF SIFCVQLELT VLGWEHVNDV VATVYRYIGL LQNEIPAWVA DELQTTASTQ FRFLSKSSPS DTVSRVAHQM QEFAIAHVLS GPYLVYEHDM AAVQSCLASL HVDNMLVLVA SKEYTGQTTA TDPWYGTQYA TVALEPDALE AWRQARSAAT DGSGVDFIGL HLPDRNDMLA TDFELKTSPY AVFAKTNTND SNGDNGNVPP PPRCLLDTDT CRLWYKPDTE FRMPKVNIMC VLRSATAYES VTQSVLASLW SETADELCNV FSYAASMAGL HCNFSNTRNG MELHLSGYHD KAHVLLQRIV DTVRDFRVTP DLFERIQSKL EQQFQEFLVA QPYQHAIYAG DLCLETPKWD IHDRLQCLAS LTLNDLQHFG RHILARFQLE MLVHGNVTAS EAVQLSDIVL LGWRPQAPLN QIDVRVVQLP AQGSEGTSTV HRFSGWNEDD ENSSVCNIYQ VGTMDTKMNA TLGLLHHLIR EPAFGQLRTQ EQLGYIVHTQ VKTSGDKVKS LLFLIQSDSF DPIHMDQRIE AFLVDFRHKL VQMSEPDFAA NVGALCQSFL EKNKNLSEES SRYWHVITNQ TYRFYRMSEL AAAAQTVTKL DVLRFLDRHV LATSPYRRKL SVQVFGQNHI ADLLDKTDVA GDGIVLVESA NDFRRSQALF PMQASASIED WRLDAKDD
|
| |