Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33729 |
Symbol | |
ID | 7197977 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 224988 |
End bp | 227285 |
Gene Length | 2298 bp |
Protein Length | 679 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178186 |
Protein GI | 219114781 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGTGA GGGATCCTAG ACTTAGAATT GGAGGGAAGG TGACGGCAAA GGCTTGTCAT GTTGTGCATC TGAGCGAGTG CGCACGGAGA TATGGCGTCA ACAAGCACTC CAAGCGGCTT GTTGGAACGG TTCTAGACGT CACGACCACC CCTGTATCCA TTACAACCGG GCGTACCTCT ACTTTGATAA CAGCAGTTTA TGATTTTGGA GAGAGTTTGT TCAAGGAAAA AACACTGAAC ATTCGGAGTG TAAAGGCATT TGTACCGCCA GAAGATGAAG GAATGTCCTT AATTGAGGAA TTAGCAGCAG AGGCTTTGCA GGCAGCAGAA GCAGACATGG AAGGCGGAAA CTTGATGGAA GAAAGTGTCG AAGCCCCGGT AGCCAAAATG GTTGAAACCC CGGCTGACAT AGAGCCCGAT ACCTTGGTCG ACACAGAGCC CAATACCCCG GTTGACACAG AGCCCAATAG CCCGGTAGCC GAAATTGTCG AGACCCCGGT TGACACTGAT ACCTCGGTTG ACACAGAGTC CGAAAACCCG GTAGCCACAG TGCACCAAAC AGAGTGGTAT GTGAATGAAA GAAAAACCCG GCTGGATGTG AATGGCCATG TCTATGTTAG GCACTTCCAT ATCCGTACTT CAGTTGGTGA CCTTATTGGT CAAGATTCTG ACAATGGGGT GAGATTTTCG CGCCTCGAAT ATTTTCTGCT CATGTTTCCG CCGACCCAGC TGACTACTAT GTGTCGGCTT ACAAATACTC AGCTTGCACA GCGAAACAAG AATCCAATCA CAACCGGAGA ACTTCTTCGG TTCTTTGGAA TGCTCATACT CACTACAAAG TTTGAGTTCA GTAGCCGGGC CCAACTATGG TCCACCACTG CACCCTCCAA GTACATTCCT GCCCCTTCAT TTGGACGCAC AGGAATGTCC CGGCAACGGT TTGACGATAT CTGGAAATAT ATCCATTGGA GTGAACAATG TCCAGTCTGA CCCGATGGTA TGAGCACTCA TGTTCACCGA TGGCAACTTG TTGACGACTT TGTCACAAGG TTTAATGAGC ATCGTAGCAA AAACTTTGTA CCTTCCCATC TGATTTGCGT GGATGAATCT ATCTCAAGAT GGTATGGGCA GGGTGGGGAT TGGATAAACC ATGGTCTACC AAATTATATT GCAATTGATC GAAAGCCTGA GAATGGGTGC GAGATTCAAA ACGCAGCGTG TGGACAATCC GGTATTATGC TTCGATTGAA ACTTGTAAAG GGAAAGACGA TAACTGACGA CGAAGAGGGT GACGAGGAGG ATGAGTATCT ACCGCATGGT GCAAAGATTA TCAAAGAACT TGTTCGTCCT TGGTGGGGGA GTGATCGGAT TGTGTGTGCT GATTCTTATT TTGCCTCCGT TGTGACAGCT GTCGAGCTTA AGAGGATTGG CTTGAGATTC ATTGGGGTTG TGAAGTCGGC AACGAGAAGA TATCCAATGG CCTACCTTTC ACAGTTGGAA ATGACAAGTA GAGGAGAATG GAAAGGATTG GTGACAGACG GAATCTCGGA TGGAAGTTGT GACCTGATGG CTTTTGTATG GGTGGACCGA GACCGTCGAT ATTTTATATC AACAGCATCC AATCTGAATA GAGGCTGGAA TCCAGTTCGC TACCGGTGGA GACAGGTGGA TACATCACCT GATGCAGACC CTGAGAGGGT GGAGATCAAT ATTGCGCAAC CAGTTGCAGC AGAAGTGTAT TATTCTTGTT GTGCAATGAT TGACAGACAT AACCGGAGTC GGCAGGATAC ACTGATGCTT GAAAGAAAAC TTGGCACATG GGATTGGTTG ACACGAGTCA ACTTATCAAT TTTTGGTATC ATTGTTGTGG ACACATGGTT AGCCTACAGC CAATGTACAG GAATAGGAAA GTCTGCTGGA CGAGAAGAAA AGCAGAAGGA TTTCTACAGT GCCTTAGCCG AGGAGCTGGT GGACAACCAG TACGATAGTG TTGGAAGTCG CAAAGTTGGG GGGGATGAGT TGGACAAGGA TAGCCCAACC ATCTCCAGAA CTGGAGAGCC GCGATGTGGT CTCTCCGCAC ATCTAACACC CACCAAAAGA AAAAGAAAGA ACAAAGATGG TACTATTAAA AACCAAAGAC AGCAGGGAAG GTGTTTGGTG TGTTCCAAGA AGACCACATA TGTTTGCTCT GTATGCAAAG ATGTTGAGAC AATTGAAAGC AAAGAACCTT GGATTTGCTA CACAACGGGA GGGCAGCTAT GCTTTGCCCA GCACTTGACT ACCTTGCATG GTAGTTAA
|
Protein sequence | MPVRDPRLRI GGKVTAKACH VVHLSECARR YGVNKHSKRL VGTVLDVTTT PVSITTGRTS TLITAVYDFG ESLFKEKTLN IRSVKAFVPP EDEGMSLIEE LAAEALQAAE ADMEGGNLME ESVEAPVAKM VETPADIEPD TLVDTEPNTP VDTEPNSPVA EIVETPVDTD TSVDTESENP VATVHQTECL HSETRIQSQP ENFFGSLECS YSLQSLSSVA GPNYGPPLHP PSTFLPLHLD AQECPGNGLT ISGNISIGVN NVQSDPMHRS KNFVPSHLIC VDESISRWYG QGGDWINHGL PNYIAIDRKP ENGCEIQNAA CGQSGIMLRL KLVKGKTITD DEEGDEEDEY LPHGAKIIKE LVRPWWGSDR IVCADSYFAS VVTAVELKRI GLRFIGVVKS ATRRYPMAYL SQLEMTSRGE WKGLVTDGIS DGSCDLMAFV WVDRDRRYFI STASNLNRGW NPVRYRWRQV DTSPDADPER VEINIAQPVA AEVYYSCCAM IDRHNRSRQD TLMLERKLGT WDWLTRVNLS IFGIIVVDTW LAYSQCTGIG KSAGREEKQK DFYSALAEEL VDNQYDSVGS RKVGGDELDK DSPTISRTGE PRCGLSAHLT PTKRKRKNKD GTIKNQRQQG RCLVCSKKTT YVCSVCKDVE TIESKEPWIC YTTGGQLCFA QHLTTLHGS
|
| |