Gene Information |
Locus tag | PHATRDRAFT_34124 |
Symbol | |
ID | 7198130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 1154539 |
End bp | 1157049 |
Gene Length | 2511 bp |
Protein Length | 836 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178651 |
Protein GI | 219115711 |
COG category | |
COG ID | |
TIGRFAM ID | |
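The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon (the record lists the gene as unspliced). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


# Values taken from the Gene Information table.
length = gene_length(1154539, 1157049)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (2511 bp) and Protein Length (836 aa) fields.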
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCACC GACACAACGA ATGCACCGGA GTTTTCAATA GAGACTTATC TGATTGTGGT GAAGAAAGTC GATCCCAGTT AAAAAGCTCA AGCCGGCGGA AGAGGTCATT TTTTCTCGTC ACTCCGGAAG GGAATGCCCT CACTGTCAGC CCTGGCTCGC ACTCTTCTTC GCACCTCCGG GGCCAATTCC CCTCGGAATT ACCCTCGGAC TGGCAACTGC ATCAACCTGA TCTGACTTCC GTTTCTCAGC TTCTAGCAGC TTTGCTTCTG ACGACGGCTC TAACGGTGCT GTCCGCACTC GATGCCATGC ACCACTGCCT GCAACACGAA ACCTACGCTT TGCTCGACCT CGTGGACGAC GACAACTTCC AGCAAGCGTA CGAGAACGCA ATTTCGAACG AAGGAGGGAA GTGCAGGCGC ATGTTTCAAA ATGTCTTGGT TCCGACAGGT GCAGTCACAT TCTCTTTGGG AGGTGTTTGT CTCGTACTCA TCTACTGCCA CAGTCGTCGG GGCGGCACCT GTGCCTACGC GACGATCAGA GTGACCGTCA AACTGCTACT CTTGCTGTTA GCCATGTTCG CAATTCAAAC CTACAGTATC ATGGCTATCA TGCTGCAGAC ACGGAACTAC TCCGCCAATA GTTCCGACAA TCCCTATGAG CACTTGGCGG CCGTTGACCA TTACGGTCAC GTCGCTGACA ACGCTAATCT ATACTACATG GCTTGGATCT CGGAGATCCT AGCCATTTCT TTAGTATACC AGACAGTAGC AGCTACGTTT CGTCTCACAC GGGCAGCCAA AAATCAAGCG GACAGTGCTT TGCTACCGGC GACATCCTGG GATTCTTCGT ACAGAATGGA TTCGGAAAGT CGGGCGATGT GGTACAAGTC TCTCTACCGA CTGCGTATTC GAACAGGCAT ATGGATGGCA GCGTTTTTAT CAAGTTTTGT AGTAATCGCC AGTTCCCAGT ATGTTTGGAC ACAAGTATTG TGGGCGTACG CTTCGCAGTA CAGCGCCGAC GCCAAGTACA CGACCGTCTG TCGCGTCGTT CGTGAGAATT CCAATCTTCC ACCGCAGCTC TGCCGCCGTA CAGTTACCTC GTGGCTCTCG GGGGTTATTG GATCAGCACT AAGCGCAACC GCAATTGGTA TGCATTTGGC TGGACGCTGG ACGGCCGCGG ACAGTCAATT GCATCAACAG GACTCTCTGC CGGTGTGGGA GCGTTTACTG GTACAGAACA GATTACCACT GCGTACGGAG CTTGTTTTAA GTGTACTACT CAGTATCGTA TCTGGATTCA ATGCAGTTTT TGCAACGGGC GTACAAGGAC CTGCTGCAAC TGTTGGAAAC CTGTACTACG CGTCATGGCT GTGCTTCTTG CTCTATGTTC GAATCGGACT GGGATGCTTG GAAGAACATC ACAATATCGA GGAAGTGGGA GAAGAAATCA ACCAAGACGG ATACATTGCA CCCGTACTGG TGGGTCGGCA AGTTGGCGTT ACTTCGGCAA GAGACATGTC GACTTTGCGG AGTAGTAGTG ACACAACGGG AAAGTCGGAG CAAGTATATA CTGATCCATT GGAAAAGGAT CGGGCGCCTA TTGCACGGAA GTACTTCTTT CTTGCTATCT TTTCTTTGGT ATGCGCTGCC TCAGCGTACG ATGCGGCATC GAATCAAAAT AAACGCCTGA CACCGGTCCA GATTTACATG AGTACCGCAC CCTGTGTTGT GTCCGTTCTT TCTGGTTTTT TATTTGTTCT GTGCCTTTCG CCGCACTGCT ACACCATCGT CTCACGGTTT 
TGGATTGGTG GATCTCTATC CATATTTTGT TTTATCGTTT GGCTTGTCAA CTTGGTACTG ACGATGCACT CCGCAAATTC TTGGGCTGTG AATGGAATTG GAGAGATCAA GACCGCCAAT CTGTACTATT TTTCTTGGGC ATCTATTATC ACATCTAGCA TTCAAATGAT GTCTTATCTC AAGGCGGCAT TCGGGGTAAA AAAGAACGAC TATATGTCCT TCGTTTGGGT CGGCATTTGC AAAGTTTGCT TTGTAGTACT TGGTGCAGCC ATGCACATTT GGCACACTAT CTCTGGCAAC TGTGGTTTCG ATGAAATTAC AATCGGGGCC GTGACCTTTT GTTCACGAAC CGTTCTGGCC ATGGTCGTTG CCCTCACAGG AATGCTAGTT GGAGGTCTGG TTGTCGCCGG TCGGATGCTT CTGATGTGCT GTCCGTCCTG TCAATGCAGT CGTTTCCAAA CACACATCGA GATGATCATT TCTATTTTCT TGGTCTTTCT TTTTGGAGCC GCAGTAGCGC TGATCACCGG GATTGGTGGA CCAGGACAGA GTGTCGGCGA CCTTTACTAC AGCACATGGT GTGCCTTCTT GATTTCAATT GGAATCTTTG TCAGTTGCTT CGAGCAGATT AAATTAGAAG ACATGGAGTT GGACTCCTCA CAACCAAAAC AGCTTGAACG AAGGAAAGCG ACAGACAATG TTCTAGTGTA A
|
Protein sequence | MDHRHNECTG VFNRDLSDCG EESRSQLKSS SRRKRSFFLV TPEGNALTVS PGSHSSSHLR GQFPSELPSD WQLHQPDLTS VSQLLAALLL TTALTVLSAL DAMHHCLQHE TYALLDLVDD DNFQQAYENA ISNEGGKCRR MFQNVLVPTG AVTFSLGGVC LVLIYCHSRR GGTCAYATIR VTVKLLLLLL AMFAIQTYSI MAIMLQTRNY SANSSDNPYE HLAAVDHYGH VADNANLYYM AWISEILAIS LVYQTVAATF RLTRAAKNQA DSALLPATSW DSSYRMDSES RAMWYKSLYR LRIRTGIWMA AFLSSFVVIA SSQYVWTQVL WAYASQYSAD AKYTTVCRVV RENSNLPPQL CRRTVTSWLS GVIGSALSAT AIGMHLAGRW TAADSQLHQQ DSLPVWERLL VQNRLPLRTE LVLSVLLSIV SGFNAVFATG VQGPAATVGN LYYASWLCFL LYVRIGLGCL EEHHNIEEVG EEINQDGYIA PVLVGRQVGV TSARDMSTLR SSSDTTGKSE QVYTDPLEKD RAPIARKYFF LAIFSLVCAA SAYDAASNQN KRLTPVQIYM STAPCVVSVL SGFLFVLCLS PHCYTIVSRF WIGGSLSIFC FIVWLVNLVL TMHSANSWAV NGIGEIKTAN LYYFSWASII TSSIQMMSYL KAAFGVKKND YMSFVWVGIC KVCFVVLGAA MHIWHTISGN CGFDEITIGA VTFCSRTVLA MVVALTGMLV GGLVVAGRML LMCCPSCQCS RFQTHIEMII SIFLVFLFGA AVALITGIGG PGQSVGDLYY STWCAFLISI GIFVSCFEQI KLEDMELDSS QPKQLERRKA TDNVLV
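The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any computation. The sketch below, with hypothetical helper names `normalize` and `gc_fraction`, strips the whitespace and computes the GC fraction. It is demonstrated only on the first two blocks of the gene sequence, not on the full gene, so its result should not be read as a recomputation of the GC content field.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First two ten-base blocks of the gene sequence, used as a small example.
fragment = "ATGGATCACC GACACAACGA"
print(normalize(fragment), gc_fraction(fragment))
```

Pasting the full gene sequence into `normalize` and checking its length against the Gene Length field is a quick way to confirm that no bases were lost in copying.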