Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37861 |
Symbol | |
ID | 7202659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 247439 |
End bp | 248560 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182035 |
Protein GI | 219123445 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.095088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGGAC TGAATGGATA TGACCAAAAG GCTCGTAGCG TGCAAGAAAT TGGCTGCCGT GTACTGTGGA CTTTGGCACT GAATGCCGAT ACCACCAAGA CCTCCATCGG AAAACAAGGG GGAATTAGAG TGATATTGGC TGCGATGCAG CGTCACAACA GAGTCAATCA GAGCGACGCA GCAGTGCAAG CATCTTGCCT CGGGGCGCTC TGGAGCCTCT CTTTACTCGA GAGTAACGCA ATGTGGATTG CTTTGCGAGG TGGAATCGAT CTTATCATTT CCGCAATGCT CAGGCACAAT TCTGGGACTG ATCTAGACGG CGAGCTTCAA AGAGTCGGCT GTCTCCTTTT GACTAGTCTA TCCAGAGAGG GTCTCACCAA AAGCGCTGGC ATCCGTACCA TGCTAATGAG ACAAGGCGGT ATAAGCGTAC TGCTTAGGGC TATGCGTCGA CACAATTCGG GTAGCCATCT TTCGGCTATG TTGCAACAGG CCGGTTGCAC CGCAATCGCT AACCTTGCTA AAGATAGCAA AAGCCGTCAA CTCTGTTTAG CAGAGGATGG AGGTATCCAG GTAGTCTTAG AAGCGTTGCA GAAACACAGT GGAGCCGAGT GTTTGCTGGT ACAGCGCGAA GGTTGCAAAG CGCTGGCTCA TATTGCACAG AACATTTACA ACTCAATCTC GATTGGCAAA CAAGGAGGAG TCATGGCTGT TCTTGCAGTC ATGCGAAAAT TTGGCTCTGC TTCCGATTCT GATGTCAGCC TCCAAGAATC CGCTTGTCTT GCGCTTTCAA ATTTAGCGCA AACCTACGAA AACAGAGCTT CCATCATGGA GTCCGACGGT TTAGATCTTG TTCTCGCAGC GATGGAAGCA GGAAGGAATC GTTCTATCCT CGGTTTAGAC GCAGACTTGC AGTTGGCAGC TTGTGGTGTT TTGTCGCGAT TAGCAAAAGA CTCGGAAAAT GCCAATGTCA TCGCTGAACG TGGAGGGATT GAGGTCGTCT TACTCGTATT GAGCAAATAC AAAGCAAGCA ATTTCGTTAT CCGAGACTGC GGCCGTGCCG TCTTGAAAGA GCTTGCGAAG AAATGCGACG ATGAAATTGT TATCAGATCA TGCCGGGCAT GA
|
Protein sequence | MRGLNGYDQK ARSVQEIGCR VLWTLALNAD TTKTSIGKQG GIRVILAAMQ RHNRVNQSDA AVQASCLGAL WSLSLLESNA MWIALRGGID LIISAMLRHN SGTDLDGELQ RVGCLLLTSL SREGLTKSAG IRTMLMRQGG ISVLLRAMRR HNSGSHLSAM LQQAGCTAIA NLAKDSKSRQ LCLAEDGGIQ VVLEALQKHS GAECLLVQRE GCKALAHIAQ NIYNSISIGK QGGVMAVLAV MRKFGSASDS DVSLQESACL ALSNLAQTYE NRASIMESDG LDLVLAAMEA GRNRSILGLD ADLQLAACGV LSRLAKDSEN ANVIAERGGI EVVLLVLSKY KASNFVIRDC GRAVLKELAK KCDDEIVIRS CRA
|
| |