Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50481 |
Symbol | |
ID | 7199321 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 178337 |
End bp | 179432 |
Gene Length | 1096 bp |
Protein Length | 309 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185441 |
Protein GI | 219130582 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTGACTATC ATTTGACAGA ATTGAGGCGT AGGGGGCTTT CGATCCTCGA AACACAACAT CTTTCTAAAT ATGAATGTTT CCGTAAAACA ACCAGAGACA GGGGGGGCAC TTTGTCATTT AGTTGAGGCT GCAACGGCTT TGGCAAACTT GACTTCTGTC AATGAAGTCA GAACTGGCGT TACGAAGACT TCGACGGGTG CAAACAGTCC TCCGACTGAT GAGGGCATGA TTGTCAGTGA TGAAGAAGAG GCTCGAAAGA TAGCACCTAT GGTATTGTCA AATTGTAGCA GTGCTGGGAC TGGAAACTGT AAGCGCGACA TATTTCCGCA GCGCCTTCTA GCGATTTTGA GCGAATCATC CCTATCTGAC ATTATCACTT GGCTTCCACA TGGACGTTCG TTCGTCATCA TACGTCCCGA TGTGTTTACG GTCAAGATTT TGCCAAAATA CTTGCCTCCC GTTGATGCTC GCGGATCACC CAAGTACGCA TCCTTCACGA GAAAACTCAA CCGATGGTAA GCCACTCGGT AGTCAAAAGA GTGGGAAGCA GGTGCAGGTG CATAGATGAG TTTCTCACTC AATTCTGTTT TTCACTACTT CTTCGATATC AGGGGGTTTC GTCAGGCAAC TCGGGGCCCT GACACTGGCG CCTTCTACCA TCCCCTCTTC TGTAGAGATC AACCAGACCT CTGCTTGGAT ATGGTGTGTC AGCGATCTCG CGATCGCAAA GGTGGCGAAT GTGACAAAAA TCGTAGCCAA AGCAATTTGC CTCCCAAAAA GCGGTCCCCT GAAATCAATG AATCCATAAA CAAAATCACT CCTCTCACCA AACAGGCTTT GGATTCTATG ATGTCGCCAG AACCCTCATT AATGAAGAGA CCAAATACAG TATCGGTCGA TGACAATCGT TCCGTGGCAT CCGCATGCAA CTCAGCTTCT ACTGTGTCGT CAAACCTGTC TCTACCACCG CCTAGAATAT CCAGCGACTC TTTGCTGGTA ATCGCGGCAC TGCAACGGCG TGATGAAAAT GAGCGTATCA AGGTTGCGAA AGCTATGTTA TACGAGTCAT ACTTAAAGGC CAAAGAAGGG CAATAA
|
Protein sequence | MNVSVKQPET GGALCHLVEA ATALANLTSV NEVRTGVTKT STGANSPPTD EGMIVSDEEE ARKIAPMVLS NCSSAGTGNC KRDIFPQRLL AILSESSLSD IITWLPHGRS FVIIRPDVFT VKILPKYLPP VDARGSPKYA SFTRKLNRWG FRQATRGPDT GAFYHPLFCR DQPDLCLDMV CQRSRDRKGG ECDKNRSQSN LPPKKRSPEI NESINKITPL TKQALDSMMS PEPSLMKRPN TVSVDDNRSV ASACNSASTV SSNLSLPPPR ISSDSLLVIA ALQRRDENER IKVAKAMLYE SYLKAKEGQ
|
| |