Gene Information |
Locus tag | PHATRDRAFT_26176 |
Symbol | |
ID | 7197777 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 778935 |
End bp | 780387 |
Gene Length | 1453 bp |
Protein Length | 408 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178302 |
Protein GI | 219115013 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ACTGTACGAA CTTTACTCCA TTCCACTTTC GCGAATCTTC TGCTTTTTTG TCTTCTTGGG CCAAACTCTT TTGCTTCTAT CATGCCAACC TACAAATTGC ACGCCCCCGA AGCATCGTTC CGTGCTTTCA GCACGCTCAT TGCCGCCGAA TACAACGGGG TCAAGATCGA TGTCTCCACG GACCTCGGTG CCGCTTCCAA GTCCCCGGTC GGCAAGTTGC CCTTGCTTGA GCTTCCTGAT GGTTCCACTA TTTTTAGCAG CCATTCGATG GCCCGCTTTG TCGCGGGAAT CCGACGTGAT AGCGGATTGA CCGGGAACAC TTTGAACGAA CAAGCTGCGA TCGATGCATG GATGGACTGG GCGGGCCAAG ATTTGGAATT GCCTGCTTGT GTTTGGTTCT ACCCAGTAGC GGGCTACATG CCGTTCAATC AGTCTGCGTA CGTATGGATA CAAAAGATTC ATTAGAAACC TTTTGCGTCT CAATTCGCTT CTCCAATACT CACTTATCCC ACTCATTTGA ATTATTTCAG TTACGAAAAG GCCAAGGCTG ATCTGGGCAA TGCGCTTGCT GTGCTTGACC AGCACTTGTT GGATAAGACT TATTTGGTGA ATCATCAAAT AACCCTTGCC GATATTGTTG TCGCGTCGAC CCTGCTTTAT CCATTCAAGC TCGTTGCTGA TAAAGGCTAC CTTAAGCCTT ACGGAAACGT TGTTCGCTGG TTCCAAACCT GTGTCAATCA GCCCGAGTTC CAGCAGGTTG TTGGGCAAGT TGCTATGTGC AAAAAGGAGT TGGCCGCTGC CGGGCAAGAA ACGAAGAAGA CCAGTGGTGG CAGTAGCAAG AAATCCGAAA AGAAGCCGAA GGAAGATGTT GCTGCGTTGG CACCGGCGCC GGCGCCAAAG ACCGAACATC CCTATAAGAT CATGGATAAG GAAGCTCCTT CGGGTTTCTC TATGGATGCC TGGAAGAAGA CTTATTCCAA CGCCAGCAGC TACGACGCGG CCATGCAAAC CTTCTGGGAG ACGTTTGATG CCGAAGGCTG GGCGCTCTGG TTGCAGGTGT ACAACTACAA CGAAGACAAC AAGCGGATTT TCATGAGCGC GAACGCCGTT GGTGGCTTCC AGCAGCGAAC CGATGAGATC CGTAAGTGGG CATTCGGCGT TATGGATGTT CTTGGTACTG AGGAAACTGT TTTGGAGATT AAGGGTATTT GGCTGTTGCG AGGCGACACG GTCGAACACA TGGTTTCTGC GAATGACGAC GCCAACTGGT ACACCTGGAC GAAGCTCGCT GGAAAAGGAT TGCCGCCCAC TGACGAAGTC AAAACCCAGG TTGCTGCCTA CTGGTGCTCC GAAGATGAGC TCGAGGGCAA GCCTATCCAG GACAGCAAGG TTTTTAAGTA GGAATTTCCA TATATATGAC CCTAATTGTC ACTAGTGAGT TGGACTTATC GAC
Protein sequence | MPTYKLHAPE ASFRAFSTLI AAEYNGVKID VSTDLGAASK SPVGKLPLLE LPDGSTIFSS HSMARFVAGI RRDSGLTGNT LNEQAAIDAW MDWAGQDLEL PACVWFYPVA GYMPFNQSAY EKAKADLGNA LAVLDQHLLD KTYLVNHQIT LADIVVASTL LYPFKLVADK GYLKPYGNVV RWFQTCVNQP EFQQVVGQVA MCKKELAAAG QETKKTSGGS SKKSEKKPKE DVAALAPAPA PKTEHPYKIM DKEAPSGFSM DAWKKTYSNA SSYDAAMQTF WETFDAEGWA LWLQVYNYNE DNKRIFMSAN AVGGFQQRTD EIRKWAFGVM DVLGTEETVL EIKGIWLLRG DTVEHMVSAN DDANWYTWTK LAGKGLPPTD EVKTQVAAYW CSEDELEGKP IQDSKVFK
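
The Gene Length and GC content fields under Gene Information can be cross-checked from the coordinates and the gene sequence. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which is consistent with the listed values, since 780387 - 778935 + 1 = 1453). The function names `gene_length` and `gc_percent` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a sequence.

    Whitespace is ignored, so the space-separated 10-base blocks used in
    this record can be passed in unchanged.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Start bp and End bp fields of this record.
print(gene_length(778935, 780387))  # 1453
```

To check the GC content field, the full text of the Gene sequence field can be passed to `gc_percent`; the record appears to report the value rounded to a whole percent.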