Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47534 |
Symbol | |
ID | 7202774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 40691 |
End bp | 41860 |
Gene Length | 1170 bp |
Protein Length | 387 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181831 |
Protein GI | 219123021 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAACCATGA CATCCGCCAC AGCAATCTAC GCCGCTGTCC CTCTGAATGA CAAGATTACA TCGGAATCAA TTGTCGAGAC TCTTGAGGAT AGCGATATCG TCTCCTTCTC GCATTCCACG GATGACGATA CTCTCGAAAA ACCACTTTTG GGAAGCGCTC CAGCCAAATC GGTCATGAAT GTGACCCGTC TCAAGGCTCT CTTTTTCGTA ACCGGCATTT TAATAGCCCT TGCGTCCCAA TACCTCTTGG CGAAAACCAT GTGGAAGGAC GACGTCGTCG GTCGAGCCTC ATGGGAAGTC GTTGTCTTTA GTCTACAGTG GAGCTTTTGG ACCTGCGTCA TGGTGTTCTC CGTCATGATA TGTATGGTCC GCGCCTTTTC TACCCAGCAG CAAACTCCGG TGGAGGAAGG TCTTGCATTT ACCTTGGAAG CACACCACAT TGTCGGGGCA TTGCTTGCCG TATCGGCCAG TTGGTTCACG GTAGATTTTC TCCATCTGCA AGTTCCGACG CATGCCCATA CCCTGTCGAT TCTCGGGCTT GTTTCGATCG CGTACGCAAT TTTTGTTCGC TGCATGACCG CACGCTTTCG AGAGCGTCGT CATAGTTTCA GCGGAACGGC AGATCGGCAA ATCTACTCCT CCACCCAAGC TCTCATGCCG ACCTACCAAC TCTTGGCGGC GACCCTCGGA CTCGTGGTTG GTCTCTGTTC TCAGTTCCTT TTGAGTTTCT TGCTGTGGAC AGACAGCATG ACAACTCCAG TCATCGACAA CATGGTTGTC TTTGCCGCAA TTTGGAGCAT CTCCACCGTC ATCATTACCT TTGTTGGTTG TGCATCCTTG CGCTGCTTGG TTAATCAGGA AGAGCACAAT ATGCTCGAAA CGGAGCGTGT CTTTTTGCGC ATGGAAGCAC ACTACGTCTT TTGTGCTTTG ATTGGAATCT GTGCCGCTTG GATTCTCATG AACGTTGCGC TTGGTTTGGA ACAGCAAGTC TTACCCAGCT TGGGCATGCT CGCTCTCAGC TTGATCGGCT TTCGAGCCAT CCTCCACTGC TTCCCCGAAG AAGATTGCCT AGCTGAGATT GGACTCGCCC ATGCTAGAGA AAAAGAGGTT CTCGTCAGCA AGAGTACCAA AGAACAGGAT GCGCTGCATC TGGTTGTCCA AATCGTCTAA
|
Protein sequence | MTSATAIYAA VPLNDKITSE SIVETLEDSD IVSFSHSTDD DTLEKPLLGS APAKSVMNVT RLKALFFVTG ILIALASQYL LAKTMWKDDV VGRASWEVVV FSLQWSFWTC VMVFSVMICM VRAFSTQQQT PVEEGLAFTL EAHHIVGALL AVSASWFTVD FLHLQVPTHA HTLSILGLVS IAYAIFVRCM TARFRERRHS FSGTADRQIY SSTQALMPTY QLLAATLGLV VGLCSQFLLS FLLWTDSMTT PVIDNMVVFA AIWSISTVII TFVGCASLRC LVNQEEHNML ETERVFLRME AHYVFCALIG ICAAWILMNV ALGLEQQVLP SLGMLALSLI GFRAILHCFP EEDCLAEIGL AHAREKEVLV SKSTKEQDAL HLVVQIV
|
| |