Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39573 |
Symbol | |
ID | 7195376 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 164636 |
End bp | 166262 |
Gene Length | 1627 bp |
Protein Length | 505 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183563 |
Protein GI | 219126646 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.273058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACAC AACGACGATG GTCAATCTTG ATTTTTGTTC TATGCGGGCT ACAGCGTATT TCTCAAGCGC AGCCTTCTAA AGGTACCAGA AAGCTTCCAC CACTGAGGCA CAGTATATTC GAGACACGAT CTTGGGGCTC CTCTGAAACG GGGAAGGGGG AAGGGAAGTC CAAGAGCAGT AAACGAAGCT CTCAGACATC CCATCCGAAA ACACAAGGTA CGCGCCCTAA AAATTGCGAT TTGCCATGTT CCTTTTTAAA TGGATTTCCA ATTCACTTTC CCCCCTTGAA TTGCAGCATC GTCAAAAACG GGAAAGGGAA AAGATGGCAG AAGCTCGAAA AAGTCGAAGA GTTCGAAGTC GAAGAAATCA AGCAAAACAT CATTGAAAAC GTCGACGCCA ACTGCGGTTG TGCAACCTAC AAATGCTCCG AATACAGATA CTGCAGATCC AACTTCTCGA CCATCCCCGC CAGCTATTTT TCCCACTGCG ATCCCCTCTG TAAGCCGGCA AACAGTTTCT CCAACACAGG TCGCCGAGAC TGAGACGCCC ACTCTACTCC CCACGTTTGT ACCCATGTCT TTAGAGGCAA CTGACCAGCC AACTGTTGCT GAAACAACAG CACCGACCGC CGCTGGACCA ATCGTTGTAA CGGTTATCCC TACTATATCC CAAACTCGGC CGCCAGCTAC AAATGAGGTC GAGCCAACAC TAAGTCCGCA CCTTGATACA AGCTCCCCTA CTGCTCTCCT GGCACCAAAC CGAGAGACAT CAACACCTAC AATACGTTCT ACGACAACAC AAACACCCAC CGGCATAGGA GAGGAAACAG CGACACCGAC CATAAATTTT AGTGCGACAC AAGCACCAAG CACCACAACA GTCGACACAC CATCACCGAC GGTAGATTCT ATTGCAACAC AAGTACCAAC CACCATGACA GTGGACTCCA CTTTGCCAAC CTCAGGACCA TCTACCAGAG CGCCATCAGC AGCCACTGAT ACCGAAGCTT CGCCCTTCGT TATTACGTAC CAAACAAGCG ATAGTAGGGA ACCAACACCG GAGGAGTTCG ATCAAGCTCA AGCGGTCACC CTTCAATATC TTGAAGACTT TTTAGTGGGG GAATACGAGT TCAACTTGAT CACAGCGCTC AACGACGTTT TGGGATTGGC TCTTTCAGAG TCAACAGATC CTTTGGCTGT AGCGTATGCA ACCACACTAC TGTTTTCGTC AGAATCAGGA TTTATTCCAT CTCAGGAAGA TATAGATGTG CAGGTTTTTA CCGCATTTCA AGAACAGGCT GTTTTTGATC TCGTAGCAGC ACTTCAAGGC CTGCCTCCTG AAAATCCCTT CTCATCGACT ACAAGCGTCC AGTTCACTTC TTTTGTGATC GAAGTAGTGC AAGCATCTGA GACATCCTCA GGAAGTACGG TGGCTTTCGG AGCTCTTATT GGGATGATCT TGTTCATTTT TGGTTTGCTC GCGTCTCGTG TTGTGTATCG GAAGCCATCG TACATCTCTG TTGACAACGA CATCCCCCAT AGGGTAATAA TTCCTGGACA AAGCAACATC ATGGCATTCA TGGACTGTGA AAGTGAAGCA TCTTCTCGCA GAACTTCAGC GAATTGA
|
Protein sequence | MTTQRRWSIL IFVLCGLQRI SQAQPSKGTR KLPPLRHSIF ETRSWGSSET GKGEGKSKSS KRSSQTSHPK TQGTRPKNCD LPCSFLNGFP IHFPPLNCSI VKNGKGKRWQ KLEKVEEFEV EEIKQNIIEN VDANCVSPTQ VAETETPTLL PTFVPMSLEA TDQPTVAETT APTAAGPIVV TVIPTISQTR PPATNEVEPT LSPHLDTSSP TALLAPNRET STPTIRSTTT QTPTGIGEET ATPTINFSAT QAPSTTTVDT PSPTVDSIAT QVPTTMTVDS TLPTSGPSTR APSAATDTEA SPFVITYQTS DSREPTPEEF DQAQAVTLQY LEDFLVGEYE FNLITALNDV LGLALSESTD PLAVAYATTL LFSSESGFIP SQEDIDVQVF TAFQEQAVFD LVAALQGLPP ENPFSSTTSV QFTSFVIEVV QASETSSGST VAFGALIGMI LFIFGLLASR VVYRKPSYIS VDNDIPHRVI IPGQSNIMAF MDCESEASSR RTSAN
|
| |