Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_1389 |
Symbol | |
ID | 7204211 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 688166 |
End bp | 689239 |
Gene Length | 1074 bp |
Protein Length | 334 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186109 |
Protein GI | 219113051 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000337906 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCGATCCCG ACACGGTTAA GCAATTATTT TTACCAGGAA TTGGGCTTAA AGGAAGCATT CCTCGTGAGA TTGGTCTATT GACGGATCTC GAAAAAATTG ATCTTTATTC GAACGATCTT AATGCAGCAA TTCCTGATGA TATGCAATAT CTTGAGAAAC TCGGTAGTCT TGTGCTCTAC GACAACTTAC TTTCCGGCGA GATTCCGGAA TGGATTGGCT CTCTATCCGA TTTGGCTACG ATCAATTTAG CCAAGAACAA ATTTGAAGGT GGCGTACCGT CGAGTCTGTT TAACTTGACT AAGTTGATCA CTCTTAACCT CGAGAACAAT GACTTCCGCT TTCAACTTGA TGACCTACAG CTTCAATCTA ATCTTCAGGC TCTATTTCTT GGCGGCAACA ATATTTATGG AAAGTTGACC GAGGAAAGGG TAGACATCTG GGAGGACATC GAAGTATTGG ATCTAAGCAA TAACCAGCTA ACGGGTCCTC TTCCTTCGAA TTTGTTTAAT CAAGAGGCAC TCATTATTTT GGACCTACAC GGTAACAAGT TCACAGGTCA CATCCCGGAG TTTGAAGATT CTCCTTTTTT GACTTTTTTG GCGCTTCAGG ATAACGACCT CACTGGATCC ATTCCGGCCA GCATAGGATT CGACCATTTA TTACTACGTC ATATTGATCT GTCTCAAAAT ATGTTGAACT CCACAATTCC AGACACAATC GGCTCGCTTA ACGCTCTGGA GTATTTGTTT ATTGCTTTGA ACCCCTTTCT GGAACCTGCA CCGGTGCCTT CCTTCTTGGG CGAGCTGACC AATCTTGTTG ACGTGTCGTT TCAGGACAGT AACCGCGAAG GTCAGCTTCC GACAGAATTG GGTCTGTTAA CAAACCTGGT ACTTCTCGAC TTGGCTGCCA ACGAGATCGA GGGTGACATA CCGGTTGAAC TTGGCAATGC CGATAAACTA CGTTTTCTCT TTCTGAACGA GAATCATCTT TCCGGCAATG TTCCTGGGTC TTTCGAGCAA CTGACCGACT TGGTTGTTAC CATGATTGAC TCCAATGACT TGACGGGAAA CGTC
|
Protein sequence | CDPDTVKQLF LPGIGLKGSI PREIGLLTDL EKIDLYSNDL NAAIPDDMQY LEKLGSLVLY DNLLSGEIPE WIGSLSDLAT INLAKNKFEG GVPSSLFNLT KLITLNLENN DFRFQLDDLQ LQSNLQALFL GGNNIYGKLT EERVDIWEDI EVLDLSNNQL TGPLPSNLFN QEALIILDLH GNKFTGHIPD IGFDHLLLRH IDLSQNMLNS TIPDTIGSLN ALEYLFIALN PFLEPAPVPS FLGELTNLVD VSFQDSNREG QLPTELGLLT NLVLLDLAAN EIEGDIPVEL GNADKLRFLF LNENHLSGNV PGSFEQLTDL VVTMIDSNDL TGNV
|
| |