Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48137 |
Symbol | |
ID | 7203289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 304687 |
End bp | 305885 |
Gene Length | 1199 bp |
Protein Length | 379 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182509 |
Protein GI | 219124435 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACAACAGAAG AAGCAACAAC AAAAACAGCG GAAAGCCCCA CGAGAGTCTG CGTTTCGACA TGTCGTATAC GCAAGCCCCT CCTCCGCACG GCGATGACCT CCTTCGCTCA ACAAGTACCC TCGCGCTCAA CAACGTCGGA GTCGCACTCC TAGAGCGATC CTGCTTCCGT CAAGCTATGG GCACTTTCCG GGACGCCCTG GCGAACCTCC AAGACATTAA CAGAAGTTTT GCTAAGTTGC GCCTCCAACA CGCGACGCAC CGCTTGGCCT CTCCCCAAAT CGATCTGCCC CGCGAAAGTG CCAACCCACA TTCCGACAAA CTTCATGCAT TCCGTATCGA AACGCTTTCG GACACGGATT CCGTACCAAG CCTGTTGGCC CGCATGACGC AAACATTGGT ACCGACGACT GCGCCGCTAG AGCCTTCGCC TTCGTCGTGC GATATCCAGA AAGCATATTC GACCAACTTA GTTCCCCGCA CCTTGATCCT ATGCCCAATC CGCATGGAAC AGATAAATGT AGGAGAAGGT ACTAAGCTTG CCGTCCTGCT CTACAACTTT GCGGTGGCGC ATCTGTGCTT GGCAGAGACG ATCACGACGA CCATAGTCAG TGATGACGAC GATGAGGAGG AAGGCGACGA AGATACCAAC TACCACGAAC AGCAAGATAT GGACTGTCTC ACTTCAGAAC AAGATTTTCT GTACGATAAC GCCTTGCAGA TCTTAAATCA CATCAAGGAC ATGCTATATT CGTGGATTGC CAACGAAAGG CGCGCCGATT TTAACATGCC ACACGCGTCG GGGGCAACAT CCGTGCAACA TGTCACGTCG GGAGCCGGCG AAAGCCGATG GTACGGAAAT TGTCTCCCGT ACCTCTACGC GGCTCCACCG GAAGTTGCCT ACTTGCTCGC CGTTGGCTGC GACCCTGGTC GCTTGTTGCG GCAGTTGTTG CAAATATTGG TATGGGACTA CACTGGTCAC GTGTTCGCGG CCAACGAATT GCACGGTGAC GCCGATGCCG CGTTTGAACA GACCTGGTAT TTGAGGGAAA CGGCGGCTCA GTTGCAAATG TCTTTTTATC AAAACAGCAG CCTCGCGATG ACGATGACTC GAAGGGGATG GCGTTTCCCT AATCGCGAAA TCAACACTTT TCAGGATCCA ATCAGCGTCA TCGCAACCGG GACCGCTGAT GCAGCATAG
|
Protein sequence | MSYTQAPPPH GDDLLRSTST LALNNVGVAL LERSCFRQAM GTFRDALANL QDINRSFAKL RLQHATHRLA SPQIDLPRES ANPHSDKLHA FRIETLSDTD SVPSLLARMT QTLVPTTAPL EPSPSSCDIQ KAYSTNLVPR TLILCPIRME QINVGEGTKL AVLLYNFAVA HLCLAETITT TIVSDDDDEE EGDEDTNYHE QQDMDCLTSE QDFLYDNALQ ILNHIKDMLY SWIANERRAD FNMPHASGAT SVQHVTSGAG ESRWYGNCLP YLYAAPPEVA YLLAVGCDPG RLLRQLLQIL VWDYTGHVFA ANELHGDADA AFEQTWYLRE TAAQLQMSFY QNSSLAMTMT RRGWRFPNRE INTFQDPISV IATGTADAA
|
| |