Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50473 |
Symbol | |
ID | 7199317 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 156950 |
End bp | 158413 |
Gene Length | 1464 bp |
Protein Length | 459 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185390 |
Protein GI | 219130475 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTACCAATC TCGTACACAC GCTAGTATGG CCAATTCTTC GGTCGGAGCG CCGTCGTCTC GTGACGCATG CAGCCAAAGA CTTCCGCACC CGTCCAGCTT TCTTTCCACT ACGACGGGGC GAAAATCGAC AAAAAGCGGT GTTGCCAAAG CCCAGGAAGC CATCGTTCTC TCCCAAGCGA CTTCCCAGGC AATCGTTGCC GCACGGTCGA TTCTGGAATC GGGCGGATCG GAAGCAACAG CTCTGTCGAC GGCCAAGGCA GCAGCACAAT CAATTCTAAT CCCCTTTTTG TCCGATACCG AACATGGAGC CAGCAAATTG TTCCTGGGAC GACGAAAAGC CAAACGACAA GCAGACATTA TTGCCTCCAT GGCTATCTTA TCCGTTACGA ATGGATCCAA CGAATCGGAG GGTTCCGCAG TCAGAGACAA TGAAGCTCCC ACGTTTATGC AACACCGAAT CGGACCGAGT GGACTACCCG GCTACTCTTG CTCCAGGAAA AAACATCCCA GTCATTGCGA GTCCTCCGTG CGACGAGGAG ACAACAATTC CACTTTGAGG ATTAAGCCCA AACACTCCTA CCCACCAGAT CGTGAGAGAC CGTATTGCAA AGAGAAGGCA CGTCACGGGG CCGGATTTTT GAATTCTAGG CGTGGACACG TGAAAGACGC CGGTCGATCC CACCCGAACG ACCTCGCGGC CCGCTATCCC GACCAGGATC CAATCTACTG CTCTCCGTCC CACAAAAGTA CACTTGCAAA GCAACGCAAC AAGTTTCCCT CCTATAGCTC GGAAACAGAC GACGAAACTA GAAGCCGAAT ACTGGGAATA AGGGGTTCAG GGAGTGCTGA TTACATAGTG TACGGCTCTG GAACGATTGA TACAACAGAT CAGCCCGTTC ATAGTTACAG CAATCCCTTT TCTTTTCTGA CTAGTCTACT GTGTGGCGTA TCGGAAAGCA GCCTCAACAT TGATGACCAT CTGGTGGACG CGTATACCGA TCAAGGAAGA TCGCGATCAG GATGACACTT TCCGGCACGA AAGCGTCAAT GAGAGGGTAG AGTCCTCCAG TTCCGCCAAC GCTAATGATG TTCTGCGCAA GCTCGTCAGT TCTTCTTCGG ATGAGAAGGG CCGCAAAGAG ACATCAATCA AACCGCGAAT TCGAGAGTCA ATTGAGCAGA CTGTACTGCG AGCACTATCA GGCGGTGTTG TGCCTCCTCA GTGGGGCGCA GAGGCCGTAG GTATAGTAGC CGATGCTCCT TCCGATCGGA TATTCCAATC ACGAGCAATC CGGGAGGAAG ATCTGTTTAC GGGTGTGCAA CCGAGCGAGC CCCCGTCACC TTTCACAGCA ACATACAGCA GAAGCAAAAT GCTGAGTCAC AAACCACGCT TCCGCAAGTG GGTCACAATG CGCAAAAAAA GCAAAGACAA TCACGAGAAT GCAATAGTTG CATTCTCTGA TTAA
|
Protein sequence | MANSSVGAPS SRDACSQRLP HPSSFLSTTT GRKSTKSGVA KAQEAIVLSQ ATSQAIVAAR SILESGGSEA TALSTAKAAA QSILIPFLSD TEHGASKLFL GRRKAKRQAD IIASMAILSV TNGSNESEGS AVRDNEAPTF MQHRIGPSGL PGYSCSRKKH PSHCESSVRR GDNNSTLRIK PKHSYPPDRE RPYCKEKARH GAGFLNSRRG HVKDAGRSHP NDLAARYPDQ DPIYCSPSHK STLAKQRNKF PSYSSETDDE TRSRILGIRG SGSADYIVYG SGTIDTTDQP VHTSTLMTIW WTRIPIKEDR DQDDTFRHES VNERVESSSS ANANDVLRKL VSSSSDEKGR KETSIKPRIR ESIEQTVLRA LSGGVVPPQW GAEAVGIVAD APSDRIFQSR AIREEDLFTG VQPSEPPSPF TATYSRSKML SHKPRFRKWV TMRKKSKDNH ENAIVAFSD
|
| |