Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_16803 |
Symbol | |
ID | 7199092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 63324 |
End bp | 64610 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185115 |
Protein GI | 219129900 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCGC TGGGTCCGGC CTTTGTAAAA CTCTGCCAAT GGGTGGCAAC TCGGCGCGAT ATTTTTCCTC CGCACGTATG TGATCGTCTG TCTAGTTTAC ACGATCGTGG CATTCCACAT TCGTGGAAAT ACACCGATCA AATATTGAGA GAATCATTCG GGAGTGGTTA TGAAACTCGT GGCTTAAAGG TAGACAGCCA AAGCGTAATT GGTTGTGGAT CAGCTGCGCA AGTGTATAGC GGCACGTTGT CAACTACGCT GAAAGATGGA ACGGAGGAAA GAACGCCCGT TGCCATCAAG GTTTTGCACC CTCGCTTTGC TCAGTTGGTG GAACGAGACT TATGGTTTAT GCAATCAATT GCAGATCTAG TGCACAGTTT GCCCTTTCAA CATATCAAAA TGGTGAATCT ACCGCGAGCT ACGCAAAACT TTGGCGCAGT ATTACAGCGA CAAGCCGACC TGCGGATAGA AGGGAATAAT TTGAAAACAT TTCGGAACAA CTTTTACAGA AATCGAGAAG ATGAATACAA CTCTGCCATA CTTTTCCCCA AGCCAGTGGA CGAGTGGACG ACAAGGACAA TCTTGGTTGA GGATTTAGTC CGAGATGCGA CTCCGATTTC GGACTACTTG CGGGATTCGT CGGATTCAGG GAAAGAAATA CGGAAGGAGT TGGCTGGACC TTTGCTACGT GGCTTTCTCA AGATGGTCTT TCTTGACAAT TTTGTACATT GTGACTTGCA TCCTGGCAAT GTGCTTATTC AAACGTCGCA GGTTAAGCCA GAACCACCGA CATTTCTTGG ATTTCCTCTT AATCCCTTCT CGGACACGGG AGAGAAATTT GAAATAAAGC GAAGTATTGT CTTTCTGGAT GCTGGTATCG CCACGTCTCT AAGTTCAAAT GATCAGCGTA ACCTGAAAGA CCTCTTCCGT GCCGTCATTA CCAATGACGG CGAGCGAGCA GGACGTCTTA TGGTGGAACG AGCCAAATTT GAGCGTTGTA GCCTAGTGGA AGGCGGAGTC GACGCGTTTG CAGGTGGGAT TCAGGAACTG GTTTCGGAAT TTCACGATAG GCGAAAAGAA GGGTTGACAC TTGGAGCTGT GCGTATTGGA TCACTCCTTA GCCGCGTGCT GGATCTCTGT CGCGTGCATG GAGTGGAAAT TGATCCGGCC ATGGCAAGCA TTGTTATTAG CACACTGGTT TTGGAAGGTT TGGGACGGTC ATTGGAGCCC AGCCTGAATT TGATTGACTT TGCTCTTCCA TTCGTACTAG GTCGAGGACG TGTATAG
|
Protein sequence | MQSLGPAFVK LCQWVATRRD IFPPHVCDRL SSLHDRGIPH SWKYTDQILR ESFGSGYETR GLKVDSQSVI GCGSAAQVYS GTLSTTLKDG TEERTPVAIK VLHPRFAQLV ERDLWFMQSI ADLVHSLPFQ HIKMVNLPRA TQNFGAVLQR QADLRIEGNN LKTFRNNFYR NREDEYNSAI LFPKPVDEWT TRTILVEDLV RDATPISDYL RDSSDSGKEI RKELAGPLLR GFLKMVFLDN FVHCDLHPGN VLIQTSQVKP EPPTFLGFPL NPFSDTGEKF EIKRSIVFLD AGIATSLSSN DQRNLKDLFR AVITNDGERA GRLMVERAKF ERCSLVEGGV DAFAGGIQEL VSEFHDRRKE GLTLGAVRIG SLLSRVLDLC RVHGVEIDPA MASIVISTLV LEGLGRSLEP SLNLIDFALP FVLGRGRV
|
| |