Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49099 |
Symbol | |
ID | 7195452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 584270 |
End bp | 585867 |
Gene Length | 1598 bp |
Protein Length | 499 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183773 |
Protein GI | 219127083 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0142086 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GACAAGAGTA GAAGCATTTT TGACTGCTTA CGAGTGCGTT TATTGGAGTC TTTACTCATA GACAGCAACT AACTGTAAAT CTTCGTTCAT TCTTTGCAAT GGAGGGCAAA TTGCCAAGGG AGCGGCTGTC AGAACACAAA CGCTCCGTTC TAACCAACGA TGCCGTCATC AAGCAAGGAA AGCAATCGAG TCTGCAAGAA GTTATCGAGT GGCGAGGCCG TGCTAGCAAT GTGGCCAGCG CCACGCATAG CAACGATTCT TCCAGTCATC ACGATACGTC AAACAAGTCC AAAAGTCGTC CGTCTGAGGT TGATGACGAA AAAGCTTCCG GCAGCGCCGG TATCGTCAAA ACCGGTGCTA TAGCAAGTTC GGATGAAATG GCTTTCCAGA AGGCGCAAGA ATCATCCAAA AAGATGGCTG CTAGACTTCC AACTGACCAG AAACGGCCTT CAATGTCCGA CCTGTCGTCC GCATCGTCAA TACCCGTAAG CACGGAACAG ACTGACGTTA AAAAAAGGAA AAAGAGGCGT TCCCGGGGAT GGACCTTCAA TGATGTCGAA AGCGCCACAG AACGCTCAAA CAAAAATATC AAGATTACAA AAGCAGGGCC ATTTCGGGAT CATAAGGACA ATGCTGATAA ACGCGGTAAT TTCGAAGATG TTGACGGTAA ACTGATATCT TATGATCAAG CCAGCACAGG TTCCTTAGCT GTGGAAGGAA GGCTTGCCAA GAACTCTGGA CTGAGGAGAG AAGATGCCAA GGCCGCTGAG AAACGAGAGT ACAATCGAGT GAATGCTGCC CGAGCTCGGC TCCGCAATAA AGAAATGGTC GAGGAATTGC AAAAGAACGT TATTGATCTG AATGCCCATA TTGTCGAATT GGAACGATCA AACGAAATCC TCCGGGCTCA AGTCGAGGTT TTGGGCAGCA GGAGTCAGAG TCTCCTCACG ACGAGCCAAG TACCGACAGC TGCGGCCCCT GAACAAGAGT TAGACCATGT GCAGTCTTCC ACAGCTGCAA TTGTTGCACC TAGTTTTAAC ACTCCAATTT TTTCTGTCCA GCAAAGTGGT ACAGGTACAG GGCAACCTTC CCCTCATCAA AATGTGGTTG CAGTAGAGCA ACTGTTGGCT TCGATCCTAG GAAGAAGCCT TCCCCAGTTA GAACAACCTC CATCACCGCC AGCGCTTGAC AATCTATCTT TGTTGCTAAC GCTCGTGCAA GGGAGTAATG GAGCCAGTAT AAATTCCCAG CTGCAAAGCG GTCCTCCACA ATTCAGTGGT GCCGCGATAG CTCCGCCCAC AGCACAACCA CTGCCTTCGA CGCTCTTCGC GATGAATCCT TCCTTGCACC AGCAGCAACA GACTATTCAT TCTCGGGCAA GACTCTTAAA TATGCAGCAG CCGTTTGATC CGTATGCTCA TTTGTCGAGC GCTAATTTAC AATCTGTTCT GCAAAACCTT CCCGCCGGGA CCCTTTATGC TGCTTTACAG CAACAAAGGC AATTACAGCA AGGCGACGGA CCTGGTTATC CAATCGATGA CAGCCTCCGC AACAAGAACG ATAACAAATC CTCATCATTA GGGAAAGATG GACGATGA
|
Protein sequence | MEGKLPRERL SEHKRSVLTN DAVIKQGKQS SLQEVIEWRG RASNVASATH SNDSSSHHDT SNKSKSRPSE VDDEKASGSA GIVKTGAIAS SDEMAFQKAQ ESSKKMAARL PTDQKRPSMS DLSSASSIPV STEQTDVKKR KKRRSRGWTF NDVESATERS NKNIKITKAG PFRDHKDNAD KRGNFEDVDG KLISYDQAST GSLAVEGRLA KNSGLRREDA KAAEKREYNR VNAARARLRN KEMVEELQKN VIDLNAHIVE LERSNEILRA QVEVLGSRSQ SLLTTSQVPT AAAPEQELDH VQSSTAAIVA PSFNTPIFSV QQSGTGTGQP SPHQNVVAVE QLLASILGRS LPQLEQPPSP PALDNLSLLL TLVQGSNGAS INSQLQSGPP QFSGAAIAPP TAQPLPSTLF AMNPSLHQQQ QTIHSRARLL NMQQPFDPYA HLSSANLQSV LQNLPAGTLY AALQQQRQLQ QGDGPGYPID DSLRNKNDNK SSSLGKDGR
|
| |