Gene Information |
Locus tag | PHATR_46834 |
Symbol | |
ID | 7204688 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 583088 |
End bp | 584916 |
Gene Length | 1829 bp |
Protein Length | 562 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185910 |
Protein GI | 219121370 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GAGTCTCGGG TGTGTGGCAG GATCGTTTCA TTGAATGTCC CGAAGCCTTC GTCACCGACC CGTTACTCAC GAACACGAGG AAGAGCAAGA AGTCTCCCCG AAACTCCGCC GCTACAGTAC GGGTTCGTCC ATGGCCTACA GTCGCCGTCC GGTCATCGCC TACGCGCGTG GAATAACGGG GGGTACCAAC GCGGCGACCC GACCGCGAGC CAACGGCAAT CTCTCGGTCT ACCGCAACAA AATCGGGCTC GTACTACTAG CCACCGTGGC TTACACGGCT TTCCTGTACC GATCGGGTTC GTATTTGATG AATGCGGGCG GTCTGCCCAG CCAATGCCTT TCTTGATTCC TTTCGATTCC GCGAGCTTTT TCTCACACGC GATTGTTTGG TTGCGAATGA TAGGTGGGGC ACGCAGCTTT CTCGGCAACG CGACGTTGGG TATGGCGGCC ATGTCGAAGG CTGGGCGTCG CCTGCAGGTA TCTACTTGGA ATATTGCGGC TATCAACAAC AATCCCTTCG AGTACTGGAT TACCTACAAC AAGGAATACG AGGATCTCAT GGTCAAGGTG GAGTCATTCT TGGATTCTCC GGGTGACAAA GATGTGGCCG TTCCAAAAGT CTTTTCACAG AGCATGTTTG ATCAGTTGGA TGCACGCATG ACTAGCGTAG GTTGGCAATC CGTCAAGCCA TACTGGGAGA ATGATTTTTC CAAGCGCACG ATCGTCGAAG GTTTCATGAA AGACGGTTTG CTAGGACTCA AACGTCTGGC ATCGATGCCA GATCGAATGA CGAATACCAT AAATACGGAA GATGGCAACA TGGTGTTCCG TCCAACGATT ATCAACATGT ACGAAGACGA TCTAAGTTCT CAGGAAATTT GGTGGGAGAA GTGGAGTTCG TTCATGTTCG ATACTAAATT GAAGATTAAA GGCAAGGGTG ATGTAGTACA AGAACAGATG CCTTACCAAA TGCTTCAAAA GATCAAACGC TCCAAGTACA AAGCCGTTAC TGAGGAAGAA GAGAAAGACT CTCTTCCTTT GCAAACAATG TGTGGAGCCA TTTTTGACGC AATTCTCGTT CACATGATGA ATACTGTTTC GACTCCGGAT ACGTGGCAAC CTTTAAAGCG AACCATGGTT CAAGCGCTTA ACAAGCAAAA GGTCCCAAAT ACACTCAAGA TCATGGAAAA ACACTACATG GACTCCGATA TAATTACTCT GCAAGAAGTC TCCGCTAGTT TAATTGATCA AGCGCGAAAG TCGGCAATCG GGAAGAAGTT CCATATCATC GCGCCCGCAA ATCTGGACGC TGTGCGAGAT CAAAACTCCG TTGTTTGTTT AAAGCGCTCA ACGTTTCCGG ATGGAGCAAT CGAAGAGATC ACAGCAAAAG TGGAAGCGGC ATTTCCTAAG GATGAGGACA TACCGGTATC CAACGGAGAC ATTCTCGCTG TCACGACCAA AAGTATTGAC AACGTGCCGT TTGTGGTTGC CTCATTTCAC GGAGACACAA ACGGTCTCGC TACCAAGCCC GTGGTGGACG CTATCCTGAA AGCAATGGGT GCGGACTCAA AACTCGTCTC ACATCGTCTA GTTTTTGGTC TAGATGCCAA TACTTACGAA AAAGCGATTC CTGATAGTCA GCAAGATGTC TTGGATTTTG GGAAGTCGTA CGTCAAGCAT GGCCTGACTT CGTGCTGGGG TGACGTGCCT GATCCGAAAA ACTATACTAC TTTTAACGCC CGCACCTTTC TACAACCTCA GCTTAACAAA GCTTGTCGGC GAGATGAGAA ACGCAGTAAC 
GGAGATGTGA ATCCCAAGGA TTTTATTTT
|
Protein sequence | MSRSLRHRPV THEHEEEQEV SPKLRRYSTG SSMAYSRRPV IAYARGITGG TNAATRPRAN GNLSVYRNKI GLVLLATVAY TAFLYRSGGA RSFLGNATLG MAAMSKAGRR LQVSTWNIAA INNNPFEYWI TYNKEYEDLM VKVESFLDSP GDKDVAVPKV FSQSMFDQLD ARMTSVGWQS VKPYWENDFS KRTIVEGFMK DGLLGLKRLA SMPDRMTNTI NTEDGNMVFR PTIINMYEDD LSSQEIWWEK WSSFMFDTKL KIKGKGDVVQ EQMPYQMLQK IKRSKYKAVT EEEEKDSLPL QTMCGAIFDA ILVHMMNTVS TPDTWQPLKR TMVQALNKQK VPNTLKIMEK HYMDSDIITL QEVSASLIDQ ARKSAIGKKF HIIAPANLDA VRDQNSVVCL KRSTFPDGAI EEITAKVEAA FPKDEDIPVS NGDILAVTTK SIDNVPFVVA SFHGDTNGLA TKPVVDAILK AMGADSKLVS HRLVFGLDAN TYEKAIPDSQ QDVLDFGKSY VKHGLTSCWG DVPDPKNYTT FNARTFLQPQ LNKACRRDEK RSNGDVNPKD FI
|
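The derived fields in this record (gene length, GC content, protein length) can be cross-checked against the coordinates and sequences. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the sequences are pasted as space-separated blocks of ten characters; the helper names `span_length`, `clean_sequence`, and `gc_percent` are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Coordinates taken from the record above.
print(span_length(583088, 584916))  # 1829
```

Passing the full gene sequence to `gc_percent` and the protein sequence to `len(clean_sequence(...))` gives values that can be compared with the GC content and protein length fields.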