Gene Information |
Locus tag | PHATRDRAFT_50271 |
Symbol | |
ID | 7199035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 155374 |
End bp | 156653 |
Gene Length | 1280 bp |
Protein Length | 421 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185219 |
Protein GI | 219130117 |
COG category | |
COG ID | |
TIGRFAM ID | |
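The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive coordinates (the record does not state the convention, but the listed 1280 bp is consistent with it). The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1

gene_length = span_length(155374, 156653)  # Start bp and End bp from the record
protein_length = 421                       # Protein Length (aa) from the record
coding_length = 3 * protein_length + 3     # one codon per residue plus a stop codon

print(gene_length)                  # 1280
print(coding_length)                # 1266
print(gene_length - coding_length)  # 14
```

Under these assumptions the gene span is 14 bp longer than the coding region implied by the protein length, which is consistent with a few bases of the listed gene sequence lying past the stop codon.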
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0291461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTACG GTATTCGTTT CCGGAACAAA AGAAAAATTT GGGTTCAACC CTGGGCATCA AACCTACGCT ATCGAACGAT GCCATCCGTA AGACACCGAA CCGATGAAGC GATCGACCCC AGAAAGACTT CGCCATTGCT GATTCTTGCT CTTGACATCG GTGTTGGCAG TGCTCGGTTA TTGCTTTTTT TCAGCGTTGA AAGATTCTTT TTCGCAGCTA TGGTTGGGTT GCGGAAACTT GACGCGTTCG TCAAGACGCG GCCCGAATTG CGATCACAAA GTGCCGTTGG TGGAATGATC ACACTGGTGG CTGCGACGGT GTCAGCCTTT TTGTTTGTGG GTCAGATTAT TCATTACATT ATTGGAAATC CGAAAGACTC TCTTCTGCTT TCCAAATCCG TATCAATTCC GCTCATTCCT CTCACCAGCA ACTACCTGAC AACAAAGATT CTGGAACGAG CAGCCAAACT CCCTTTGGAT ATGCTGATAA CTTTCCCCTA TTTACATTGC AGTCAGCTGG ATTTCAATCA CGATGGAGCA TCGCTGGCAA CAAGCGAATT CCAAAAGCTA CATCCCAAAC ACTCTCTCAC GATGCGAACA CCATTCCAGC ACGAATTATC AACAGCAAAG TTTGAAACCA AAAAGGGACA GGGTTGTACC ATCGAGGGAC ACATCCGTGT ACCTGTGGTC GCAGGAAAGT TCGAGATTAC TCTCAACAAG CGCACGTGGC AGCAAGCTGC CAGTATTCTG AATCGCCAAA TGTTGATGCA AGTTCTGGGT GCCACATCCG AGCACACTTC ATCCAATGAC GAGCTCGGTG ACCGCTACAA CTCCACACAC TTTATCCACT ATATTCGTTT CGGAGATTCC TTTCCACTCA ATATAGAGAA GCCCTTGGAG AAACGACGTC ACATCTTCCG TAACAAGTAT GGCGCAATGG CGGTGCAAGA GATGAAGATC GAGCTCGTAC CCACCTACAC GTCCACATGG TTGCCGACGT CCAGTCGACA AACCTACCAA GCGTCCGTTG TAGATAGTAC GATAGAACCG GAGCACATGG CGCAAGCCGG TGCCTCTTCG TTGCCTGGCC TTGCTGTCCA GTATGACTTC TCGCCGTTGA CAGTTTATCA TACCGGTGGT CGTGACAACA TATTGGTGTT TTTGAGTTCA CTGGTGAGCA TTGTGGGTGG TGTCTTTGTT ACCGTCGGCC TCGTGAGTGG CTGTTTGGTA CATTCGGCTC AGGCTGTAGC GAAGAAGATA GACTAACGTT AACATATTTC
|
Protein sequence | MGYGIRFRNK RKIWVQPWAS NLRYRTMPSV RHRTDEAIDP RKTSPLLILA LDIGVGSARL LLFFSVERFF FAAMVGLRKL DAFVKTRPEL RSQSAVGGMI TLVAATVSAF LFVGQIIHYI IGNPKDSLLL SKSVSIPLIP LTSNYLTTKI LERAAKLPLD MLITFPYLHC SQLDFNHDGA SLATSEFQKL HPKHSLTMRT PFQHELSTAK FETKKGQGCT IEGHIRVPVV AGKFEITLNK RTWQQAASIL NRQMLMQVLG ATSEHTSSND ELGDRYNSTH FIHYIRFGDS FPLNIEKPLE KRRHIFRNKY GAMAVQEMKI ELVPTYTSTW LPTSSRQTYQ ASVVDSTIEP EHMAQAGASS LPGLAVQYDF SPLTVYHTGG RDNILVFLSS LVSIVGGVFV TVGLVSGCLV HSAQAVAKKI D
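The relationship between the two sequences above can be spot-checked by translating the start of the gene sequence. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1), since the record's Translation table field is blank. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database API.

```python
# Standard genetic code (NCBI translation table 1), assumed because the
# record does not list a translation table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 0; whitespace is ignored and '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for codons with non-ACGT symbols
        for i in range(0, usable, 3)
    )

# First three 10-base groups of the gene sequence in the record.
first_30 = "ATGGGGTACG GTATTCGTTT CCGGAACAAA"
print(translate(first_30))  # MGYGIRFRNK
```

The result matches the first ten residues of the listed protein sequence. Although the Strand field is `-`, the gene sequence starts with ATG and translates directly, so it appears to be given in coding orientation and no reverse complement is applied here.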