Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47095 |
Symbol | |
ID | 7202011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 386582 |
End bp | 388144 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181369 |
Protein GI | 219122054 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAAC GCAGAAAAAT GGAAACGGTG GCATCCACGA CCATAGACGA AGCGACGTCC TCACTGGATC TTCGCATTCT AGAGCTGATA CAAGCCACTC AGGATCGCAC CATACGACCT GCCAAAGTCT CCACAGAATT GGGTATTTCT ATCAACGAGG CCACCGCGGA GCTTTGCGGC CTATTGGCTG CTGTTGGCGG TGGAATCGAT GGCGCCGCTT TTCGATTTGA ACAAGCCGAT GGAAATCCGG TGATGGTCTT TACATTTCCC GAGGACTTTC GTGCCCGAGC TCTACGGAAG CGGCGTCGAC AAGACTTTCA CGAAACAATG CAAACTTTTC TTGACATCGT TGGAAAGATA CTGAAAACGG TGACCGCTTT TGGTTTGATA CTCTCACTTC TGATTGTTTC TCTTGCTGCC ATGATGGGGC TTGTCGCAGC GGTAATCGGT TTGTCTCGAA CTGGCAATCA AGGGCATCGA AATATAGTTG TACGCCAACT CCGTTCTATG TTTTATACCA CGCGGCAGTT GCTATGGTGT TATGCTGTCT TCGGTCCTGC AGGCGATGAT GGACAAGATC TGTTCCTAAG GGAGGTTGCC TATGATACAT CTCTCGCGTG TTCCGTCTGT TATGGGAATC CGGCCAGCTT TTTTTACTGG ATCCGTGCTC AACAGCTGGC AAGACGAAGT CGAGTTCGTG GGTGGACCGC GTTTGCTCGT TCACAGGGAA CACTCACCAA TAACGAAGGA GCCGCGCTTC TTCGACCACG TTTGAGACCT AACAACCTTC CAACCGATAC TCCAGAAATC TCACAACGGA CGCTACTACC CGTCGCTGTT GAATTTTTGT TTGGGCCACC TTCGTATCTA AACGAGGAGA CCGAGAAATG GAAGCTACGA GCACATGCTT TGATCCAAAA GTCAATGATA AGTAGCAGTA GAGGCGTCTC TTTGGAAGAA ATGAGTCCTT ACGTAGATCA TCCACCAGCA ACGCTGAGTG AATCGTCGAA GATCGTAGAA CAGGGTTTAA TATTGGTTGC TTACTTCAAT GGAGTTCCAA TGAAAAATTG CACGGACCAA CCGGCCAAGG CCCTGTTTAA TTTCCCAGAA TTGCTATCCG AAAGCAGCTC GATAACTAAA TTTGATTCTT CACCGGTGTA TGACGATGAT GGAAGTTGGA GCTCTGTTTT GTACGCCAAG GAAGCAGGCA GCGGACGCTT ACGAAGCTCA TTTGATGTTG CAGAGAGCAT AGAAGAACCT CCTTTACGAT TCACTCAACT TCCAAAGAAA GACTTCGTGA GGTGCATCGG TCTAGGGCTC CTAAACTTGA TTGGAGTGTT ATGGTTAGGA CAGTCAATTG GAGTGGGAGG TGCGCTGGAG TTGAAGAGCG GTATTTTGCT GGAACGGTTT TTGCGTAGAT GGGTGGTCCC TATTCTTCAG TTCTATGGAT TTCTGTTCTT GGGCCTACCT GCCGGTCGGC TTTGCATCGT TATTCTGCGA AACAAGCATC GATATGTGCG CAACAGGAAG CGCCGCTCTC TAGTTAAAGA GCTTACAGCG TAA
|
Protein sequence | MTQRRKMETV ASTTIDEATS SLDLRILELI QATQDRTIRP AKVSTELGIS INEATAELCG LLAAVGGGID GAAFRFEQAD GNPVMVFTFP EDFRARALRK RRRQDFHETM QTFLDIVGKI LKTVTAFGLI LSLLIVSLAA MMGLVAAVIG LSRTGNQGHR NIVVRQLRSM FYTTRQLLWC YAVFGPAGDD GQDLFLREVA YDTSLACSVC YGNPASFFYW IRAQQLARRS RVRGWTAFAR SQGTLTNNEG AALLRPRLRP NNLPTDTPEI SQRTLLPVAV EFLFGPPSYL NEETEKWKLR AHALIQKSMI SSSRGVSLEE MSPYVDHPPA TLSESSKIVE QGLILVAYFN GVPMKNCTDQ PAKALFNFPE LLSESSSITK FDSSPVYDDD GSWSSVLYAK EAGSGRLRSS FDVAESIEEP PLRFTQLPKK DFVRCIGLGL LNLIGVLWLG QSIGVGGALE LKSGILLERF LRRWVVPILQ FYGFLFLGLP AGRLCIVILR NKHRYVRNRK RRSLVKELTA
|
| |