Gene Information |
Locus tag | PHATRDRAFT_35668 |
Symbol | |
ID | 7201098 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 481169 |
End bp | 483249 |
Gene Length | 2081 bp |
Protein Length | 682 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | beta-xylosidase |
Protein accession | XP_002180246 |
Protein GI | 219118959 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
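The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (this convention reproduces the reported 2081 bp) and that the gene span includes the stop codon (the listed gene sequence ends in TAG). The function names `span_length` and `coding_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 481169
END_BP = 483249
PROTEIN_LENGTH_AA = 682


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(START_BP, END_BP)      # genomic span of the gene
cds_len = coding_length(PROTEIN_LENGTH_AA)    # coding bases including stop
non_coding = gene_len - cds_len               # bases in the span that are not translated

print(gene_len, cds_len, non_coding)  # 2081 2049 32
```

Under these assumptions, 32 bp of the span is not translated, which is consistent with the "Is gene spliced | Yes" field.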
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.12421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTCTT GCTTCAGGAT ATTGACTATA TGCACGTCGT ATTTCTTCTT ACTGCTTGCG CTCTGGGCAT CGTACTCCAT TTTTGAAGGC TCTGTTCGGC TTCAAGGTCC AATCGGGCTC GACCGCAACG TCAGCCCCGT CAACAACTAT ATTTGGAAAG ACAAAGTCCC GAATTTTTGG GGATGCCAGA ATGACGTAGC CAAAGCACTG CCTTATTGCG ACATGTCGCT CTCAATTGAC GAGAGACTGG AGGATTTGCT CTCACACTTG ACCCTGGATG AGAAAGTTGA CATGATCGGC GCCGACCCTA CGCAAGACGT TTGTATGACT CATACTATGA ATGTGTCTCG CATAGGCTTA CCAGACTACT ACTGGCTCGT GGAAACAAAT ACGGCCGTTG GATCGGCATG TATTGCCGAA AACAAATGTG CGACTGAGTT TTCGGGCCCG TTATCGATCG CCGCTTCTTT CAATCGATCA TCCTGGTTTC TCAAAGGTAG CGTTTTTGGC ACCGAGCAAA GGGCGCTGAT GAATGTCCAT GGCGAACGAT TTCATACCCA TAGCGGCCGA CATATTGGTC TGACAGCGTT CGGCCCAAAT ATCAATCAAC AACGCGATCC GAGGTTCGGG CGCTCATCGG AGTTGCCGGG GGAAGACCCG TTTCTGTCGG GGCAGTACGC CGCGCACATG GTACAGGGTA TGCAAGAGCG AGATGCCAAC GGATATCCTA AAGTTTTGGC GTATCTGAAG CATTTTACGG CGTACAGCCG AGAGGAAGGG CGCGGGAACG ACGACTACAA TATTTCGATG TACGATCTGT TTGATACATA TTTGCCCCAG TACGAAATGG GCATGGTCCA AGGCGGAGCC ACCGGAGTTA TGTGCTCGTA CAATGCTGTC AATGGTATTC CCGCGTGTGC CAATGACTAT TTACTCAATA AAATTTTGCG GCAACGCTGG AATCGTTCCG ATGCGCACGT GACGACCGAC TGTGGGGCGG TGAACAATCT GCGTGGCAAA CCAATCCAGG CGGCCGATGA AGCGCAAGCT GCCGCAATGG CACTCATGAA TGGCGCGGAT ATTGAGATGG GATCAACCTT ATTTGTACAC AATCTCACTA CTGCTATAAC ACTGGGATAT GCGACCGAAG AAGCAGTCAA TCAAGCTATT CGTCGTTCAT ATCGTCCTCA TTTTATTGCG GGTCGCTTCG ATGATCCTAC CTTGAGCGAA TGGTTCAGTC TAGGGCTAGA CGACATTCAG TCAAAAAAGC ACCAGGAGAT CCAATTGGAA GCAGCACTTC AAGGACTAGT TTTGCTGAAA CATGAGGACA GCATTTTGCC TATTGCTGCG GGCACTAAAT TGGCGGTTCT AGGTCCATTA GGAATGACGC GGTCCGGCCT GATGAGCGAC TACGAAAGCG ACCAAAGCTG TTTTGGTGGC GGGCATGATT GCATACCAAC GTTGGCCGAG TCAATCGGAT TCATAAATGG AAAGGAGTTC ACCGTTGCAG CTGCTGGTGT CGATGTGGAC TCTCGCAATA CATCGGATGT TGAGAGAATC TTGCAGCTTG CCGCTGACAG GGATCTTATA GTGCTTTGTC TCGGGAACAC AAAAACTCAG GAGCAAGAAG GATTTGATCG CAAGGACACA GCTTTGCCGG GTCAACAATA CGCCTTGTTT GAGGCCGTAC TTACTCTTCG CAAACCTGTA GTTCTTGTTT TGGTAAATGG TGGCCAGATC GCGCTTGACG GAATGACCGG ATACCCTTCG GCTATCATTG AAGCCTTCAA TCCCAACGGT 
ATTGGCGGGA CTGCCTTAGC TGCGTCTCTA TTTGGTCAAG AGAATCGCTG GGGGAAACTT CCGTACACAA TATATCCGTA CAGTGTGATG CAGTCGTTCG ACATGAAAGA CCATAGCATG TCAGCCCCGC CGGGCAGGAC GTATCGATAT TTCACGGGAA AAGCAACGTA TCCATTCGGA TACGGTCTTT CACTCACAGC ATTCGAGACA TCATGCTTTC ATCAAAGGAT CAGCGATTCC TCGATTCTAC TGGAATGTAC TGTCTGGAAC ACTGGAAATA G
|
Protein sequence | MPSCFRILTI CTSYFFLLLA LWASYSIFEG SVRLQGPIGL DRNVSPVNNY IWKDKVPNFW GCQNDVAKAL PYCDMSLSID ERLEDLLSHL TLDEKVDMIG ADPTQDVCMT HTMNVSRIGL PDYYWLVETN TAVGSACIAE NKCATEFSGP LSIAASFNRS SWFLKGSVFG TEQRALMNVH GERFHTHSGR HIGLTAFGPN INQQRDPRFG RSSELPGEDP FLSGQYAAHM VQGMQERDAN GYPKVLAYLK HFTAYSREEG RGNDDYNISM YDLFDTYLPQ YEMGMVQGGA TGVMCSYNAV NGIPACANDY LLNKILRQRW NRSDAHVTTD CGAVNNLRGK PIQAADEAQA AAMALMNGAD IEMGSTLFVH NLTTAITLGY ATEEAVNQAI RRSYRPHFIA GRFDDPTLSE WFSLGLDDIQ SKKHQEIQLE AALQGLVLLK HEDSILPIAA GTKLAVLGPL GMTRSGLMSD YESDQSCFGG GHDCIPTLAE SIGFINGKEF TVAAAGVDVD SRNTSDVERI LQLAADRDLI VLCLGNTKTQ EQEGFDRKDT ALPGQQYALF EAVLTLRKPV VLVLVNGGQI ALDGMTGYPS AIIEAFNPNG IGGTALAASL FGQENRWGKL PYTIYPYSVM QSFDMKDHSM SAPPGRTYRY FTGKATIRDI MLSSKDQRFL DSTGMYCLEH WK
|
| |
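The sequences above are shown in space-separated blocks of 10 characters. A minimal sketch of how such a field could be cleaned and its GC content computed is given below. Only the first three blocks of the gene sequence are used as sample input, and the helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three 10-base blocks of the gene sequence, as displayed in the record.
sample = "ATGCCTTCTT GCTTCAGGAT ATTGACTATA"

seq = clean_sequence(sample)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Applying the same two functions to the full gene sequence field should give a value comparable to the "GC content" entry in the Gene Information table, assuming that entry was computed over the same span.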