Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_52108 |
Symbol | |
ID | 7204628 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 280077 |
End bp | 283174 |
Gene Length | 3098 bp |
Protein Length | 974 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | glycosyl hydrolase/mannosidase |
Protein accession | XP_002185676 |
Protein GI | 219120889 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTACGTTC ACATTGTTCC ACATACGCAC GACGATGTGG GATGGCGTAA AACCGTTGAG CAATATTACT ACGGCCTCAA CAATTCCATC GATACTAGAG GTAGCGTACA CTCCATCATC ACGACCGCCG TGGAATCACT TTTGGACGAT CCAGCGCGTA CCTTTACGTA CGTGGAAATG AAATTTCTCT CCATATGGTG GAAGTCTCAA AGTGACCAAA TGAAAGACAA CGTTCGCTAC CTAATTGCCA ACAAGCAGCT ATCGATTGTC AACGGAGGTT GGTGTATGCA TGACGAAGCG GCGCCCCACT ATATCGGGAT GATTGATCAG ACAAGCTTGG GACATGAATT CCTTACTCGT GAGTTAGGAG TGATACCTAA GGTTGGCTGG CAACTCGATC CTTTCGGTCA CTCCGCGACG CAGGCATCCT TGATGTCGCG TGGTATGGGA TTCGATGCAC TATACTTCGG TAGGATTGAT TATCAGGATT TACGGCTGAG GCAGTTAACA CGTCAGTGCG AGGGTTTGTG GAACGCATCA AGAGAGTCGT CAGACATCGC ATTACCAATC TTTTGGGGGC TGACGGGAAG CTATGGAGGA AACTATGGTG CCCCGCGAGG TTATTGTTTC GACGTATTGT GTCAAGATGA ACCTCTGGTC GGTGCCAACG AAACGAGACT ACGGGAACGG CTTCGTACGT TTTTGGAAGA TGTGCGTATA CAGAGTGATC GTACCCAGGG TGATCACGTT ATGGTGACAA TGGGTGAGGA CTTTAACGTA AGTGGTGGGG GTGAAAAGAT CGATGCAAGG ATGCTGCCAA ATTGTCAGTA GGATTCCTGA CCTGCATTTT TATCTCCAAA TTCCAGTACA TACAAGCGCA TTTAAATTTT GCAAATATGG ATCTATTGAT AAACTCGATT ATGAGCTACC AGCATTGGAA GATTCTGGAT ATACCGTCTA TCTTTGGTCC CCAGTACGAT CGAGTCGACA TCTTCTATTC AACACCGGAC TACTACACAG AGATGAAGTA CAAAGAGACG GTCCGTGTTC GCAATAGACC GCGGACTATG CAACGCGGGA CATCCAATTC AGCTGTAGGA CAACAAGGAC AACACGTGTC GGAGAGTCCA GTCTGGGTAG TCAAAACGGA CGACTTCTTC CCGTATTCTG ATTGTGAGCA TTGTTTCTGG ACAGGTTATT TTACCTCTCG TGCATCTTTC AAACGGTTTG AACGTGTTTC GTCTTCTTTT CTTCTGGCTG CCCGACAAAT TGAGGCATTG TGGAGTGGAC AGAGTAACAG CACGGGTCAA GGTATGGAAA GTCGTCCCCT GTTTGCACTC GAGGATGCTT TGGGTATTGC GCAGCATCAT GATGCCGTGT CTGGTACGGC AAAGCAACAC GTAGCGGATG ATTACAGTTT CAAATTACAA ACTGGTTTGG ATCTTGCTTC CAAGTTTGTG GCAAAGACAT TAAAGAATAC GCTGATATCG GACTCAGGGC TGCTGGAAAA CTTGACTTTC TGTCACCAGT TGAATGAATC TATTTGCGAT CTTTCACAGG ATGCCACGAA GTCTCTTGGC AAGGATCTTT ATGTTGTTGT TTACAATGCA AAAGCCTCGG AGGTATCCAG CATTATCCGA CTACCGGTCT CTACTAACCA AACGTACCTT GTTGAGCGTG TGGAGCGTAA CGCAACGGTC GCGAGGTCGA GACTTGTGGA GACTGTACAA GCCGTCAACC TCCGGACTAC TGACAAACCG AGGTATACCG TTATGTTTGA CACTGGTCCC CTTCCTCCCA TTGGTGTTGC TTTATTCCGC GTATCTATGA CAAATAAAGT CTTCTCTAGT TCTTTGAGAT CGAATGATTT GACGGAAACT CGACGTCTCT TTCGAGCTGC CGACGGCAAA GATGTTGTAG TATCGAATGA ACTTCTCTCA GTAACCTTTG ACAGCTCTAC TGGAATGATG AAACAGGTAT TCTCTCAGAA TGTGAGCCTT CTGCTTACGC AGGAATGGGG GTACTACACG TCTTTCGACT CGGACTATGA TAGAACGGAA GTGCCCTCTT CACGTGCCGA TCAAAATTCA GGAGCCTATG TGTTTAGACC AAGCACTCCT GAAGAAAAGC TTAAGTTGAT GAATGCGTCG CCTAGCGGTG CAAAGTTCGT CGATACGTCG GTGGGAACTG AAGTACACGT AGCATTTGAT GCGCCCTGGA TTAGGCAAGT GACTCGCATA CTGAAAGGTC AGCCATACGT GGAAATCGAG TATACCATCG GTCCAATTCC GATCGATGAT GGGCGCGGAA GGGAGATTGT GAATCGATAC ATAACTCCAA TTAAATCCGA AGGCAAGGTG TACACAGACT CGAACGGTCG CGAGTTTCTG GAGCGACGCC GCAATTACCG TCCAAGTTGG TCTTTGGAGG TTTACGAACA AGTGGCCGGG AACTACTACC CGATCAACGC AGCCGCTTAT ATTGAAGACA GCGACGCGGC TTTGTCAGTG GTCGTGGATC GGTCCCAAGG TGGTGGCTCT ATTATAGATG GAACGTTGGA GTTCATGGTG CAGCGTCGGA CTGTGGCGGA CGACTTTCGG GGAGTAGACG AACCGTTGAA CGAGACGTGT GGCGGTATGG AGCCGTATCC TCCGTACGGG GACGCAAAAC GCGTGGGTGA CGGCGTGGTG ATTCGTGGCG TCCATCGGCT ATTGGTGGGA GCAAAAGGGG CTGGGTTGGC TAGATCCCAG ATGGACGCCA CGTTTGCGGA GCCATTAATC TTTGTGGCAT CGTCGCCCAA ACCGTCTACA CCAGTAGGCG CACTAGTCGC TACGCAATCA TCTCTTTCCG CTCTGCAATT ATCGTTGCCA TCAAATGTAA TGTTGATAAC GTTGATGCGT CTTCAGGATC GGGAGCGACC GACTTGGCTC CTGCGATTGG GTCATCAGTA CGCTGCGGGT GAACACGAGG TTCTTTCTCA GCCGGCCAAG GTAAATCTCG CCACGTTTCT AGTGGATTGG GATGTGTCCA AGATCGAGGA AAAGACCCTG TCGGGCAATC GGGATTGGGA CGTTTACACG AAAGAACGCT ACGACTGG
|
Protein sequence | LYVHIVPHTH DDVGWRKTVE QYYYGLNNSI DTRGSVHSII TTAVESLLDD PARTFTYVEM KFLSIWWKSQ SDQMKDNVRY LIANKQLSIV NGGWCMHDEA APHYIGMIDQ TSLGHEFLTR ELGVIPKVGW QLDPFGHSAT QASLMSRGMG FDALYFGRID YQDLRLRQLT RQCEGLWNAS RESSDIALPI FWGLTGSYGG NYGAPRGYCF DVLCQDEPLV GANETRLRER LRTFLEDVRI QSDRTQGDHV MVTMGEDFNY IQAHLNFANM DLLINSIMSY QHWKILDIPS IFGPQYDRVD IFYSTPDYYT EMKYKETVLW VVKTDDFFPY SDCEHCFWTG YFTSRASFKR FERVSSSFLL AARQIEALWS GQSNSTGQGM ESRPLFALED ALGIAQHHDA VSGTAKQHVA DDYSFKLQTG LDLASKFVAK TLKNTLISDS GLLENLTFCH QLNESICDLS QDATKSLGKD LYVVVYNAKA SEVSSIIRLP VSTNQTYLVE RVERNATVAR SRLVETVQAV NLRTTDKPRY TVMFDTGPLP PIGVALFRVS MTNKVFSSSL RSNDLTETRR LFRAADGKDV VVSNELLSVT FDSSTGMMKQ VFSQNVSLLL TQEWGYYTSF DSDYDRTEVP SSRADQNSGA YVFRPSTPEE KLKLMNASPS GAKFVDTSVG TEVHVAFDAP WIRQVTRILK GQPYVEIEYT IGPIPIDDGR GREIVNRYIT PIKSEGKVYT DSNGREFLER RRNYRPSWSL EVYEQVAGNY YPINAAAYIE DSDAALSVVV DRSQGGGSII DGTLEFMVQR RTVADDFRGV DEPLNETCGG MEPYPPYGDA KRVGDGVVIR GVHRLLVGAK GAGLARSQMD ATFAEPLIFV ASSPKPSTPV GALVATQSSL SALQLSLPSN VMLITLMRLQ DRERPTWLLR LGHQYAAGEH EVLSQPAKVN LATFLVDWDV SKIEEKTLSG NRDWDVYTKE RYDW
|
| |