Gene Information |
Locus tag | PHATRDRAFT_55197 |
Symbol | |
ID | 7199248 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 46605 |
End bp | 50076 |
Gene Length | 3472 bp |
Protein Length | 911 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | oligosaccharyl transferase |
Protein accession | XP_002185367 |
Protein GI | 219130427 |
COG category | |
COG ID | |
TIGRFAM ID | |
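The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates (an assumption, though it agrees with the listed Gene Length) and that the coding sequence ends in a single stop codon. The function names `gene_length` and `min_coding_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
start_bp, end_bp = 46605, 50076
protein_aa = 911

length = gene_length(start_bp, end_bp)
coding = min_coding_length(protein_aa)

print(length)           # 3472, matches the listed Gene Length
print(coding)           # 2736 bp of coding sequence including the stop codon
print(length - coding)  # 736 bp that are not coding sequence
```

The remaining 736 bp would be non-coding sequence within the listed span (introns and any untranslated flanks), which is consistent with the record marking the gene as spliced.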
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGACTCTC TCCTGCTCTC TCTCTCTTGG AGAAAATTGC ACTCGTGTGT ACATTGGTGT GTGTCGAATT GCCATGGTAG ATTTAGACAA GAAGACGAAG AAGAACAACG GCAGCGTGAA GACGGTAACC AACGGCCAGA CCAATGCGCC GTCATCGTCG TCGAACATGA ACGATGCGAG TCGTCCCCCC GTTACGGAAG CTACCGTAGA TAGCACGGTG CGCAGTGTTT CGCGTTTGAT CCAAGCCTTA ATCTTCGCGG CTGTCCTTCA AGTACTCTAC CGAGCGCTCG TGGAAGCCTA CACGATTCGA TTGCACGCCA TTAACGAATA CGGACGCGTC ATTCACGAGT TTGATCCTTA CTTGTAAGTC CTCCGTATCG ACAAGCCTGT TATAGTGGCA GTTATCCTGA CGCGTACCCC GCAACTCACT GTACCCTTCC CTTTTCTGCT TGTAATATCT ACTGTACTGT CGGTGCTGGT ATCACCGTCT ACCTTTGCTA ACGTCGTTCG ATCGTTGGCT GTCGGTTTTA ATTATCTTCT CGATCCGTAC CGGACAGCAA CTACCGCGCT ACGGAGTACC TGTACGAGCA CGGATGGCAA GCCTTCTGCA CCTGGTTCGA CTACCTTGTC TGGTATCCTT TGGGTCGTCC CGTTGGTACC ACAATCTACC CCGGTATGCA GGTCACCGCC GTCGCTTTGA AGAACTATGT ACTCACCGAT ATGTCCATCA ACGACATTTG CTGCTTCATC CCCGCCTGGT TCGGGGTCAC AGCGACAATG GCGGTCTTTG CGTTGTCTTT CGAGTGCACC CGATCCACCG CACGCTACGA GTCCATCTTT AGTACCATCC CTTTGGTCAA GGATATCTAC CAGCTTTTGT TCAAACCGGT GCTGGAGTTT GTCTGGAACG GCATTTTGGT CAAGATTACC GGGGGTGACT GGGGATTGGG TCGCGTGCAC CGCAAGCCAA CGGCACCGGC AGGCTCCTTT CGGACTCCCT GGATGTGTGC CACGGCGGCG GCTTGGATTA TGAGCATTGT ACCGGCACAT CTCTTGCGCT CCATTGGTGG CGGTTACGAC AACGAAAGCA TCGCCATGAC GGCCATGGTT CTGACGTTTG CCGTCTGGAC GCGGGCCTTA CGAGATACTC CGGAAACGAA CGGATCCGGC TGGTTACCCG GGACCATGCT GTGGGGTGCT TTGACCGGGG TAGCCTATTT TTACATGGTG GCAGCGTGGG GTGGCTACGT TTTTGTCCTG AACGTGATCG GCGTGCACGC GTCGGTGTTG GTGCTGCTCG GACGCTATTC AACCAAACTG CATCGGGCCT ACTCCTGCTT TTACGTGGTC GGTACCGCGC TGGCGATTCA GGTACCCGTT GTGGGATGGA CGCCCCTCAA GAGCCTGGAG CAAATGGGAC CACTCGCGGC CTTTTTGGGA ATGCAACTCT TGGAAGTCTG TGCCGTCATT CAGCGCAAGA ACCCGGGCAT GTCCCAGAAA AAAATCTGGA AGCTGCGGAT GCAGGTCTTT GGTGGCGCTG GATTGATTGG CGCGGCCCTT GTTTTTCTGC TCTTGCCAAA CGGCTACTTT GGTCCGTTCA GCTCCCGGGT ACGTGGACTC TTTGTGAAGC ACACCAAGAC TGGCAATCCG CTGGTCGATT CCGTTGCGGA ACATCAAGCC GCCTCTCCGG AAGCGTATTT TCAATATCTA CACAAGGTGT GCTACCTAGC TCCCGTTGGA CTGCTCATGG TTTGTTTGTT TTCCTTCAAC GACTCGAGCT CGTTTTTGCT CGTGTACGGT 
GTGGCGGCGT ACTTCTTTTC TCACAAGATG GTCCGGCTTA TCTTGCTGAC GGCGCCGATT GCATCCGTTA CGGCTGGCAT TGCTTTGGGT AAATTTGTCT CGTGGGCCGT GGATGCCATT GCGCTGGATC AATCTCTGGA TTTGTGGGCC TTTTTGCTGG GCAACTCGAC TACCGAAGCA TCCACAAACG GAACTGCCGT CGCGTCCAAG GCCAAGCCAA AAAAAGGCAA GCGGAATACT AACGGAGATG GAGAAGACGA TGTTGTAGCA GCGGGCCAAC CCCTCTATAT GCGCATCGTT CGTTTGGCTA TCTTTGGCGC AGTCGCTTTC CAGTCCATCC CCGAGTTCAA AAGCTTTTGG CAAATTTCCC ACGAAGTTGC GGTACAAGTC TCGCACCCGA CAATTATACA AAAAGCACAG CGTAGGGATA CTGGAGAAAC AATCTTTATC GACGACTACC GTCAGGCATA TTTGTGGCTC AAGAACAATA CTCCGGAAGA CGCCCGCATT ATGGCTTGGT GGGATTACGG GTACCAGATA ACCGGCATCG GGAATCGCAC CACTATTGCC GACGGAAACA CTTGGAACCA CGAGCACATT GCCCTCTTGG GACGGACGTT AACGGCCCCA GAAAAGGAAG GTCACCGCAT TGCTCGCCAT CTCGCTGATT ATGTTCTCGT TTGGGCCGGC GGCGGTGGCG ACGACGTGGC GAAGTCTCCA CACTTGCGAC GCATTGCCAA TTCGGTGTAC CGTGGCTTGT GCCGAGAGCC GACTTGCCGA GACTTTGGCT TTCAAGTAAG TGCCGTGATT TGTTGTCCGT TTACCATCTT TGGTTTGGGC TAATAATTGT ATTACTTTTC GTGTAGAGTA AGGGAAACCC GACCGCTGAT ATGCGAAAGA GTCTCTTGTA CAAGCTGCAC AGTCACAATT TGGTCCCTGA TGTGCAGGCC GACCGAAATC GATTCCGTGA AGTTTTCACC AGTACATACG GAAAGGTTCG AATTTACAAA ATCTTGAGTG TTTCCAAGGA GAGCAAAGAT TGGGTTGCGA ATCCGGAAAA TCGAGTTTGT GATGCCCCAG GTAGCTGGAT GTGCCGGGGG CAGTACCCAC CGGCTCTCAA GAAAGTGTTG AGTGAAAAGA AAGATTTCAA GCAGTTGGAA GACTTCAATG CGAAGAGCGA CCATGACGAT TCGGACTATC AACGTGAGTA CATGGAAAAT GTCATGCACA AGAAGCAAGT CCCGCGTCCA AGAGACTCTA AGAGTGGGCA GGAACCAAAA GAACCAAAGG AGCAGCCCTC AGCGAGGCAC GAAGTAGTTA AACTTTCCGA TGAAAAGATT GAGGCCCTGA ACCAAAATTG GGAGAACAAC GAGCGCACCA CACTTATGTG GGAATTGATA TCAGAAAATC GCTTGAATGA TATGGTGCAA CTATTGTACG AATCCCCTGA ATTGGTGCAC CTCCGCAGTG CCGATGGGCG TGGACCTATG TTCTGGGCTC ACGAGTATGG CAGATCGCAA TTCCTTCAAG TCTTTCGACA GCTAGGCGTT CGAGAAGACC GAAAGGATGC TGTTGGAAAA ACGGCTCTCG ACATGGCGTA GGTCCAACCA GTTATATGAC TACACATGGT TAAGAGCTTT ACTAAAATTA GCCACCGATT TTGCATTCGT CA
|
Protein sequence | MVDLDKKTKK NNGSVKTVTN GQTNAPSSSS NMNDASRPPV TEATVDSTVR SVSRLIQALI FAAVLQVLYR ALVEAYTIRL HAINEYGRVI HEFDPYFNYR ATEYLYEHGW QAFCTWFDYL VWYPLGRPVG TTIYPGMQVT AVALKNYVLT DMSINDICCF IPAWFGVTAT MAVFALSFEC TRSTARYDIV PAHLLRSIGG GYDNESIAMT AMVLTFAVWT RALRDTPETN GSGWLPGTML WGALTGVAYF YMVAAWGGYV FVLNVIGVHA SVLVLLGRYS TKLHRAYSCF YVVGTALAIQ VPVVGWTPLK SLEQMGPLAA FLGMQLLEVC AVIQRKNPGM SQKKIWKLRM QVFGGAGLIG AALVFLLLPN GYFGPFSSRV RGLFVKHTKT GNPLVDSVAE HQAASPEAYF QYLHKVCYLA PVGLLMVCLF SFNDSSSFLL VYGVAAYFFS HKMVRLILLT APIASVTAGI ALGKFVSWAR NTNGDGEDDV VAAGQPLYMR IVRLAIFGAV AFQSIPEFKS FWQISHEVAV QVSHPTIIQK AQRRDTGETI FIDDYRQAYL WLKNNTPEDA RIMAWWDYGY QITGIGNRTT IADGNTWNHE HIALLGRTLT APEKEGHRIA RHLADYVLVW AGGGGDDVAK SPHLRRIANS VYRGLCREPT CRDFGFQSKG NPTADMRKSL LYKLHSHNLV PDVQADRNRF REVFTSTYGK VRIYKILSVS KESKDWVANP ENRVCDAPGS WMCRGQYPPA LKKVLSEKKD FKQLEDFNAK SDHDDSDYQR EYMENVMHKK QVPRPRDSKS GQEPKEPKEQ PSARHEVVKL SDEKIEALNQ NWENNERTTL MWELISENRL NDMVQLLYES PELVHLRSAD GRGPMFWAHE YGRSQFLQVF RQLGVREDRK DAVGKTALDM A
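The sequence fields are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. As a minimal sketch, the GC content value listed under Gene Information could be recomputed from the Gene sequence field roughly as follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the whole field.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the Gene sequence field, used only as a small demo.
demo = "CTTGACTCTC TCCTGCTCTC"
print(len(clean_sequence(demo)))  # 20
print(gc_percent(demo))           # 55.0
```

The result for the full Gene sequence field can then be compared with the GC content value listed in the record.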