Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49793 |
Symbol | |
ID | 7198460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 320447 |
End bp | 323259 |
Gene Length | 2813 bp |
Protein Length | 874 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | beta-glucosidase |
Protein accession | XP_002184524 |
Protein GI | 219128657 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAATA CGAACAAGGC GGAATCACAC CAAACACTCC GTACCGTTAC CGACGATGAT TCCGTGACCA TGACTGCCTC GCCGGATGTC CAGTACCACC AAACCAACTC CACGGATCCG GACGTCGAAT GCCCCCCGTC TACTCCAACA TACGTCTCGG TGCATGACTT GACTTTGGAG GAAAAACTCT TGCTGCTTTC CGGTCAAACC TTGTGGTTGT TGCCGGATTT GCCCCGCTTC AATTTACCTT CTCTCACTGT CGCCGACGGT CCCCACGGCG TGCGCAAGCC CGTCAAAGAG CTGTCGCTAC AGGAAGCGCT ACCCGCCACG TGCTTTCCCA CCGCCGCCGC TCTCTCCTGC AGTTGGGACG TGGATTTCTT GCGCCAAGTG GGTATCGCAC TCGGCAACGA ATGCGCGCAC TACCAGGTCG CCGTCCTCCT CGGCCCGGGC ATGAACCTCA AGCGCCACGG CGGCGGCGGG CGCAATTTCG AATACTTTTC CGAAGACCCC CTCGTTTCGG CCAAACTCGC CACGGCCTAT GTACAAGGCG TCCAAGCCAA CGGACGCGTC GGCGCCTGCA TTAAACACTT TGCCGTCAAC AATCAGGAAT CACACCGATT CGTCGTGGAC GCTGTCGTCG ACGAACGCAC CGCACGCGAA CTCTACTACC GCGGTTTCGA AGCCGTGGTC CGCGACGCGC AGCCCGCCAC TATCATGTGC GCCTATAACA AAATTAACGG CGTGTACTGT AGCGAAAACG AATTCTTGAA CACGCAGTTG CTCCGCGACG AGTGGGGCTT CCAGGGCGTG GTGATAACGG ACTGGGGTGC TACCAACGAT CGGCCGGCCG CCATTGCCGC CGGCATGGAT TTGGAAATGC CCGGATCGCA CGGAGCCCAC GGCAGGGAAA TACGGCGCGC CCTTCGGGAA GGGACGGTTT TGCGCATGGA ACACGTCGAC GCCTGCGCCC AACGTATGCT AAATCTCATG TGCCGGTACA AAGAATCGGT TCGGGATACC TACGAATTGT CGAGCTGGCA TGATCAGCAT AAGTTGGCCA AGCAAGTGGC TATGCAGTGC GCCGTCTTGT TGCAGAACCA GGGCAATTTG TTGCCGCTGA AACAAGGAAC GTCGGTAGCC GTTATTGGAG ACTTTGCCAA GGAACATCCC CGCTACCAAG GCATGGGGAG TTCGCAGGTC TGCACTAATT CTGTCGTAAC GGCGTACGAT GAGTTGTTTC GTCATACGAA GGACGTTTTC TTTGCACCCG GATACCATGC CGATGACGAC CATATTGAAG CCGTGAACGA GGAATTATTG GCCGAAGCGG TGAGGGTGGC GCAGCAAGCC GAAGTCGTCT TGTTGTGCCT GGGACTTCCG GAAATTATGG AGTCCGAGGG TTTTGACCGT TTACACTTGA ATATCCCTGC ACAACACAAT GCTCTAGTGG ACGCGGTTAG CAAAGTGAAC AGTAACGTGA TTGTGATGTT GAGCAATGGC GGAGCGATTG AGATCCCGTG GGCTGACAAG GTCAAAGCTA TTTTTGAAGG CTACCTCTTG GGTGAAACCG GTGGAGCCGC CACGGTAGAT TTGATTTTCG GTGTGCAATC GCCCTGCGGC AAACTCGCCG AAACCTTTCC AATTGTCCAA GAGGACATAC TCGCAGACCG GTACTTTCCG GGCAGTCGCG ATCGTGTGGA GTATCGAGAA GGTTTGGATG TCGGCTACCG TTACTTTGAC ACCGCCCAAA AAGACGTTCG TTTTCCCTTT GGGCACGGAT TGACGTACAC GACCTTTGAA TACGGCAATC TTAATGTGCA AGTCAATCGC GACGATGCTA CATCCAAATC TGTGCATGTC TCGTTCGACT TAACCAATAC TGGCGCAGTA GCTGCCAAGG AGGTTGTGCA ATGCTATATT CATCAAGACT CGCCGTCGGT TTATCGACCT GTTCACGAGC TCAAGTATTT TTGCAAAATA CATTTGGAGC CTCAGCAAAG TAAACAGGTA GAATTTGATC TCCTTACCGA TGCGTTTTCG TTTTACGACA TTGGAGTCTC AGATTGGACG GTAGAAGCTG GTGGTTTTGA AATACGCATT GCGTCGAGCA GTCGAGACAT TCGCTTGGAA GCGCCTGTAG TATTTGCGGA AGGGCGTGGG CCGAGTGATC TGGCGAAGGA AACCTATCCT CCCGTTGCTG GAGGTGGCAC ACTCAGTCAA GTGGACGACG AAACATTCGC TAAGCGGTTT GCGAAGAGAA AAGAATTTGT ATTGGCAGAA TGCGTGGCGT CTGCCGAATC TAGTACGGTC TCAAGAGTTG GCGGTTTTCA TCGGAATTCG CTACTTAAGG AAGTAGCAAG TCGCAGACTT ATGGGCAAGC TCTTACTGTC AGTTGTTCTA TCCGCGGCAG CAAAAGAGGT CAAGAAAGGA CCTACCCGAA AAAGGCAAAA ACGTATGGTT CGAGCCAATG TGGAAAATCT CCCGCTTCGA ACGCTGGTGC TATTCAGTAA AGGTGTACTG AGTTTTGAAC TGCTGGATGC CTGTATTGCA GCCATGAACT ACCAGGTATT TCGTGCCATC GGAGGCTTTG GTTTGGCTTT TGCCTGTTTA TTTAAACGAA ACTAAATTGT ACAAACCGTG CTCTTGACAG TGAGGTTTTT CGTCAAGTAA ACTCTTCTGC CTATTATTGG TTCCGCTGTC ACTGTGAAGA CGCTTCCATC AGGCTCTGTC TCCGTTAGAG CAAACGAATT CCAAAAAAAG TATGGTCACT TCCAAAATCC CTGAAATAGC AGATAAGGAC GGATATGTCA TTT
|
Protein sequence | MSNTNKAESH QTLRTVTDDD SVTMTASPDV QYHQTNSTDP DVECPPSTPT YVSVHDLTLE EKLLLLSGQT LWLLPDLPRF NLPSLTVADG PHGVRKPVKE LSLQEALPAT CFPTAAALSC SWDVDFLRQV GIALGNECAH YQVAVLLGPG MNLKRHGGGG RNFEYFSEDP LVSAKLATAY VQGVQANGRV GACIKHFAVN NQESHRFVVD AVVDERTARE LYYRGFEAVV RDAQPATIMC AYNKINGVYC SENEFLNTQL LRDEWGFQGV VITDWGATND RPAAIAAGMD LEMPGSHGAH GREIRRALRE GTVLRMEHVD ACAQRMLNLM CRYKESVRDT YELSSWHDQH KLAKQVAMQC AVLLQNQGNL LPLKQGTSVA VIGDFAKEHP RYQGMGSSQV CTNSVVTAYD ELFRHTKDVF FAPGYHADDD HIEAVNEELL AEAVRVAQQA EVVLLCLGLP EIMESEGFDR LHLNIPAQHN ALVDAVSKVN SNVIVMLSNG GAIEIPWADK VKAIFEGYLL GETGGAATVD LIFGVQSPCG KLAETFPIVQ EDILADRYFP GSRDRVEYRE GLDVGYRYFD TAQKDVRFPF GHGLTYTTFE YGNLNVQVNR DDATSKSVHV SFDLTNTGAV AAKEVVQCYI HQDSPSVYRP VHELKYFCKI HLEPQQSKQV EFDLLTDAFS FYDIGVSDWT VEAGGFEIRI ASSSRDIRLE APVVFAEGRG PSDLAKETYP PVAGGGTLSQ VDDETFAKRF AKRKEFVLAE CVASAESSTV SRVGGFHRNS LLKEVASRRL MGKLLLSVVL SAAAKEVKKG PTRKRQKRMV RANVENLPLR TLVLFSKGVL SFELLDACIA AMNYQVFRAI GGFGLAFACL FKRN
|
| |