Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_3369 |
Symbol | GALK1 |
ID | 7203303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 379926 |
End bp | 381116 |
Gene Length | 1191 bp |
Protein Length | 397 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | galactokinase |
Protein accession | XP_002182673 |
Protein GI | 219124778 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.949069 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCCCGGCGT ACGTGGTGGC GGCACCAGGA CGGGTCAATT TAATTGGCGA GCACGTCGAT TACACGGGTG GATTTGTCTT ACCCTTGGCC ATCGACTTTT CCACCGTGAT TTACGGAACC GGCTTTTGCC ATACCGGCAA GGGCAATGGA CCGACGTCGA TGCGTCTGCG ACTCATTTCC GAAAAGGCCC TTAACGGAAG TATTGTTGAA GAACGCCGGC TGAACGGCAC AAGTCGTCCA CCAGATGCGG ATGAACCGCG ATCATGGGTC AACTACGTAG TCGGGGTAGT ACAACAGTAC ATGCACGATC TTCCCAAAGA AGGGTGCACT ATAGACCTGT CTATGGCCAT TGCTAGCGAT GTTCCTTTGA GCTCCGGTCT GTCCAGTTCG GCATCGCTCG AAGTTGCAGT CGCGACATTC TTCGAATGCT TCCTGCCGGA AAATATGGCC TATTCCTCCG CCAAAGAAAA GGATATTCGA AAGGAACGCG CGTTACGTTG TCAGAAAGCC GAAAACGATT GGGCACATTC GCCATGCGGT ATCATGGATC AACTAGCCTC GTCTGCCGCC CAAGTCGGCA AACTTATGCT CATTGACTGC CAATCACTGG AAATTGAACA CGTCACTATG AAAGCAAATA CACCAGAAGA TCCCGTTATT CTTATCACCA ACTCAAAGGT CACGCACAGC ATTGCCGATT CCGAGTATGG GATCCGACGA GCCCAGTGTC ACGATGCGCT CCTAGCAATG CAGTCCATTC CGCTCTACCA CGTACTTTCT CTGCGAGACG CCACTAAGGA TGACGTGAAA GAGGCCGAAG CTAAGATGAA TAAGATTTCT TACCATCGAG CTCTCCACGT CGTTAACGAA AATGTTCGCA CCAAAGAGTG CAAGGTTGCT CTGAAAATGG GACTGTGGGA TCATGTTGGA GAACTTATGA ATGCGTCCCA CGCAAGCTTG CGAGACGAGT ACGAAGTAAG CTGTGAAGAA GTTGACTATC TCGTCGAAGT AGCTCAGGCG TACGAAGGTG TGTACGGTTC CCGCATGACC GGCGGCGGTT TTGGCGGTTG TACGGTCACT TTCGTTCAAC GCCGAGTTGT CGAAGGACTC ATAAAGCATT TACAATCATC TTACGAAGCC AAGTATGGGA AACAAGCCGA GTGCTTTTTG ACAGAACCGG CGGAAGGAGC C
|
Protein sequence | PPAYVVAAPG RVNLIGEHVD YTGGFVLPLA IDFSTVIYGT GFCHTGKGNG PTSMRLRLIS EKALNGSIVE ERRLNGTSRP PDADEPRSWV NYVVGVVQQY MHDLPKEGCT IDLSMAIASD VPLSSGLSSS ASLEVAVATF FECFLPENMA YSSAKEKDIR KERALRCQKA ENDWAHSPCG IMDQLASSAA QVGKLMLIDC QSLEIEHVTM KANTPEDPVI LITNSKVTHS IADSEYGIRR AQCHDALLAM QSIPLYHVLS LRDATKDDVK EAEAKMNKIS YHRALHVVNE NVRTKECKVA LKMGLWDHVG ELMNASHASL RDEYEVSCEE VDYLVEVAQA YEGVYGSRMT GGGFGGCTVT FVQRRVVEGL IKHLQSSYEA KYGKQAECFL TEPAEGA
|
| |