Gene Information |
Locus tag | PHATRDRAFT_23924 |
Symbol | GPI_1 |
ID | 7199044 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 194949 |
End bp | 197014 |
Gene Length | 2066 bp |
Protein Length | 554 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | isomerase glucose-6-phosphate isomerase |
Protein accession | XP_002185147 |
Protein GI | 219129966 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCAGCCTTG CAAGCTCTTT TTACCCGACT GACTGAAAGA TACCCACAAG GAGAATCTTG TTCAATCCTC TCCTTGCAAG TCCTTTACAC TTCTCCAAGC CTATCCTACC ATGTCGGACC TGACGTCTTC TCCCGCTTGG AAAGCCCTTG AGGCCCATTA CGGTACCATG AAGAACGTTC ACATGAAGGA ACTATTCGCC AACGATTCCG AACGCTTTCA CAAGTATTCC ATGAAGTTCG AAGACATTCT ACTCGACTAC TCCAAAAATC GTGTTGCAGA TTCGACTATG GATCTGCTGT ACGCTTTGGC GGAGCAGCAA GATGTCAAGG GTAAGGCTCA GGCCATGTTC AGTGGTGAAA AGATTAACTC CACGGAAGAC CGGACTGTCC TGCACATTGC TTTACGCAAC CAGTCCAACA AGCCGATCTA CGTCGATGGT GAAGACGTCA TGCCCGCTGT CAACGAAACC CTCGAAAGGA TTAAAGCTTT TACGAATCAG GTTCGTGACG GGACCTGGAA AGGGCATACG GGAAAAGTGA TTACGGCGAT TGTCAATATC GGTATCGGTG GATCCGATCT TGGTCCCGTC ATGGCCTGTG AAGCTCTCAA GCCGTACGCA GCCGACCATT TGAAGATGCA CTTTTGTTCT AACGTAGACG GTACGCACAT TGCCGAGATT CTTAAGCTTT GTGATCCCGA AACAACCCTA TTCTTGATCG CCAGTAAAAC CTTCACCACG CAAGAAACCA TGACCAACGC CAACACAGCG AAAACGTGGT TGGTGGGCAA CTTTGATGGG GACGAGACTT CTGTCGCCAA ACATTTTGCG GCGTTGAGTA CTAATGGTGA TGCTGTGTCT GCCTTCGGTA TCGACGTCAC TAATATGTTT GGATTCTGGA ATTGGGTTGG CGGACGATAC AGTTTGTGGA GTGCGATCGG GACACCGATC GCCCTCGCTA TTGGTTTCGA CAATTGGATG GAAATGCACG CGGGTGCACA CGCTGTCGAT CAACACTTTT TGCAGTACGA TGGCAAGGAC AATCTGCCTT TGACTTTGGC GTTGATTGGT CTATGGTACA ACAAGTAAGC TCTGCGAATG ATCTGATGAA TATTTTGTTT TGGTACTGAC TGTGAGACTG ACTTCTTTTA TATGAACAGC TTCTTTGGCG CTGAAACTGT GGCAATTCTC CCCTACGATC AATACATGCA CCGATTTGCA GCTTACTTTC AGCAAGGCGA CATGGAATCG AACGGGAAGT ACGTTGATAT GAACGGCAAA AGGATGCAAA CCCAAACCGG TCCCATCATA TGTAAGTGTA TATGATCTTA TGAGTGTGCG ATCAGTTCAG AGCTAGCCTA ACCCTCATTT TTGTCATCAT ACAGTCGGTG AACCTGGCAC CAACGGACAA CACGCTTTCT ATCAGTTGAT CCATCAAGGC ACCAAAATTA TTCCTTGCGA TTTCGTTGCA CCTTGTCAAA GTCAAAACAA TTTGCCCGAG AAGCTTGGTG CGCCTGTTGA CCACCATCCT ATTTTACTTT CCAACTACTT TGCTCAAACG GAGGCTCTAG CATTTGGAAA GGATGAAAGC CAAGTTAAGG CGGAATTGGA TAAGACGGAC ATGTCGGAGG CCGAAAAGAC GGCGCTTGTC CCGCACAAGG TTTTCGAAGG CAACCGCCCT TCGAATTCTT TCCTGTTCAA GAAGTTGACG CCGCGTACGT TGGGTAGTCT GATCGCAATT TACGAAATGA AAATATTCTG TCAAGGCGCT ATTTGGAATA TTAACTCGTT TGATCAGTGG 
GGTGTTGAGC TCGGTAAGCA GCTTGCCAAG GCTATTTTAC CGGAACTCGA CGGCGACTCC CCTGTTTCGT CCCACGACAG TTCCACCAAT GGTCTCATCA ACTATTACAA GACAAACAAA TGAACTGTAG AGCATTCGAG CTGGGACGTC TGGCGTGTTA AGATGCAACA AAATAAACGA TGTGAAGTAG GAAGCGATTC CAATGACGCA TGAAAGTTGT AGCTCTGCGT TATATCCATT TAATTTAGCA AATCCAATCG TAAATA
|
Protein sequence | MSDLTSSPAW KALEAHYGTM KNVHMKELFA NDSERFHKYS MKFEDILLDY SKNRVADSTM DLLYALAEQQ DVKGKAQAMF SGEKINSTED RTVLHIALRN QSNKPIYVDG EDVMPAVNET LERIKAFTNQ VRDGTWKGHT GKVITAIVNI GIGGSDLGPV MACEALKPYA ADHLKMHFCS NVDGTHIAEI LKLCDPETTL FLIASKTFTT QETMTNANTA KTWLVGNFDG DETSVAKHFA ALSTNGDAVS AFGIDVTNMF GFWNWVGGRY SLWSAIGTPI ALAIGFDNWM EMHAGAHAVD QHFLQYDGKD NLPLTLALIG LWYNNFFGAE TVAILPYDQY MHRFAAYFQQ GDMESNGKYV DMNGKRMQTQ TGPIIFGEPG TNGQHAFYQL IHQGTKIIPC DFVAPCQSQN NLPEKLGAPV DHHPILLSNY FAQTEALAFG KDESQVKAEL DKTDMSEAEK TALVPHKVFE GNRPSNSFLF KKLTPRTLGS LIAIYEMKIF CQGAIWNINS FDQWGVELGK QLAKAILPEL DGDSPVSSHD SSTNGLINYY KTNK
|
| |
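The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive; the record does not state this, but it is consistent with the listed Gene Length of 2066 bp. The function name `gene_length` and the constants are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 194949
END_BP = 197014
PROTEIN_LENGTH_AA = 554

length_bp = gene_length(START_BP, END_BP)

# Bases needed to encode the protein plus one stop codon.
coding_bp = (PROTEIN_LENGTH_AA + 1) * 3

print(length_bp)              # 2066
print(coding_bp)              # 1665
print(length_bp - coding_bp)  # 401
```

Under these assumptions, about 401 bp of the gene region is not accounted for by the 554 aa protein and its stop codon, which fits a record marked as spliced (introns, and possibly untranslated sequence).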