Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41531 |
Symbol | |
ID | 7199384 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 95315 |
End bp | 97000 |
Gene Length | 1686 bp |
Protein Length | 514 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | nad-dependent epimerase/dehydratase |
Protein accession | XP_002185484 |
Protein GI | 219130674 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCGCT TGCAGGCTGC GCATCATCAT CCTCCACCAG GGCAGCACAA TCCCCGCAAT GCCGAAATAA GTCGTCGGTT GGGAACCCCG AACGGTAGCA GTACCAACAA TACCAACACC AACAACGGCA GCCACGCCGC CATGTTTTGC AGTCGTCAGA GTATCTTGAC GCTGACGGTG GGAGTGTTGG TGGGTTACAT TCTCTTACCC GTGTTGTTGG TGGAGATGGA GTTGCAGGAC CTGATGGGAG CACTGCCGGA CTGGGAAACG ACCGGCAGTG GGCCGTACGC CCGCCCGTCG CCTCCGGAGA TGCTGCACCA GGGTCTCTAC AGTACCGCAC GCATTCCGGC ATCCTTGTCC GAGACGCCGC GTTTGCGCAG TGTCGAGCAG GAACCGCCGC GAACGCTCGA AGCTGCTCCG CGGGACTTTG CCGGTGTCGT GGCGGAAGGC GTCACCGATA TCGAAAAACG AATCGTGCAA GATCACGATC TGCTCGGGAG ACAGTCCTTG CCCACCGCAA CGACGCCCTA CATTATGCCC ACCAAGGTCT TGCCCGATCA TCAACGCAAG AAGATTCTCG TCACGGGAGG AGCCGGATTC GTCGGCAGTC ATCTCGTGGA TAAACTCATG ATGGATGGGA TGGAAGTCAT TGTCGTGGAT AACTTCTTTA CCGGACAAAA GAAAAACGTG GCGCACTGGT TGCATCATCC CAATTTCAGG TAAGCACCAT TACAAGAAGG GAGGAAGTAT GGAGCTAGCA TCGAACCTCC AAGGACACGC ATACAAAGCT TTCCATGGCA CACACAAACA CACACTCACA ACACACCGTT TGTGCATGCT GTTATTCATA CACCGAACAG TCTCGTGGTG CACGATGTCA CCGAACCAAT CCAACTCGAA GTAGACGAAA TCTACCACTT GGCCTGTCCG GCGTCACCTC CGCATTACCA GTACAATCCG GTCAAAACCA TCAAAACGTC CACCATGGGG ACCCTCAACA TGCTCGGACT CGCCAAACGC GTCCGCGCCA AGATTCTACT CACCAGCACC TCCGAAATAT ACGGCGATCC CAAGGTACAC CCACAGCCCG AATCCTACTG GGGAAACGTT AACACCATCG GACCCCGCTC CTGCTACGAC GAAGGCAAAC GCGTGGCCGA GACCATGATG TACAGTTACA AGAACCAAAA CGGCGTCGAC GTCCGCGTCG CACGGATATT CAACACCTTT GGTCCGCGCA TGCACCCCAA TGACGGACGC GTCGTTTCCA ACTTTATCAT ACAAGCCCTG CAGAACAAGA ACATGACTAT TTACGGCGAA GGCAAACAAA CACGATCCTT CCAGTACGTT ACCGATCTCG TCGACGGTCT CTACGCGCTC ATGAACGGCA ATTACGATCT TCCCGTCAAT CTCGGCAATC CGGAAGAATA TTCCGTCAAG GACTTTGCCA CCTACATTCA AGAACTCACC AAGAGTACGT CGGACATTAT CTTCTTACCC AAATCCGAGG ACGACCCCTC CCAACGTCGA CCGGATATCA CCACGGCCAA GCGAGAACTG GGCTGGGAAC CCCAGGTCAA GGTACAAAAA GGCTTGGAAA AGACCATTGA ATACTTTGCC CGTGTTTTGG AAAGTGCGGG GGAAATCATT CCGACCGGAC CCGGGGCCGC CAAGCCCGAA GCCTAA
|
Protein sequence | MVRLQAAHHH PPPGQHNPRN AEISRRLGTP NGSSTNNTNT NNGSHAAMFC SRQSILTLTV GVLVGYILLP VLLVEMELQD LMGALPDWET TGSGPYARPS PPEMLHQGLY STARIPASLS ETPRLRSVEQ EPPRTLEAAP RDFAGVVAEG VTDIEKRIVQ DHDLLGRQSL PTATTPYIMP TKVLPDHQRK KILVTGGAGF VGSHLVDKLM MDGMEVIVVD NFFTGQKKNV AHWLHHPNFS LVVHDVTEPI QLEVDEIYHL ACPASPPHYQ YNPVKTIKTS TMGTLNMLGL AKRVRAKILL TSTSEIYGDP KVHPQPESYW GNVNTIGPRS CYDEGKRVAE TMMYSYKNQN GVDVRVARIF NTFGPRMHPN DGRVVSNFII QALQNKNMTI YGEGKQTRSF QYVTDLVDGL YALMNGNYDL PVNLGNPEEY SVKDFATYIQ ELTKSTSDII FLPKSEDDPS QRRPDITTAK RELGWEPQVK VQKGLEKTIE YFARVLESAG EIIPTGPGAA KPEA
|
| |