Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45393 |
Symbol | |
ID | 7200489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 22264 |
End bp | 24215 |
Gene Length | 1952 bp |
Protein Length | 425 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179774 |
Protein GI | 219117980 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000938939 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTAGTCTCA CATTGCCAGA AACGTAGACG ACGATCAATC AATACTTGAA GGCATCTGCT TTCGTCAGCA CTCGAGGTAC AGCAATATTC CCTCGCTAGA GTGTATTGAC AGAGCGAAAA ATGAACAACA ACCGGGGATC CGGCGCGGAC AACGAGCGTG ATATGAACGA CGTCGCCCGT GCGCTAAGCG AAGGATTGCT ACAGATAGCA GCGGAAGTTT CGAGTCAACG GGGTGACAGC GGGAGTCAGT CTGGGCAAAT CTTTCGTATG GAAGGTTTGG CGACGCCGAG TAGTCAGGGA GTTGGCGTAA CCACAGGCAG TTTGCCGCTT CGAGACCAAG ACGCTGTTAG TACACTTGTA CAAAGACTAG CACAGGGAAT GCAGACCTCT GCAGCCCAAG GCGGACCACC CCAGCCAAGT GGACTGTCTC AGCCAAATAC GGGTGTGGCC CTCTCTACTC CATCTTCTAC TCTTGATGTG AATCAGCTGG TGGCAATGAT CCAACTCGTC GCGGCACTCA GCGCAAACAG CAGTTTGATT GCAGAGAACA CCAACCAGTC CCTGGAGTTG CTAAGGCAGC TACAAGTAGC TTCTCACCAC GCCCAGCTAG CGTCGGTTCA TCAGCAAAAT ATCCAGCTCC TAAATCATCT TCGGACAGTG CAGAATGGGC TGGTGTCCGG CTCAAGTGAA ATAGTTCCAA ACCCAGTAAG GCCGACGAAC TCACAAGCCA TTACAACCGC AGACTTTATG AAGGAACTTC GGCAGAATAT TGTTCCGGTG TCATTCCAAG CACAGCACTT TCAGCCAGCC AATATCGGAA CACAGGCAAT CCCTGGCAAC GTTGGCAGTG GACCTTCGAA TTCCACTGCA CCGCAAAACA ATAGAATTCC GGTCGAAAAC ATAACCACAC GCTGCCCAGA ACTTGCTACC TTTCCTCAAA ACATTCATCA AGGAGACGCG AGTATTTCTG GATCTCCTGA AAGTCGTCAC GCTTTGCTGA GCTCGCCCAT ACACCGGCAG ATACCAGTGC AACTACGGAT GGAAGGCTAC GAACCATTTC CGAAAAAGTT GTATCGCATT CTCGAAGACG CAGAGCGTCA CGGACAGGAA ACAATCATTT CCTGGATGCC TTCTGGAAAG TCACTAAGGA TCCATGATCC TGACACTTTT ATGAAGGAAA TTGCCCCATT GCATTTTCGG CATTCGAAAT ACACGTCCTT TATGCGGCAG TTGCAAATGT ACGACTTCGT ACGCGCAGCC GAAGGAGCGT ACATTGACTG CTATGCGCAT CGAGATTTCC AGAAAGGAAA GCCAGAATTA TTGTCCAATT TACGACGAGT TCAGGTAAAG TTGAAAAGGC CTCCCAAGAG TGAAGCGGGC GATGAAAACG CGGACTAATA AAAGCAAACC CTCCACCAGA GCTAGAGAGC GGTGACAAAG GTGTCTCTCG AAGTCTGATT CCATCGAGAG GGAAATGATC GTTGGATGCA GGAAGAACAG CGTTGCTCAA AGCGAACCCG TCCTACACAT GCGTCGGAAA ATTGCCAACA TTAAATTGAT GTTGTAGCAA CAGACAAAGG TGTGGCACAG TTTCACGAGG CCCTCCTTTT AGCCGCGATG TCTCGATTTA TGAACAGTCC TTTCCTTTTT GCCAGAAGGA CCGTGAAAGT GTCGACCACG GCCATGTGGT GCGGCAGCGT GGGCTCCCGT GCAAAAGATA GGTCGCCATT TCCAGGAAAC CAAAATCTTT TAAAAATTGA CTACCCCCAT GGTCTCGAGG TATTTAGCCG AGAAAAACTG TTTCTATGTG TATTCCATTT CGAAGTCCAC GACATTGAAT TTCAATGCCA GATCTGAAAG AACCAATAAA CGAATTGGTG CGTCATTCCC CCGCCACTTG GTCTTTTTCT CTCCAATTAA TATAACATAA TGAAAATTAT GGAAATTGAT TC
|
Protein sequence | MNNNRGSGAD NERDMNDVAR ALSEGLLQIA AEVSSQRGDS GSQSGQIFRM EGLATPSSQG VGVTTGSLPL RDQDAVSTLV QRLAQGMQTS AAQGGPPQPS GLSQPNTGVA LSTPSSTLDV NQLVAMIQLV AALSANSSLI AENTNQSLEL LRQLQVASHH AQLASVHQQN IQLLNHLRTV QNGLVSGSSE IVPNPVRPTN SQAITTADFM KELRQNIVPV SFQAQHFQPA NIGTQAIPGN VGSGPSNSTA PQNNRIPVEN ITTRCPELAT FPQNIHQGDA SISGSPESRH ALLSSPIHRQ IPVQLRMEGY EPFPKKLYRI LEDAERHGQE TIISWMPSGK SLRIHDPDTF MKEIAPLHFR HSKYTSFMRQ LQMYDFVRAA EGAYIDCYAH RDFQKGKPEL LSNLRRVQVK LKRPPKSEAG DENAD
|
| |