Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47925 |
Symbol | |
ID | 7203179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 442431 |
End bp | 444179 |
Gene Length | 1749 bp |
Protein Length | 562 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182228 |
Protein GI | 219123847 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.462072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTGTCGAA AAGAATCATG ACCTAGCGAG AGAATTCGTC CCCATTGCTG TACTTCGACT ATGGGAAAGA AACGTCAAAG TCGACCTCGT CTAGATCCTG GGGCACACAT TGCCATAATA GGTTCCGGGC TAGCCGGGCT TTCGACTGCG CTCTCGTTGG AACAAGGTGG ATTCACCAAC GTGCATATCT ACGAGAGGGA TGGTTCGCAT GACGCCCGCA AGGAAGGCTA CGGTCTGACT TTGACTTACA ATCCGACAGG CGTCCTGCAC CAACTCAACG TTCTGGAGGA AATTGCGCAA TCTGACTGCC CCAGCCGGTC GCATTACATG TTCAATGCGA ATGGTGAAAT TCAGGGATAT TTTGGCAACG CGTTTGCGCG GAATCGAGGG TGGGGTCAGC GGGGCAATCT GCGCGTGCCG CGTCAACGAG TGCGCCAGAT TCTTGCGAGC CGACTGAAAA TAACAGAAAC TCACTGGGAT CATAAACTAG TAGGGGTGAG TGGCTGTGAA AATGGAGAGA ACATTTGCCT GGCGTTTCAG CTAGAAGGTG CAGCAGAGGA AAAGCTTCTG GTCGGGGCAG ACTTGGTTGT GGCGGCCGAC GGTATTCGTT CGGCGGTTTT GCAGCACGCA TATCCACAGG CTCCACCCAT TCAGTCGTTG GGCATTCGAC TTATTCTGGG AATATCCTCT AGCTTTACTC ACGTTCATCT GAAAGAACGT GGATTCTATA CACTAGATAG TGGAAAGCGA TTATTTGTTA TGCCTTTTGC CCGAACCGCT GAGTGGGCGG TTATAGATGA CAATGCGACC AGTTGCAGTC AGCCTACAGG AGAGCAGTAC ATGTGGCAGC TCTCCTTTGC TAGCTCCGAC GATAAGACAT ACAGCTCTAC GGAACTCTTA CAGCAAGCAC TATTTCATTG TCGAGATTGG CACGAACCCG TCCAGGAATT GATGCTTTCT ACGTCGCAAG AGTCGATTTG GGGAACGTTG CTATATGATC GCAATCCAGA GATCTTACAC AAACACTTAC AAATCAACGA CACTTTACCA CGCCGTATTT TGATCGTCGG CGATGCGTGT CATGCCATGA GCCCATTCAA AGGGCAAGGA GCCAACCAAG CCTTGCAAGA TGGCCGGGTT TTGGTCAAAC ACTTGACCTC GGCAAGAGTG GAAATTGCTG TCTCGAACAC GCAGCGAGAA ATTGTTCAAC GGACGGCATC TGTGGTAGCC GCATCTCGTC AAGCCTCTGT GTATTGGCAC GAACCACAGC TTGTCATGCC AAAGGATGGA CAAGATACAC AGAAATTTGC AGGTGTTTGC TCGCAAGATA TTCCAGCGTT GCTCCACGCT TTGCAAAAGA AAAACATTAA GGCAAATTCA GCCAACGATC TGGACAAATC GGTGCAATGC ACGATAGACG AACTTCAGTT GATCGGACCG CCGGAAGTCA GACGCAAGGC GGAATCGGAA TTGAAAGAGG CACACTTCGT TGATCTTGCT CGGCAAGCTA TTTTGTCGTC AGAAATGAAC AATTTAGCTT TCCTTCGAAA ACTTTCCTGG GAATATCCCA ATCTCATTCG AAACGTGGAT GTAGACGACA TGACTTGCCT ACAGAAGGCG GCTAAGTCGG GACATATCCG TATTGCGCAC TGGCTGATTA CAGAAGCGGG CTGTCTTGTC GATGCTCGAC TTTTGAATGA CCTTTCAATA AAGAGCTACA TGAAGGCTTT GCTTAAAATG TACATGTAA
|
Protein sequence | MGKKRQSRPR LDPGAHIAII GSGLAGLSTA LSLEQGGFTN VHIYERDGSH DARKEGYGLT LTYNPTGVLH QLNVLEEIAQ SDCPSRSHYM FNANGEIQGY FGNAFARNRG WGQRGNLRVP RQRVRQILAS RLKITETHWD HKLVGVSGCE NGENICLAFQ LEGAAEEKLL VGADLVVAAD GIRSAVLQHA YPQAPPIQSL GIRLILGISS SFTHVHLKER GFYTLDSGKR LFVMPFARTA EWAVIDDNAT SCSQPTGEQY MWQLSFASSD DKTYSSTELL QQALFHCRDW HEPVQELMLS TSQESIWGTL LYDRNPEILH KHLQINDTLP RRILIVGDAC HAMSPFKGQG ANQALQDGRV LVKHLTSARV EIAVSNTQRE IVQRTASVVA ASRQASVYWH EPQLVMPKDG QDTQKFAGVC SQDIPALLHA LQKKNIKANS ANDLDKSVQC TIDELQLIGP PEVRRKAESE LKEAHFVDLA RQAILSSEMN NLAFLRKLSW EYPNLIRNVD VDDMTCLQKA AKSGHIRIAH WLITEAGCLV DARLLNDLSI KSYMKALLKM YM
|
| |