Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50000 |
Symbol | |
ID | 7198703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 100948 |
End bp | 102980 |
Gene Length | 2033 bp |
Protein Length | 203 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184889 |
Protein GI | 219129423 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGCAT GCAGGCAGGC AGTCTGTCTG TACTGGCTAG TCCGTCTCGT CCATCTGGTT TGCACTGTCC TCGGAATGAC GAAAAGGAAA ATTCCCCAAC ACCGGACTTT TGGTGATCAC TCTTTCTGGG ACACCCACTG CCAATCCTAC CCTGTCTTCC GTCGACGCGA CGCCGTCCCT GACCCCGTCC GCCTCTCGTC GGACTCCGCG ATTGGAGTTA CAGACTCCGT CGTCGCCAAT ACGGTCCCCG TCAATCCGTA CCGCCGAATC CGTCCCGAAT CAATCCCTGC TCAACGGCGT GCCGGCGATG GATCATCCGA CTCCTGTACC TCACCAGAGC ACCGCCGAGA GCCAAGCCAA AACAAAACAC AGGCCGTGAA TGCGCCTCTC TCGAAGGCCT CATTATCACG GACCGGAAGG AGCGCTCCGA CCAGTATACC GCCAAGTGCG GAAAGCAGAG ACACCGGAAT GAAAGTGCCT TTCCCCGGCA CTGCCTACCC GTCGGAGCAA TCATCACCAA CGCAGAGTGT ACAAGCCCTT TTCGTCCATT CTGAATCCAT CCCAAGCAAT TCTTCGCCAC GCCTCCTCCA CTCCAACGGA CCCCAATATT GAGATCATCC GCCAAGCACT GCTGACTCCA TCTTCCCAGG GTATCACCTT GGCGGCGTAG GTCAACGCCG CCTCGAACGG CACGCCGTCC ACTGTTTCGC AAACACTCGG GACAACTACC ACGCCCGGTC ACGGGACCGA CACGCCTGCT CCTGGCGCGA CGCAAGCGCA CCGTAACGGA TCCTAACCTT ACCGACAGTA AAACCAGCCC CAACCGCACA CAACAAGCTT GCACTTTGGC TTCCGTACGC TGGACCGGGG GATCCGTCGT CACACGTGGG ATCCATACAT GCTCCAACAA GTTCGATGAA AAGCCATTCT GCATTGTGTT CATATAATTC AAAAACACAT TCGCGTTACA CTTGCCAATG AAGAGAGCTG TGTGTAAAAC CGGACATATT TGGGCATTTT GCCCGCTTGC AACCGTGCCG ACACCAGCTG AAAGAATGTA GCAATACCCG TTTCCATTTC TTCTTGTGGA AAGTGAAGAA TCGCTGGTAA CAGCACGTCA AATGTATTGG TCGCTGTAGC TCTTGACGAA ATCAGCTTTT GGAAAGCACC CAATATGTGA TTTTACTTGT CTCAGCAACT CTGGTGCCGC CTACGAATGT AGGCTTGCAT CAGACATGAC AACGCCGAAC ATTTCCACGC TTATCCCAAA GACCCGGCCC GGGTCAGCAA CGGGGGAAAG AGTGCTTGGT AGGCCGTATC CAAGCCATTA GGGCGATACT CTAGTAGCTG TGCCAAGACT TGGGAGACGG AAGGTGTGAA CTCCTCAATA TCCATTTGCA GCACTATGTC GAATAGTTCG AACAAGAGAG GTTGAGGTTC AAATAGTGCT GTGGCAGTGT GGTCTACGGA GCACCAATGG TGGACCATGC TCTCGTAATG CCGAGTAACA TTGGAGTATC GCACATGAAA CAGGACCGAG TGGTTGGTTC CGTCCTTTTC CTTCACCACG AAATGTGGAC AGTGTGCCGT ACCGTTGGCA TCCTTCCTTG GCTGCCATCT CATGGCGTAT GGTTGTCAAC ACATGGAGGC CATCCAGACG ATGGGATCCC GAATCGCGAC GGCAGCAAAA CATTGTCGGC CTCGAGGTCT TTGTCGTGAG CCGCTGTTGA AATCTTCGTC GTCCACTGCC GTGTGCCTAC CTCAAGGCAC CTGCCATCGT ATGCCCAGTG ATCCAGCAGG GCCGTAAACC TTTCTTGCGG AAGGAAGGTC GCAATTTTGA AGCAAGAGCG CGGCCAACAG ATATATATAT AACTCGCTTT CACGTTCGCT AATGTAAGTC CCTGTAGTTT GCACCAGGCG TCATCACGTT CCTTGACCGT GACCTTAGAA AACCCCCAGC TCCTGCATAC GCAATAGGCG CTCGTCCTCT GTGGCTTCTA ACAACCCCCC ATGTACTGCT TGACCGATCA AAG
|
Protein sequence | MQACRQAVCL YWLVRLVHLV CTVLGMTKRK IPQHRTFGDH SFWDTHCQSY PVFRRRDAVP DPVRLSSDSA IGVTDSVVAN TVPVNPYRRI RPESIPAQRR AGDGSSDSCT SPEHRREPSQ NKTQAVNAPL SKASLSRTGR SAPTSIPPSA ESRDTGMKVP FPGTAYPSEQ SSPTQSVQAL FVHSESIPSN SSPRLLHSNG PQY
|
| |