Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_51040 |
Symbol | |
ID | 7201972 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 803362 |
End bp | 805478 |
Gene Length | 2117 bp |
Protein Length | 607 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181264 |
Protein GI | 219121835 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCCCCCGACG AGGGGCAGAT TGCCTCCCGT TTCATGTATC GATTGATACA TTTATGGTTC TCTTGTGAGG ATGAGGGAAG CTCAAATGGT TCGCTCGTGG AAATCATGGC TGAAGCTGTG ACTCGACTTC CATCATTTCG GTTCGTACCG GCAACCAGCC AACTTTTGTC TCGTGTAGAA AAAAGAAACA CAGGCTCTTT TCAAGAAACT TTGCACACGT TGATCCTCCG AATGTGCCAC GATCATCCTT ACAATTGCCT AGTACAGGTA GTTTCGTTGG CCAATGGAAA GACTATAGGT AGTGGTGTCA GCGGACGACA GGCAGAGGTA TTCCTCGAAA ATACGAGCGA CACGAAGGTT GATGGCGCAA ATGAGATTCT TGCATCCCTT AGAACAAGCG AGCGCAAACC TCTTGGGAGG TTGATGGAGG GTTACATCTC GCTCACTGAT GCATACGTCC ACTTGGCGAT GTATCCAACG CATGACTTCC AGAAGGCCAA AAACAAAAAG TTTCCTTTTT CAGCAGTCAG CAAATCCCAC GCCGAACGCC TGGACCAATG CCTAGGCGTG GGACGCCGCA AAGTGCCTCA CCCACCTTGT GTCTTGACCA AACCACCGCC AATTCGTCCA GCAAGTGACT ACACCGACGA AACGGGGCAG CTCATTGGGT CTGAGAGCGT CGTTGGTTTT GAGCAAGCCT TTTCTATCAC CGAGAGCGGT CTGCATCGCC CCAAGATTGT CTACTGTCTT GGATCCAAAG GTGGTCGTTT TAAACAGCTG GTAAAGGGAG AGGACGAGAT CCGACAGGAT GCCGTAATGG AGCAAGTTTT TGGTTACGTC AACGAATTGT TGTCAAACGG GGATCTGTCA GACAGCCTAG ATGAGATCCG GCGGACGACT GGAGCTGGCC ATTTGCGTTT AGTGACTTAC AATATTGTTC CTTTGAGTCC AGCAAGCGGG GTAAGTGATA AAGGACTACT TACAGGACGT GCATCATCAA ACTCACTTCG TTTTCATTTC TGCAGGTTCT AGAATGGGTC GATCATACCA TTCCATTCGG GGAGTTCATG ATGGACAAGA AAGGTCACGT CGGTGCGCAT TCTCGGTATT ATCCTGGACA ATGGAGCAGC CTTGTTTGCC GGGAGCAGCT GCGGAAAGCA CCGAAAAAGG AAAAACTTCA AGCCTTTAAT GCAATTTGCT TAAACCACTC GCCCGTCTTT CGATACTTTT TCGTGGAGAG GTTTGGGCAC ACGCCAGAAT TATGGCACGA GGCTCGGATG CGCTACACGC GGTCCGTTGC TGTCAATAGT ATTGTTGGGC ACATTCTTGG GATCGGCGAT CGCCACTGCA GCAACATTCT TATTCATGAG GGGACTGGGG AAGTCGTACA CATCGATTTT GGTATAGTCT TCGAACAAGG AAAGGTACGT TCCTTTTGAA CCCTAAGAGG ATACCGCTCT TTTCTTCTTA ATGGAGTCGT ATCGCTCAAG GCATATTTTA TTTCTTGACG ATAGCTCCTG AACACGCCCG AGCTAGTGCC GTTCCGACTG ACGCAAAATA CAGTGGATGG GTTCGGCCCA GTGGGACTCG ACGGTACCTT TACCAAATCC GCCCAACGGA CTTTATCCGT TCTCCGAAAG AATTCAAACG CGCTCCTGAC TATTCTGTCC GCGATTGTTT CGGATCCGCT GTACAAATGG AGCGTAAGTC CAGTCAAAGC ACGGCTGCGG CAAGAGCAGC AACAGCATCA GGATGAGGAA GAACAAGGGG AGAACAAACG CACATCGATG ACAACCACAA CGAGTTCCAC CAAAGTTTCA CGTAGCCAAG GACAAGAAAA CGAAGCCGCG TCACATGCCA TTCGGAGGAT ACAAGAAAAA CTCCAAGGCT ACGAGGATGG CACATCGGGC GAGCAACAAA GCGTGGAAGG ACAGGTTCAG CTCCTGATTA ACTCCGCGAA GAACAAGGAC AATTTGTGTC TCATGTTCTG TGGTTGGGCG CCGTGGGTGT AATTTCCACC AATAAATCTA CATTACTTCT GTTTCGCGAG ACTCTACGAG AGTCAAAAAA CGGACAGTGA ATTACTGTAA TGAACTAAGT AGATGTTGCC AAAATGC
|
Protein sequence | MYRLIHLWFS CEDEGSSNGS LVEIMAEAVT RLPSFRFVPA TSQLLSRVEK RNTGSFQETL HTLILRMCHD HPYNCLVQVV SLANGKTIGS GVSGRQAEVF LENTSDTKVD GANEILASLR TSERKPLGRL MEGYISLTDA YVHLAMYPTH DFQKAKNKKF PFSAVSKSHA ERLDQCLGVG RRKVPHPPCV LTKPPPIRPA SDYTDETGQL IGSESVVGFE QAFSITESGL HRPKIVYCLG SKGGRFKQLV KGEDEIRQDA VMEQVFGYVN ELLSNGDLSD SLDEIRRTTG AGHLRLVTYN IVPLSPASGV LEWVDHTIPF GEFMMDKKGH VGAHSRYYPG QWSSLVCREQ LRKAPKKEKL QAFNAICLNH SPVFRYFFVE RFGHTPELWH EARMRYTRSV AVNSIVGHIL GIGDRHCSNI LIHEGTGEVV HIDFGIVFEQ GKLLNTPELV PFRLTQNTVD GFGPVGLDGT FTKSAQRTLS VLRKNSNALL TILSAIVSDP LYKWSVSPVK ARLRQEQQQH QDEEEQGENK RTSMTTTTSS TKVSRSQGQE NEAASHAIRR IQEKLQGYED GTSGEQQSVE GQVQLLINSA KNKDNLCLMF CGWAPWV
|
| |