Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50034 |
Symbol | |
ID | 7198730 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 210577 |
End bp | 212796 |
Gene Length | 2220 bp |
Protein Length | 547 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184916 |
Protein GI | 219129480 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0110404 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGTTGGGTT TACGCGTTTT CGTCTTTACG GAGGAGCGCA TTGTTGGAAT CTTGAAAAGC CATTCGGTCT CTTCTCTGAA CGCTGTCGAT TCTGTTTCTT TAATCGGATA GGCTGTCTTC CAAGAAGTTT GCAAAGGTAG CTCAATTGAG GAAAACGGGC GGGCCCTTGA CTTGAACTAG GTATTCCTTC CAGTCAACAG TGCCATTAGA TCGGAACGAG GAACTGGAAT CACTCTATTC CTTCCGTAAA TCCCCGAGCA AAGTAAACAA TAGCTACGAC GCTGGTACGA TGAAGTACAT TGTGGTAACA GGTGGCGTCG TGTCGGGCTT GGGCAAGGGC GTGACGATTT CTAGTATGGG TCGTTTGCTC CAGGCCAGCG GTCTCCGAGT TACGGCCGTC AAGATTGACC CCTACCTGAA CGTGGATGCC GGGACAATGA GTCCGTTCGA ACACGGCGAA GTTTTCGTGC TGCGAGACGG TGGTGAATCT GATCTTGATC TGGGAAACTA CGAGCGCTTT TTGGGCATTG AACTCACCAG TGATCACAAT TTAACCACGG GCAAGGTCTA CCGCAAGGTG ATTCTCGCGG AACGCCGTGG TGACTACCTC GGTAAAACAG TGCAAGTGGT GCCGCACATT ACCGATACCA TTCAAGACTG GCTCGAAAAA GTCGCGTATA TTCCTGTCGA CGGTACTGGC AAAGAAGCCG ACATTTGCTT GATAGAAGTA GGGGGAACGG TCGGTGACAT TGAGAGTTCC GTCTTTTTGG AAGCCCTCCG ACAATTTCAG TTCCGGGTCG GGCACGACAA CTTTTGTCTC TGCTTCGTCT CGCTTGTTCC AGTCTTGAGT GACGAGCAAA AGACCAAGCC GACCCAGCAC GGAGTGCGCG ATTTGCGCTC ACTCGGACTC AGTCCGTCGA TTATTTTCTG CCGTTCCACG GAACCCTTGC AAGAGCCAAC CAAGCAAAAG ATATCCAATT TTTGCCACGT GCAAGCTAAG AACGTATTGA GTGTGCATGA TGTAAACAAT GTTTACTTTG TACCGGGGCT ACTGCAGGAG CAGAACTTAC ACGAGATTTT GGGCAAGGAA CTTTGCCTCG ATAAGCCATT AAATCCAGAT TTGGGATCAT GGACCACCAT GGCGCATTCA ATCGAACTAG CATCTCATAC GGTGACGATT GCCTTGATCG GGAAATACAC TGGGCTGCAG GATTCTTATT TGTCGGTCAT AAAGCCTTGC GTCATGCGGC AATCGCATGC AACGTACGTC TACAGTTGGA ATGGATTGAA GCGTCGCAGC TCGAAGACGA AAAGGAAGAA GGATACATTG GGAGTTGGGA CAAACTCAAG GCGTCAGACG GTGTCATTGT TCCTGGAGGG TTTGGACAAC GTGGCTGGGA AGGCAAAATT TTGGCCGCCA AATACTGCCG TGAGAATAGG AAGCCAATTC TCGGAGTTTG TCTAGGTTTC CAGGCAATGG TCGTAGAGTA CGCCAGGAAC GTTTTGGGAA TTGACCAAGC CGATTCAACA GAATTTGAAG AATCCACACC AGAACCTTTT GTTTTTTTCA TGCCCGAGAT TGACAAGGAG ACTATGGGTG GAACGATGCG ACTCGGGGCG CGCACCACCA AGTTTACACA CACCCTTGCC GATGGAAGTA TGAGTGTTTC GCAACGCCTC TACGGAGGAA AAGAAATGGT TTCGGAACGC CACCGACATC GCTACGAAGT CAATCCAGAA AAGGTTGATG CCGTCCACGA CGGTGGCTTG CGTTTTGTGG GCCGAGATGA GACGGGCGAG CGAATGGAGA TAGCAGAGTT ACCGCAATCG GAACATCCTT ACTACCTCGG ATGCCAGTTC CACCCAGAAT TTCTCTCTCG TCCTTTGAAA CCGAGTCCTC CATTCTACGG CTTAATCTTG GCAGCCACTG GTATGTTGGA GGACCACCTG CAGTCCGTAT TGTAAAACAA CGGAATCTTG ATTCGTCTAT ACGATCGCGT GGGCTGCCAT GTCTTTGCAA TAAAGCTTAC GATTTCATGA TTACGACACA GTTGGCAATA AGGATTGCAA AAGGTTAAGG AAATGGAAAA TTAAAATGGT TACTTCTTTC ACAGTAAGTT CAACATCTCG ACAATTGCAA ATTTGGCAAC AGTTCGGTAT CAACCATAGA AAAATTGGTT TGAATCACAA AATCTAACCT ATAGTGGAAA TTAGCCATTG
|
Protein sequence | MKYIVVTGGV VSGLGKGVTI SSMGRLLQAS GLRVTAVKID PYLNVDAGTM SPFEHGEVFV LRDGGESDLD LGNYERFLGI ELTSDHNLTT GKVYRKVILA ERRGDYLGKT VQVVPHITDT IQDWLEKVAY IPVDGTGKEA DICLIEVGGT VGDIESSVFL EALRQFQFRV GHDNFCLCFV SLVPVLSDEQ KTKPTQHGVR DLRSLGLSPS IIFCRSTEPL QEPTKQKISN FCHVQAKNVL SVHDVNNVYF VPGLLQEQNL HEILGKELCL DKPLNPDLGS WTTMAHSIEL ASHTVTIALI GKYTGLQDSY LSVIKPCVMR QSHATYLEDE KEEGYIGSWD KLKASDGVIV PGGFGQRGWE GKILAAKYCR ENRKPILGVC LGFQAMVVEY ARNVLGIDQA DSTEFEESTP EPFVFFMPEI DKETMGGTMR LGARTTKFTH TLADGSMSVS QRLYGGKEMV SERHRHRYEV NPEKVDAVHD GGLRFVGRDE TGERMEIAEL PQSEHPYYLG CQFHPEFLSR PLKPSPPFYG LILAATGMLE DHLQSVL
|
| |