Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21982 |
Symbol | |
ID | 7202993 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 285730 |
End bp | 288271 |
Gene Length | 2542 bp |
Protein Length | 641 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182362 |
Protein GI | 219124126 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGATTT TACTTGAAGC CGGTACGGCT AATTTCTTTG CAACATTTCT CAGAAATACA TGCTACTTGA ATTCTTCGAT TCAATGTTTG AGTCACACAC CAATTTTTCG GGAATACTTT ACGTCCAAGG CGTACCTCAA TGACATCAAT ACCACCAATC CACTAGGCCA CCAAGGGCAT TTGGCTCAAG TCAGTGCAGT GCTCATTAAT TCGTTATGGA AGCAGTTCAA TCAAACTCCT CAGGTACCTC TTCGTCGTGT CCGGGCACCT GGCTCGTACG CAATGGTCAA CGCGCCGTCT CTGACGCCAA AGACATTCAA GGATTCGCTA GGCAAGTTCA ATGATCATTT TGCTGGCAAT GAGCAACACG ATGCACAAGA GCTGCTCGCC TTTTTACTCG GTGGTCTTTC GGAAGATTTA AACCGAATAA TGGACAAGCC ATATATTGAG GCACCGGACT CGGACGGCCG ACCGGATCAC GAGTTAGCTG ATATTTGGTG GACAAATCAC TTGAAACGAG AAATGTCAAT CATCGTAGCT TTATTTACCG GTCAATACAA GAGCTTGTTG ACCTGTAGAT CCTGCAAATA CGAAAGCGCC CGCTTCGAGC CATTCTCGTT TTTGCAACTT CCGCTGCCTG AGGATGATCA GCTGACAGTT TCCCTGGTTG TGTATCCGTT GAAAGACGGT ACGGATACGC TGAAGTATTG TGTTCGTGTC AACAGTGATG GAAAGCTTCG CGATGTGTTG CTGGCACTTG CAATGTTACT GTATGTTGAG CAGAATGGGA AGGCCGTGTC GTCAAATTCA GCAGCAGACG AGGAAAGCGA AAAAGAAAGG AGTGAAAGAG AAGCTTTATA CCAAAAAATG GCACAGAATT TCTCTGTTGT GGACATGCGG GACGGTTACA TTTCAAAGAT AGCACCGGTA CGTATTCCCA AATGCGTTTA AGGCGAAGGT GATGAAAAAG ATTTTGTGCT GACAGTCCGT TTGATTCTTG CAGAATACAT GGTCTCTACA AGACCTCCAA AACAAAGAGA CCGGAGACTT ACCCCTCTTA CATGTTTACG AACTGGAAAG TCCGATTGAA GACTCGCCGC TGCAAGGTAA TGCAATGACA GAGAGCGACG GTGTGTCAAC AGATGAAAGT TCGGACGGCG AGGTTCTCTT CGTCAAACCC CGGGCATCCT TTTTGGCAAT TGCACAGCGG CGCTCGGAAC TTGTATCGCA AAATTCCCTG CACCCTTTGG CCCATCGTGT TTTTGGGACT CCAATTCTGA TGCGTGTGAA TGACCTTCAA ACTTGTACTG GGCGTGACTT ATACGACCTG ATTGCAGCAC GGGTTCGAAA CGTCGTACCC AAACAAGCGA TTCGGTTCCT TTCCGAGATT TCTTCCTCTA AAAAAACTGT CAATCTCAAA GAGCAATCAG TTGAACTCAC CAAAACAGGC AAACGACAAT CCGTCGGCAG AACGACAACA GACATGGAAG AAGTGTCTGC TGGACCTGTC CCTCGCTATG GATTCCGATT GCGCATTACG TCTCGTGATG GGCGTCGCTG TCTCATTTGT CGCTGGTTCG ACTGCTGCGT GGGTTGTCTC ATTCCAGATG ACGATGAATT TACCACTGTG TTGGACGGCG ACAGCATAGT GGTTGACTGG CATTTTGCGG TGGATTTAGC AACAGGTGGC TTTGGGCAAC GATTGACGCA ACCGGGGAGC TCGGCATCAA ATACGCAACA AACATTGGCA CGCACGCGTC ATTCAACAGT GTTCGTTAAG AATCACAGTT CTTGTGGCGG AGTAAAGGGC AACCACGCTG GTTCAATAAC GCTGGAGCAA TGTTTGGACG CATTTGCCGA GGAGGAAAAG ATTCCGGAAG CCTACTGCTC ACGGTGCAAA GACTTTCGTG TACAAACGAA ACGCATGAGT CTTTGGCGAT TGCCGCCGGT GGTGATCATC CAACTGAAGC GATTCCAATT TACGCAACAT ATGCGTCGCA AGCTTCGTGA TTTGGTTGTC TTTCCCATAG AAGGTTTGGA TCTATCACGC ATCATGGCTC CGGACTCGGT TGCTCCCAAA ACGGTCCTGA AAATGGAGAA CGACGCTGAA TCCAATGGTG AAGAGAGCAA CGGTGATACG CATATCGTGG GGCAGGATAG GCAGACCAAG GATGATGGTC GTTCTGAAAT GCTGTACGAT TTGTACGGCG TAGTGCACCA CCAAGGCGCT CTCTCGGGTG GACATTACGT AGCCTCGCTC AAATCGGAAT TCGATGGTCA GTGGCGGCTG TTCAACGACG CACAGATTTA TGAGATTCAC GATCGCGATG TAGTAGATGC GAGCGCGTAC ATTTTGTTCT ACATTCGTCG GGACGTTTCG AAGGCACATC TTTCCAACTT TTGGGAGACT TCGAAGGAAG GGACATTAAG CGAGGAAGAT ATGGATACTC TTCTCAAGGG CCGATCTGAT CGCTGCGTCA TTAGCTAAAA TAAGTTAAAA GATAAGGCTG TCGAGTATTG TAGTAAATGA AATCCAATAT TT
|
Protein sequence | MLILLEAGTA NFFATFLRNT CYLNSSIQCL SHTPIFREYF TSKAYLNDIN TTNPLGHQGH LAQVSAVLIN SLWKQFNQTP QVPLRRVRAP GSYAMVNAPS LTPKTFKDSL GKFNDHFAGN EQHDAQELLA FLLGGLSEDL NRIMDKPYIE APDSDGRPDH ELADIWWTNH LKREMSIIVA LFTGQYKSLL TCRSCKYESA RFEPFSFLQL PLPEDDQLTV SLVVYPLKDG TDTLKYCVRV NSDGKLRDVL LALAMLLYVE QNGKAVSPIE DSPLQGNAMT ESDGVSTDES SDGEVLFVKP RASFLAIAQR RSELVSQNSL HPLAHRVFGT PILMRVNDLQ TCTGRDLYDL IAARVRNVVP KQAIRFLSEI SSSKKTVNLK EQSVELTKTG KRQSVGRTTT DMEEVSAGPV PRYGFRLRIT SRDGRRCLIC RWFDCCVGCL IPDDDEFTTG NHAGSITLEQ CLDAFAEEEK IPEAYCSRCK DFRVQTKRMS LWRLPPVVII QLKRFQFTQH MRRKLRDLVV FPIEGLDLSR IMAPDSDDGR SEMLYDLYGV VHHQGALSGG HYVASLKSEF DGQWRLFNDA QIYEIHDRDV VDASAYILFY IRRDVSKAHL SNFWETSKEG TLSEEDMDTL LKGRSDRCVI S
|
| |