Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42955 |
Symbol | |
ID | 7196198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1622745 |
End bp | 1625319 |
Gene Length | 2575 bp |
Protein Length | 755 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176820 |
Protein GI | 219110137 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTTCACGCC CGATTCTGTT GTGTTCACGG AAAGAAAATT AACAAGTGAG CTATCGAAAA CGGAGGGCCG TGACACACAC TCACGAGTTT CCATCGGACG CATCCCTTGG GGCTGGAGAA AGGCAACTGA ATAGTAGCTG TGAACGAAAA CCAAAAAAAC AACATTAACG CCGCAGAAAG GGGAGTCGGA ATTTCCAAGG AGCACTTGTT CTAGCTAGCC AGGGATTCAC ACTTAGCCTT TCGAAAGCGT CCGTGAGGTT TTCTCCCGAA AGGGTGGAAA GTTGCGCTAG CCGAAACTTC CATCACGATG AAATTCAGAA GGTCATCGAA AAGCAAAAGT TCGAAGGAGA GCGATCCCAG TATGGAGCAA GCGCCGAAAA TCGTATCCAT CACTCCTGCT CCTTCTCATG TCCTGCATTC GCCCAGAAAG GCGGATCTTT TCGACTCGCA CCTTGCGAAA GTCTTTGCGA ATGGGATTTC GGTGCCCGTA GATCCGGACG GCCGGATTCA GCATCATCGT GAACTGGCAA GAGACTTGAG GCGCAGGTCA TCCTCAGCTC AGAGATCTGT CCAACAACTG ACTATGCCGT ATTCTAGCTA TCATTCCCGT TCTTCCAGAG CGAGAAGCAT CGAGCGTGGT GGTTCTCGGT CTGGTGGTCA TCCTCCAGAG GTCCAAAATT TTCAGCCACA ACAGTACATC ACAAGTTCCG TCCACAACCG AGACCGAAGC TGGCACGCTC ATTCTATGCC TTCGCTTCCA TACAAAGAAG ATGTATACAA CGGATATCGT AATGTGCAGC ACCAGAATGC TACTGCGTAT GATGCTTACG CAGACTGTAG ACTAAGACAA CGGAGTAACG GTCGCAGTCG GAGCACAGAG AGAAGGAGCC GTCCACAAAG TTATGACCAT TCACGAAGCC ACGAAGAAGA TATAGCGCAG CCCGGCGAAC TTCCGATTCA TCATGATGGC CCAGTGGAAG ATCCCCGGCA GTCTAATTTA TTGTATTCGG AGCCGCCTTC GGATGATTCC AATGATCGGT CGTTGGGGAA AAAGCGAAAA CCCAAAATGG AGAAAATTCA AGAACTGAAA GCGAAAAACG ATTTGTACAA GGAAGAATTC AAGAGGGTGC AAAAGGATAG AAAGAAGCTG AAAAAGGAAG TTGAATATAA GAAGAGCGAA ATTGCCTCGC TAACAAATGA GATTGATTCT CACATTGAGG AGACTTCCGT CCTCAAACGG AAGCTATCAG AAGCGTTGCA GGAATTGGAT AGGACGGATC TCAGCAGCCG CAAGGATAAG AGCAATCTTG TGCGAGTCAA CAAAGAGCTC GTAGAAAGCA GAGAAGAGCT AGATGCTTTA AGAACGCGCA TTAAAGAATT GAACAATGAT ATTGCAACGC TTCACGATGC CGTGAAACGG AAAGATGCCC AAATTGATTC ACTCACAACT GAAGTAACAG AACAAACAGG CTTGATCGAA GCGCTCAGGA ACGAGAATAG ACTCGAAGCA AATGGGAATC AGTATTTGTT TACTAGAATA AACGAAGAGA AAATACAGGG ATTGTTGGAG GAAAATAAGA ATATACAGAA GGAACTGGGC TCGACGTTGG AACGTGCTGC TGCTATGGTA AAGGATCGTG AAGATGCAAT TGCAGATCTG CTCAAAGAAA ACGATGAGAT AAAGAAGCAG CTTTTGGGTA ACTGTCAAGC GCAAGAGTTG CGACCAATGG TTTCTGCGGA TGATCTGGAA GAACTAAAAG ACAACCTTGA CAACACACAT CGGGCACTAG AAGAAGCACA AGATCGAAAT CTTGTATTGG AGGAGGAAAT TGAAGGCTGG CTCGCTCGTG GCGGATCTAT GGAGAGCGAA ATGGTAAGGC TTCGTGATGA AGTCCTCTCT TGGAAGCAGA AGGCCGCAGC TTCGCAAGAC TCTGTGGCAA TCGTTGAAAC TAGCGCAGAA GAAGCGATGG TAGAAGCCCA AGCAGCTCGA AAAGCACTTG CGGAACTTGA AGAAATTCAT GCAAACAGTC TTGCCCGAGC AGAGATTCAA CACAAAGCTG CCATGAGCAA AGCCGAAGAG CGTCTTATGG AAGCATTGAT GGAAGCAAAA AAAGCAAATG ATCGAGCTGA AAATACGAAG TTGGCAACCG AGAGACTAGA GACCGATACA TCCAACTGCG AAAATATGCC CCACCAAGTT GAACCTAGTG ACGATCATGC GCGCCAAGCG ATGCTTCTAG AGCAGGCAGT TGCTTATCGT CGAAGCAAGA CGGGAGTCGC GAACAAAAAG GGATGGTTTT CTGGTCTTGG GATAAACGAT GAAGAAGAAC TGACCGAGGA CCAGAAGCGA ATAAAGGAAC TGGAAGCTAT TAATACAGAC CAAAATGAAG AAATTCAGAA GCTGAAGAGC GAATTAGTCC GATTGAGATC AAGCTACAAC GAAGCCACGT ATACTACAAA GAAAAAGATT GAGCGATTGG AACATGAAAA TGAGGCTTAC TCCTTGAAGG TATCTGCCCT AGAGCAAGCA TCGAACGACA TGGAGGAGAC AACAACGTAT AATGTTGCAA ATTGA
|
Protein sequence | MKFRRSSKSK SSKESDPSME QAPKIVSITP APSHVLHSPR KADLFDSHLA KVFANGISVP VDPDGRIQHH RELARDLRRR SSSAQRSVQQ LTMPYSSYHS RSSRARSIER GGSRSGGHPP EVQNFQPQQY ITSSVHNRDR SWHAHSMPSL PYKEDVYNGY RNVQHQNATA YDAYADCRLR QRSNGRSRST ERRSRPQSYD HSRSHEEDIA QPGELPIHHD GPVEDPRQSN LLYSEPPSDD SNDRSLGKKR KPKMEKIQEL KAKNDLYKEE FKRVQKDRKK LKKEVEYKKS EIASLTNEID SHIEETSVLK RKLSEALQEL DRTDLSSRKD KSNLVRVNKE LVESREELDA LRTRIKELNN DIATLHDAVK RKDAQIDSLT TEVTEQTGLI EALRNENRLE ANGNQYLFTR INEEKIQGLL EENKNIQKEL GSTLERAAAM VKDREDAIAD LLKENDEIKK QLLGNCQAQE LRPMVSADDL EELKDNLDNT HRALEEAQDR NLVLEEEIEG WLARGGSMES EMVRLRDEVL SWKQKAAASQ DSVAIVETSA EEAMVEAQAA RKALAELEEI HANSLARAEI QHKAAMSKAE ERLMEALMEA KKANDRAENT KLATERLETD TSNCENMPHQ VEPSDDHARQ AMLLEQAVAY RRSKTGVANK KGWFSGLGIN DEEELTEDQK RIKELEAINT DQNEEIQKLK SELVRLRSSY NEATYTTKKK IERLEHENEA YSLKVSALEQ ASNDMEETTT YNVAN
|
| |