Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50632 |
Symbol | |
ID | 7199463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011701 |
Strand | + |
Start bp | 24957 |
End bp | 27087 |
Gene Length | 2131 bp |
Protein Length | 622 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185596 |
Protein GI | 219130911 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTAGCCACCA TGATGGTCGG CGTCGATAGT AGTCTCGTTT TGTCAACTTT AACGCTTATG CCGGTCGCCT AGTGCGCTCT ATGCGAGAGA AGGTCCTAAC ACCAAATATT GTCGATAGAA TCGCTGTTTA CAGTTAGCTG TATAGCTATA TCTGTAGAGA ATCGGTTTCC GCGAGAAGGA TTCATTATTT TGGGCTCTAA CCAGGGAGGT CTGTGTTAGC GAAGATTCAG TGTAGGAGAA ACATTTTGCA TTGCGCTTCG GTATGACTCC TTCGGCATCG TCTAAGAAGG GCAAAACTAA ACTCGAGGAG TTTTTTGAGG TCGCAGAACG GAGGAAATCG CTGACCTCAA TGGACCTTTG GCTAAAGATG ACATCATCGC AGAGCCAATT GTCAACGTTA AACCAGAGGC GAACTCATAA CGGCTCTTCA AAACCAAAGT CTTCGAGTTT TGCTCTTAAA GGTAATGGTC GCATGCCAAG AGAGTACGAG TCGGAGACAA GCTTGAAAAA TGGCCTGGCC GTGAGAAAGG AATCACACCC GCCTACTTGT CCGAAGGTAC ATACATCGAC ATCTATGGGA GGCATTGAAG GCATTAACAA TCGCCCACCC ATGTTATTAG GAGCGGGCCA TTCCTTTACG CGCACAGTGG AAGGTGCGAG TGCCGGTATT TCCGGCGTTG CTAGCGACGA CGACGATTCA ATTGAAATTT TGAAAGTTGT TCCGTCAGTG ATTGACAGTA TCGGTGCTGC CTTTCCTAGT GAAGCTCCCG TAACAGAGGA AGATGAAGAA CGTTGGCTGC AGCATGCAAT TAGGAAATCA TATGAAGAGA CGCGAAAAGT TTTGCCTTCT AAGCATTTTC CGCCACTGAC GCCCCGCACG TCTCCTAGGA TACTAGCTCG CAAACGAGCA CGGGCTTCGG AAAAGTATTT GGAACCTGAT ACGGAAGGAA AAGCAACAAG TAGCAAGTTT TTGAAGTTTT CTCCCGCTAA GTCCGTATTT CAATCAACAT CTCCGGCTGC ATCTAATGAT GATGACAGCA GCAAAGATAA GGCAACAACT AGCAAGGTTC CGCCTGGTGA TCACCTCTAT ACATTTAATG AGTTTGATCG GATGTTTGAC GATGTCTTCT TGGATTTTGG TATGGAAGTG ACGGATGTGG AAGAAGGCTG CCTAGTCAGG ACGGGTAACT CGAAAGTGTC CCCCAGCGAT ACCAAAGGAC AATCCAAAGC GCAATACGGT AGACTCTTAC CGACTGCCAC CCAGTACCTG CTCGCGGATG CACTGTGTGT AACAAAAAAG GACACATTCG TTGATATTGG ACATGGTATT GGTAATGCAG TCATCCAAGC AGCCTATACA ATGGGCTGCG AATCGCGCGG TATAGAAGTT ATGGCCGGTC GCAATTTGGT GGCCGAGCTC ATTATGGAAA ATTTAGAAGG ACAACGAAAA GTACATCATG AGCGAGATAA TAGGAGCGTG ATTGTTGGAA AAATTCTGCT GCGGCATGGT CGTTTAGAGG TCCCTGAACA CCGCACATTT TTGACCAATC CAGAAGGTGT AACCAAGGCT TTTCTTGACA ATTTCAATGG CGTGTTTGCG GATCGATCGG CCAAGCTACG ACAACGGTAC ACACTAGACC AATACAATGC CGGACTGTTT GCATTGATGA AACCCGGATC AATGCTAGTT GCTTTGCACA AGTTAGACCT AGGTCCAACA TATTTGGAGG CCAACACCTA TCGGAAGCGA CACAATTTAG CCAATCAGGA CAATATCTTG GCTTCATTTT ACGCTCTGGA AGAATTCAGC TTTGGACCTG CATGTAACGC TGTGACTTGG TCACAAGGTG GTGGTTGTAC AGATGCTATA ATTGGGTACA AGTATACCAG GCTAAATCAG AAGACGCCAG AAGGCAAGGC CGTGTTTCTG TGCTGCAATC GTGATTGTGA CATAGCTCGT GCGGGAACAC CAATTGATGC CACGCGCCTA GTGGAGTCGG ATGAGGGGGA TGGTACACGG GTTGTAATTA ATACATGCAT TTGCAAATAC ACACCAATTG CACCTCGATC GGGACGTAAC AGGAGGCCAA GAAAGTTCAG GGAATCATTG TCGGAAGAAT CAGATAGTTG A
|
Protein sequence | MTPSASSKKG KTKLEEFFEV AERRKSLTSM DLWLKMTSSQ SQLSTLNQRR THNGSSKPKS SSFALKGNGR MPREYESETS LKNGLAVRKE SHPPTCPKVH TSTSMGGIEG INNRPPMLLG AGHSFTRTVE GASAGISGVA SDDDDSIEIL KVVPSVIDSI GAAFPSEAPV TEEDEERWLQ HAIRKSYEET RKVLPSKHFP PLTPRTSPRI LARKRARASE KYLEPDTEGK ATSSKFLKFS PAKSVFQSTS PAASNDDDSS KDKATTSKVP PGDHLYTFNE FDRMFDDVFL DFGMEVTDVE EGCLVRTGNS KVSPSDTKGQ SKAQYGRLLP TATQYLLADA LCVTKKDTFV DIGHGIGNAV IQAAYTMGCE SRGIEVMAGR NLVAELIMEN LEGQRKVHHE RDNRSVIVGK ILLRHGRLEV PEHRTFLTNP EGVTKAFLDN FNGVFADRSA KLRQRYTLDQ YNAGLFALMK PGSMLVALHK LDLGPTYLEA NTYRKRHNLA NQDNILASFY ALEEFSFGPA CNAVTWSQGG GCTDAIIGYK YTRLNQKTPE GKAVFLCCNR DCDIARAGTP IDATRLVESD EGDGTRVVIN TCICKYTPIA PRSGRNRRPR KFRESLSEES DS
|
| |