Gene Information |
Locus tag | PHATRDRAFT_42601 |
Symbol | |
ID | 7195970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 554148 |
End bp | 555954 |
Gene Length | 1807 bp |
Protein Length | 531 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177109 |
Protein GI | 219110715 |
COG category | |
COG ID | |
TIGRFAM ID | |
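
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions (an assumption, though it agrees with the listed Gene Length) and that the unspliced coding region ends with a single stop codon. The variable and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 554148
end_bp = 555954
protein_length_aa = 531


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Length of the gene span on the replicon.
gene_length = span_length(start_bp, end_bp)

# Coding length: three bases per amino acid, plus one stop codon (assumed).
coding_length = 3 * (protein_length_aa + 1)

# Bases in the gene span that are not part of the coding region.
non_coding_length = gene_length - coding_length

print(gene_length)        # 1807
print(coding_length)      # 1596
print(non_coding_length)  # 211
```

Under these assumptions, 211 bp of the 1807 bp gene span lie outside the coding region.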
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTAACTGTCG CAGTCACACA TTTTAGCAGA TGCAATTGCC AGCAAAATAA ACGACCATCC GTAACGACTA CTTGGCATAG TACATTAGGA GATCGTTTTC CAAATCGAAC GTTTCACAAT TTCCTGTTTG TCTGGGCAGA ATCAAGTTTA GGTCGTAGCA AGAGCGTCGG TGAATCGGAT CCGATGGCTT CCGTTTGTAG ACTCTCTTTT GCAATTGTGT ACTGTTTAAG TGCATGGTCT TACACTGTGG AGAGTTTTCT TGCACCTCAG TTCTCCAATC TTAAACCCCT CGGTGTCTCC ACACTCGCTG CGTCGTCGAC GGCAGAGGTT ATATCGAATG CAGCTTCTAG CTTGGAAAAT TCTTTAGCGG CGGTACAAAA GGATATCTGC AAGACGCTGA AGGTCACCGT CGCACCGTCT TCTGTGAATC GGCTAGGTCT AGTGGCGACT GAGAAGATTA GAAAAGGTGA GGTCTTCTTG GCCATGCCGT ATGATGTACG CTATGAACTT TCCGCGGATC TTGCGCGCAA CGTCGTGTTC AAAGACGTAC TTTCAGAAGA CTATAATAGC TGGACAGGTG ATGCTGGCTT GATTGCTTTA CTAATTCTGA ACGAAGTTTG CCTAGCTGCA GACACCGGTC TAGGGACGAA GGAACCTATT CGTCAGAACT CGCTCCAAGC TTTTATGAGC GCGTGGGTTG CGGCACTTCC AGGTCCAGAG GACATCAATC ATCCTTTGCT GTGGTCCGAA GAAGACCAGG AGATTTTGCA ATCCTCATCC ACAAATCGAA TCTATCGCGT TTTGGATGAT ATAGAAGAAG ACGTAACGTG GCTGAAGACG AACGTTTTCG AGAAAGATGG CAACCGGTTC CCTGTATCGA TTCCATGGAA TGGCGAAGAA ATCCCTTGTT TTTCGCTGAC AGGCTTCAAG TGGGCAATGG CTTTGGCTCA ATCAAGATCC TTTTTTGTAG ATAACGCTGT CCGTCTCCTT CCCTTAATGG ATTTCTGTAA CCACGCGGAC GAAGGCACGG AGGAAGCGCG TGCGGGCTTT ATGGGAACCT TTGGTACAAC GAAAGGGGCG GAATTGGTTG CGGGCCAGAG TTACGAGGTG GGCGAGGAAG TCTTCATCTG CTACGGCCCC AAAAGCGCTG CCGACTATTT GTTGGAGCAT GCATTTTGTC CAGAACAAAG TTGGAAAACT GCTGTTTCAG AATTGTTTTT TGAAGTTGAC CCCAAAGATC GCTTCTACGA CGACAAGCTG GATATACTTG AATTCGAAAC TTATGATGCA TCCCCGATGG ACCCCGTTCA ATCCTTTGAT GTGGTGAGTG CGCCTGGAAG AGATGGTGAA CCAGACCCTT TTATGGTTCA ATTTGTTCGA CTTTGCAAGC TGGGTGCGAC GGATGCATTT CTCCTCGAGA GCATCTTCCG CAAAGAAGTC TGGGGATTTA TGGAGCTTCC CGTAAGTGAA ACAAATGAGC GAGATGTTGT TGATGCAATC ATGGAGGCTT GCCAGCTTGC ATTGGACGAT TTTTCAAAAT GTGCGGAAGG TGGCCCAGAG ATTTGCAGCA AACTACGAGA GTCGGAAAGC CAGGCGTTAA TAAGAACGAG AGACTTTTTG CGACGGGATC GAGAGGCTTT GGACCTGAAA GAGTACTACC AGCAAAGACG GTTGAAGGAC TTGGGTCTCG ACTCCGAGTG GTCTCCTGAA GATGATATGA GTCCTGACCT TGGATTTGGG CAAAGTCGAG CCCCAGGCGG TGCCGATTAC GATTGGTAAA AACTGAAATT TTCCCTTTAC 
TCATTTG |
Protein sequence | MASVCRLSFA IVYCLSAWSY TVESFLAPQF SNLKPLGVST LAASSTAEVI SNAASSLENS LAAVQKDICK TLKVTVAPSS VNRLGLVATE KIRKGEVFLA MPYDVRYELS ADLARNVVFK DVLSEDYNSW TGDAGLIALL ILNEVCLAAD TGLGTKEPIR QNSLQAFMSA WVAALPGPED INHPLLWSEE DQEILQSSST NRIYRVLDDI EEDVTWLKTN VFEKDGNRFP VSIPWNGEEI PCFSLTGFKW AMALAQSRSF FVDNAVRLLP LMDFCNHADE GTEEARAGFM GTFGTTKGAE LVAGQSYEVG EEVFICYGPK SAADYLLEHA FCPEQSWKTA VSELFFEVDP KDRFYDDKLD ILEFETYDAS PMDPVQSFDV VSAPGRDGEP DPFMVQFVRL CKLGATDAFL LESIFRKEVW GFMELPVSET NERDVVDAIM EACQLALDDF SKCAEGGPEI CSKLRESESQ ALIRTRDFLR RDREALDLKE YYQQRRLKDL GLDSEWSPED DMSPDLGFGQ SRAPGGADYD W |