Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48122 |
Symbol | |
ID | 7203280 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 263581 |
End bp | 265211 |
Gene Length | 1631 bp |
Protein Length | 366 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182500 |
Protein GI | 219124417 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACAAG GATCGCGCCG ATTTTCGGTA CCAAGCCAAT CGATATTCTC GTCAAAGAGG TACGTACCAA GTTGGTTAGT TTCCAAGTTT TCTTGCTTTC GTCAAAAATA GAGAACGGCT CAATGAATAT TGACAACAGT GAGTGGTTGG TCTTTTGTGG TGATACACTG TGCCATTGTA GGAACAGCCG CCGTTTTGCG AGTAGTCGAA AACGGCCTTT AGAAAGACTG CCTTCTGATT GGACAGAAAG TGTCGGGATT CTAACACGTC GGACCATCGT TCGCACGTTC GCCAGCAAAC ATATGTACAA CAGGTTATCT TTAATCATTC ATAGTCAACA CGGGGTTTTG TCGTTTTATA GATATTCGAC GATGGCGAAT GTCATCTCTT TGCAGTTAAT TGTGGGTGTT CTATAACGCG CTTATCTTAC TTTTTGATTA ATCAGACCAC TAGAATCCCA GCAATTAGTT CTCAAATGAA GGGAATTTGC TTCGTTTTGG CGTTCTGGCT CTTTACAGCT CACATTTCGG CGTTCCAACC GACGTCGCTT TCGATTTTTC AGAGTTCCGG GGCCCAATTA CGAGTCAGGC GAACAGCAGT ATTTCTGAAG GTTGACCGTG ACAATTCGAT AGAGGGAGCA GAACTATGTT GGAATCCCAA GCTCCGTCGC GTTATGGGCC TGGTTGCTTC AGCTGGCATG ATTGAGACGG CCTACCTAAC CCTTACGAAG TTGACGGACA AGGTGGATAT CCTTTGTGGA GCCGACGGCG GCTGCTCCTC CATTCTGAAC GGTCCATATG CCTTCATTCC AGGGACAAAC ATTCCTCTAT CCTTATTAGG TTTCGTGGCC TACGCTACTG TAGCGTTTCT GGCCGTGGAG CCAATCCGAA CAAATGAAGA AAACGACCAA AGCAACAGAG TCTTGTTGAC GACAGCGACC ACCATCATGG GCGTCTTTTC CGTCTTTCTG ATGTCTATTT TGTTTGGTGT GCTACACGAA TCATGTCCAT ACTGTATTGC ATCAGCAGTG TTCTCTATAG TGTTGGCGAA ACTGGCATGG TTGGGCGGTG CCCTACCCCA GGAGCGCGTT AAAGAGGGCG TCGCCACAAG CGCCGGAGGA GCCCTGGCGG CCTTTGCGGC CGCCACCGTT TTTTACGTGA ATATCAACAA CAACATCAAC CAACCGTCAT CGCAAGTGAA TTTTGCAGGA AATTTTTTTG GCAAGCCGAC CCTGTTGGCC GACGCAAGCG GCGCCAGCAG CAAACAGCTT TTGTACGAGC CTCCTACCGT TTCTACTGTT AGTTCCGAGC GGGCCTTGGC GCTGTCATCT CAGCTACAGG CGCTAGATAC AAAAATGTAC GGGGCATACT GGTGCTCGCA CTGCTATGAC CAGAAAGAAC TTCTCGGCGT CCAGGCCATG GCCAAGATTC CTTATATCGA ATGCAGCAAA GACGGTGTAA ACTCGCAGAC CAAAGCGTGC AAAACTAGGG ATGTGCCGGG CTATCCGACA TGGGAAATCA ACGGTAGGCT TTTTCCGGGA GAGCGAGAGA TCGATGAATT GGAAGATATT GTACGGCAAG TAAAAGTTGA ACAAACAAAA TAAAAGCGAA GACGCCGTCT TTTTTTGTGT C
|
Protein sequence | MVQGSRRFSV PSQSIFSSKS SQMKGICFVL AFWLFTAHIS AFQPTSLSIF QSSGAQLRVR RTAVFLKVDR DNSIEGAELC WNPKLRRVMG LVASAGMIET AYLTLTKLTD KVDILCGADG GCSSILNGPY AFIPGTNIPL SLLGFVAYAT VAFLAVEPIR TNEENDQSNR VLLTTATTIM GVFSVFLMSI LFGVLHESCP YCIASAVFSI VLAKLAWLGG ALPQERVKEG VATSAGGALA AFAAATVFYV NINNNINQPS SQVNFAGNFF GKPTLLADAS GASSKQLLYE PPTVSTVSSE RALALSSQLQ ALDTKMYGAY WCSHCYDQKE LLGVQAMAKI PYIECSKDGF FRESERSMNW KILYGK
|
| |