Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45339 |
Symbol | |
ID | 7199975 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 884613 |
End bp | 886247 |
Gene Length | 1635 bp |
Protein Length | 334 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179530 |
Protein GI | 219117471 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGTTTT GGAAGAAGGT ACGTCCCGAT TTTTCATCTC TGTTGTATAT CGCTAGTCGC GTGATTCAGA AAGCATCATA CGCGTCCTAA GCAAAGCGGT CCATATGTTG TCGTGCCTCT GGTGAACTTC CTTTCTTCGA CGCTTCATAT TGATTATATC AACATCTACG GTTGATATTT CCCAATGACG CTACATACGA TTCTGCTTTC TACATTTTCT CTCACCTCCG GGGTTGTAAG ATTCTTCTAC TGGTAAGCCT ATGCGCTTTC TCTAGAGTAG CGGAGGGCTT CGTTATCCGC TCGAAGAGCT TCTCTTCAAC CAGACATACT GCTGCTACTT ACCGCAAGAT CGATACGTGT CGTTTTTCCT CAAATCCCTC CGATCCCCAG CATGAAAGTT GTACCAGCTT GGTCTCAATC GAGCAACAAC CTGCTACCCC TTCCCGTCGG AAGTTTTTGC AGAAAGTGAC TGCAACCACC TTGGTCGTTA CCGCGGCCTC GTCCGGTCTC GTGCCCGTTG ATTCCACCGC TTGGGCTGCC TCCGGCACCG ATGATACTGC GATTCTCAAT CTGCCATCTC TGAATCTTAT TCCTCAGTTC TCCACCGCAG ATGACGTTCC CAGCGACTAC TTTTCCGACA ATCGCTACAT ATATGGTTTC GTGGAACGTA TAATTGACGG AGATACTATC CGAGTCCGGC ATGTACCGGG CTACGGACTC CGCCGCCAAT CCACACAACC GCTCCAACAG CGGGGCATCG CCAAAGACAC ACTCAGCATT CGCGTGTACG GCATCGATAC TCCCGAAATT GGCAAGAATA AACGACAAGT TTCGCAACCT TTTTCCGAAG AAGCCAAATC TTTTACCTCC AAACTCGTTT ACAACAAAAT GGTCAAAGTA ACCTTTTTGC GGAAGGACCA GTACAGCCGG GCGGTAGCGT CGGTGGAAAC GGTACCACCA CGATTCCTTT CTTGGATTCC CGGATTCGGG CCGAAAGATT TGTCGCTGGA ATTGGCCAAG GCTGGGTTGG CGGAGCTTTA TACTGGCGGT GGTGCCCAGT ACAATGTACG TCTATTGCGA TCCTTTTTGT TCGGCAGACG ACTTGGCTGT GGATTGGCAT GAAACACTGC AAAAGAGCTA TCGGACCCTT ATCTGTGTTG CTCCTTATAT TCCTACAGGG CAAGCGCGCG GAACTGGAGC AGGCTGTTGC GCAAGCGCAG CGTAAAAAGC TTGGTCAATG GTCTTTATCG GAATCGGAGC GGGTCAGCGC AGCCGAACAA AAGCGTCTCC TGAAGCAAGC AGCAGTGACC GGCACTGCTC CGGTACCAGT GTCCCGAAAC GACCGATCGT CGGGCGCGAT GCCTCTCGCG TCGACGAGTC GGAACGGGCA GACGGGTGTC GGCGAATCAC TGCTCGACGC GGCTGTCACT GGTCTAGAGT TTATGTAGAA ACACGTAGAC GACGGCCGTC TACGGCACAC CAGCTATACC GTCTTTTCCT CCTTCTCATC TGCAATGAGA TGCTTGGAAG AATGCTTTGC ATTTCTACAT AGCTTCTAGA TCTAGAAACA GTTCCAATGC AGGCTTTGAC AAATATAAAC GCTGAGTAGA TTCTAAGTGC AAGAGCTCTT TTGGT
|
Protein sequence | MMFWKKIDTC RFSSNPSDPQ HESCTSLVSI EQQPATPSRR KFLQKVTATT LVVTAASSGL VPVDSTAWAA SGTDDTAILN LPSLNLIPQF STADDVPSDY FSDNRYIYGF VERIIDGDTI RVRHVPGYGL RRQSTQPLQQ RGIAKDTLSI RVYGIDTPEI GKNKRQVSQP FSEEAKSFTS KLVYNKMVKV TFLRKDQYSR AVASVETVPP RFLSWIPGFG PKDLSLELAK AGLAELYTGG GAQYNGKRAE LEQAVAQAQR KKLGQWSLSE SERVSAAEQK RLLKQAAVTG TAPVPVSRND RSSGAMPLAS TSRNGQTGVG ESLLDAAVTG LEFM
|
| |