Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47502 |
Symbol | |
ID | 7202279 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 819479 |
End bp | 821171 |
Gene Length | 1693 bp |
Protein Length | 534 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181811 |
Protein GI | 219122976 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.839927 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGTCGTTCC GTGAAAAGAC CCATGGGAAT GTCGGAATGG GGGATCTCTC CTTGTCATCC ACATTCACAA CAACAACGAC GACGACGTGG TGGCATTCCA TCCGTACGGC ATTCGGACAC CGTCCGTTCC CTGCCCAACC CCGGTGGGTG ACGGAAGCGG CGGCGAACGC TACCCGGGCC GCCGCTCCCT GGGGCCAGCT CTGGCAACCA GACACGGCCC CTCGCGCCGT CCGAGCCGGT CGGGTGCGCA CCGTCGAGGA CCTCGACCAC AGTAGTAGAT ATGACAACAA GGACGACCGT ACTAGTGACC TCGACGGTAG CACAAATGAC GACAACAACG ACGATACCAA CAACGACGAC AACAGTCTGC GACTCTGTGT CCGCAATCAT TTGTCCATAC CCCTCGTCTG GTGTTGGATG GATGCCCAGG GTCGTCCCCA TCACTTTCGG AAACTCTACC CAATGGCACA CGCCGACACC GACACCGACA CTGTTACCCG GGATGATCAC GTCGAACAGA CCTACACGGG ACACGCCTTT GTCCTCGCCA CGGAACCATC CCCCGCCCAT TCCTCCGACA CCATTCCGTC CCTCGATCCC TCGACCATAC TCGGGGCCTA TAGACCCCAT CGACGACGAG ACCACACCGT TCACATATTG GAAGTCAAGT ACGTACCGAC AACAGACAAC AATGTCAGCA ACAACAACAA CGTCGTCGAG GGGATGGAAC TGTTGTCGAC CCGGAGCACG CTCCCACGTC CGACACGGAT CCCACCGAAT CCACGCCTTT GTACCCACCG ATATTGACAA CGACGACGAC GACAACAACC AGGACGACGA CAACCCACTC GACGTACCCC GGTCTGTACA ATTACACTTG CACATCGGCC GTCTCGATCC CACACCACTG GACACGGTCC GTACAAACAA ATACGTCCAG AGTACCCTCG CGGGCTGGCC CGTCCGTTTC CAGTCCAATT GGGACGGCGG TGACCCGACC CTCCGGGCCC GTCTCGAACA GGATTTAACA CACGCGGTAC ACTGCCTACC GCCCCACGCC GTGCGAACCC TGCGACGCAC CACCCCACTC TGGATCAATC GGGACTTTCG TTACGGACCC GCCGCCTCTC CCGTACGCGC CCGGGGACTC TGTTTCCACC CCTACGCCGA CTGGTTGGCC CACAACCAAT GCCATCCCGC CAAGCAACAC GGTGTCGAGC TCTACGACGC CGACGAATAC CGAAAGGATG CGGCCCTATG GGGAACCGGT GGTGTCCTCT TGCACGAATT CTGTCACGCC TACCACTGTC TGTGCGTCCC ACAGGGCTAC GACAATGCCG AAATTCAGGA ATGCTACCGA CTGGCCCTGG AAGAGGGACT GTACGAGAGC GTGCCCGTGC ACGGCCCGCA GGGACCGCAC GCACGGGCCT ACGCCTGTAC CAATGCCATG GAATACTGGG CCGAATTGTC CACGGCCTTT TTGGGTGGCG TCGACACCGA TACGGAATAC AACAAGTGGT TTCCGTTCCA CCGGAAGCAA CTCCGACAGC ACGATCCGAG AGCCTACCAA TTGTTGCAAC GATTATGGAA GGTTCCATGC GATAACGATA ACGATAACGA TACCGACGAC GACCGTATCG ACAACGAGTC AAAGACTGGA GATGTGCTGT CGAGCGGCAT CCCCACGTAC TAG
|
Protein sequence | MGDLSLSSTF TTTTTTTWWH SIRTAFGHRP FPAQPRWVTE AAANATRAAA PWGQLWQPDT APRAVRAGRV RTVEDLDHSS RYDNKDDRTS DLDGSTNDDN NDDTNNDDNS LRLCVRNHLS IPLVWCWMDA QGRPHHFRKL YPMAHADTDT DTVTRDDHVE QTYTGHAFVL ATEPSPAHSS DTIPSLDPST ILGAYRPHRR RDHTVHILEV KGWNCCRPGA RSHVRHGSHR IHAFVPTDID NDDDDNNQDD DNPLDVPRSV QLHLHIGRLD PTPLDTVRTN KYVQSTLAGW PVRFQSNWDG GDPTLRARLE QDLTHAVHCL PPHAVRTLRR TTPLWINRDF RYGPAASPVR ARGLCFHPYA DWLAHNQCHP AKQHGVELYD ADEYRKDAAL WGTGGVLLHE FCHAYHCLCV PQGYDNAEIQ ECYRLALEEG LYESVPVHGP QGPHARAYAC TNAMEYWAEL STAFLGGVDT DTEYNKWFPF HRKQLRQHDP RAYQLLQRLW KVPCDNDNDN DTDDDRIDNE SKTGDVLSSG IPTY
|
| |