Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49277 |
Symbol | |
ID | 7195567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 454361 |
End bp | 456046 |
Gene Length | 1686 bp |
Protein Length | 553 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183875 |
Protein GI | 219127298 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACAATCACAA GGTGACAAGC CACGATGAAA GCTCTTCACG CCGAAGTAGC CAAAGATCTT CTGGACGCTC TCTTGGATGA TGAACTGAGT GACGTCTATC TGACCTCTTC CGATGATGTT CAAGTTCCGG CGTGTCGCTT CGTGCTGGCA GCACGCAGCA CGGTTCTGAA GCGTATGCTG TACGGTAGCT TCCGGGAGGC AAAGTCTTCT ACTATCTGCA TGTTGGGTTA CGATAGCGTG ACTTTGCAGG CAATCGTTGA CTTTTGTGCA CACAACGATA TTTCCCGTTG CACAGCGAAA TTAAAGCAAG ATGAAACTTC TATACGTCAG CTGGTGCATC TTGCACGAGC CGCTGACTAT TTGGAACTGC CCGGACTAGA ACAGCTATGC GAGTGCGAAA TTAGAGCACG AATGACGCTG CACCCGCCGC TGGCGTGTGC AGTATACGAC GAGGCCGAGC GCTCAACAGA AGTGTCGGAA TACGCCATGC AAATGCTGGA GTGTCGGCCT TACGTAGCGT TGGATCACGG AGGCGATCGT GCTACTGGTG GAGGCATCGA GTCTTTGCAC TCCGAACGAC TCCTAGACGT GGTGCACAAT ACTCAGCTTG GTGCGGGAGA GTTTTTCCTA TTCCGAATGA TGGAGCGGTG GTTCCAAGAC GCGCAGCTAA AAGACCACAA TGCCGCTCTC CAGACGGCAC ACAAGTGTAC CCAATATCTA CAGTTTGAAA ACATCGAGCC CAGCCTTTTG TTAAAAGAAG TCAAGTCTTG TGAGTTTGTG TCTTCCGAAC TTATTTTTGC CGCGGTCGCC AAGCAAGCAC TTCGAGCTTC GCAAGACAGA GTATGGAGTC TGGGATGCCG CGGAAAAGAA CATGTTGAGC GAGTGCTGGT TGAAGGTGCA GGAAACCGGG ACGCCAATGG CATGTATTAC TTGATTGCTC CTTTAGCCAA CAATGACCTT TACGCGAAAC GTGAAGTCGC ATGCGGCCAG CAGCGTGTTT ACACGCTGTC GTGCAGTACC AAAGAAGAAC AGTATGAGTG TCGCATTTTT TGCAGTACAT TGCTGACGTC TGGCGCAATT TTTAATTTGC AAAGCATGCA AAAGACGTCT GTTGTCGAAC CCGTCTTTCA GCCGGTTCTA CAAATAATAG AATTGAAGGA ACCCGAGAGC TCAGTACCGA CACCGATTCG TAAATACTAT CGAGTCCGTG TGTCGGATGG TGAAGTGCAC ATGGCGGGAA CGTTGGCCAC TGCTTGGAAC AGTATGGTGG AAAGTAAAGA GGTATTGGCA CGGTCAATAT GTAAGATCCT GCAATATGGT TTGTACGAAA TAGACGGAAC AGTCTGTATT CATATCCGGG AAATGTCGAT CGTAACGACC AACCCTGGGC AGCGATTTGG ACGTCCTATC CCTCTGGAAG AAGCTGATGG TTACAGCCGG GAACATCAGT CCGAAAACCT ACTGAGTCTC TACTCTTGTA CGCATCCAAG AGTCCAAGGT GCGAAGGATA TAAAGGTCCC TCGAACTGGT TGGCAAGTCG ACGATCAAGG CGATGGACCC GTCCCATCGT GCACTTGGAT GCCACCGTTG CGCAAAGCAG ACAACCTAAA TACATCATAT ACGTCCTTGC GGTCCTCTAC TTCACCTATA TCAGGCAATA CTGAACCGCA CAATGGCTCT TCTTAA
|
Protein sequence | MKALHAEVAK DLLDALLDDE LSDVYLTSSD DVQVPACRFV LAARSTVLKR MLYGSFREAK SSTICMLGYD SVTLQAIVDF CAHNDISRCT AKLKQDETSI RQLVHLARAA DYLELPGLEQ LCECEIRARM TLHPPLACAV YDEAERSTEV SEYAMQMLEC RPYVALDHGG DRATGGGIES LHSERLLDVV HNTQLGAGEF FLFRMMERWF QDAQLKDHNA ALQTAHKCTQ YLQFENIEPS LLLKEVKSCE FVSSELIFAA VAKQALRASQ DRVWSLGCRG KEHVERVLVE GAGNRDANGM YYLIAPLANN DLYAKREVAC GQQRVYTLSC STKEEQYECR IFCSTLLTSG AIFNLQSMQK TSVVEPVFQP VLQIIELKEP ESSVPTPIRK YYRVRVSDGE VHMAGTLATA WNSMVESKEV LARSICKILQ YGLYEIDGTV CIHIREMSIV TTNPGQRFGR PIPLEEADGY SREHQSENLL SLYSCTHPRV QGAKDIKVPR TGWQVDDQGD GPVPSCTWMP PLRKADNLNT SYTSLRSSTS PISGNTEPHN GSS
|
| |