Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42457 |
Symbol | |
ID | 7196660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 120388 |
End bp | 121877 |
Gene Length | 1490 bp |
Protein Length | 450 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177020 |
Protein GI | 219110537 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0895277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAAGGACTT CCATCGCCCA CAGTTGGAAG ACGTAATCGA CGCCAGTTGA TGCTGTAGTC ACCATGAGAG TCAATATGGC ATCGGGAAAA GCTCGTTTCC ACGAAATCAG CGTGGCGCTG ATGATATTAG TTTTGTCTAC GACCGAGATT TCCAGCGCTT TTGTTCCGCT GCCAATTCTC TGTAGAGCCA AAGATTCCGT CTTGGGTTCG TCGGTAGGCG GCGATGGCCC ACCACCCTCT TCCGGAAATA ATGGTGACAA GAACGACTGG GATGACTTCT TAGATCCCAA CTTTAAAGAA TCGGAAGGTT TGCAAAAAGC AAGAGAGTAC ATGAGTGAAA ATAGTCTACC CATATCCTTC GATGAGGAAG CAGACGATGG CTTACTTGTC AATGATGACA GCAATGGGCA AGCGCAAGAT ATAGTGGTGG ATGAGAAGAA ATCGAATGTG TCATCCTCGG CACTTACTCG ACCCGACGGA GACGGGGGAT TATTCACCTC GGGGCTAAGC GCAGAGCAGC TTGCCAAAAA TCCATATGTA GCTGCCGTAT CCAGACTTAC GCCATCCGAG CTCATTAGCA AGTTCACTTC GACGGCACAT CCCCGGGTAC AAAATGCTGT GCGGCAGACC GTGCTCGGCC TAATCGGAGG CCTACCCAAA ATGGCGTTCG AAACTACCAC TATCACCACC GGGCAGCGGT TGGCGTCTCT CATGTTTCAG CTTCAAATGA CAGGTTACAT GTTTAAGAAT GCAGAGTACA GGTTGAGTCT TCAACAAAGC TTGGGCCTCG ATGGGCACTC CGTGAATCCG TCCACAGAAC GCTTGCTATC GGCAGTCGAC GACGAAGGCA GTGATGATGA TAATGATGAT ACACAAATGG ATACGCTCAA GGGGAAAATT CGAGGAAAGT TGCGCATCCG ATATCCCGGT TCAATGAAGA ACACATTAGA CGACCCAGAA AACCAAAACG ACGTGGACAA TTCGAACGGT TTGCAAATGG AGGTTGATGC GGCTGCGTAC ATGTCCGAGC TGCGATCGGA AGTCTCGCAA CTGAGAGATG AACTCAAAAT TACGCGCAGC GCGAAGGAAG ATGCTCTTCG CAAAGATCTC TTACTCTACA TTCGAACACT CCCGGAAAAG GAGCTTCGAT CACTGACCAA CACTATGGGT CCAGACGTAC TAGTGGCTAT GAAGGGCCTC GTCAAAGCCG TCATGACCGG AATTGGGGAG GATGAAATAG GACCCGAGAC GGTTACAGAG CAATCTAGCG AAGCCATGGC TCAACTATGT ATGTGGCAGC TCGCGATTGG CTACAATCTG AGGACGTTGG AAGTACGGGA AGAGATGAAG AAGTCGTTAA AAGGTAGCAC TGTGGGTGGG CAGGATGGCG ATTTGGCCAG TGGAGCGTTT GAGTAGTTTA CGTTAACAAG GCTCTTTTGG CAAAGCTACA CCTTGCCTTT AATTATGTAT TCATAGCCAG TATTGCAAGT
|
Protein sequence | MRVNMASGKA RFHEISVALM ILVLSTTEIS SAFVPLPILC RAKDSVLGSS VGGDGPPPSS GNNGDKNDWD DFLDPNFKES EGLQKAREYM SENSLPISFD EEADDGLLVN DDSNGQAQDI VVDEKKSNVS SSALTRPDGD GGLFTSGLSA EQLAKNPYVA AVSRLTPSEL ISKFTSTAHP RVQNAVRQTV LGLIGGLPKM AFETTTITTG QRLASLMFQL QMTGYMFKNA EYRLSLQQSL GLDGHSVNPS TERLLSAVDD EGSDDDNDDT QMDTLKGKIR GKLRIRYPGS MKNTLDDPEN QNDVDNSNGL QMEVDAAAYM SELRSEVSQL RDELKITRSA KEDALRKDLL LYIRTLPEKE LRSLTNTMGP DVLVAMKGLV KAVMTGIGED EIGPETVTEQ SSEAMAQLCM WQLAIGYNLR TLEVREEMKK SLKGSTVGGQ DGDLASGAFE
|
| |