Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37270 |
Symbol | |
ID | 7202041 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 574557 |
End bp | 575878 |
Gene Length | 1322 bp |
Protein Length | 387 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181229 |
Protein GI | 219121762 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000276277 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTTCA CATTTGTTTT ACAAGTGTAC GTTAGCACAA ACGAAATAGC GTTTGTTTAC CAGCCAATAC CTATGATTGG CACGCGGTTT ACTGCGTCAG AGTCCCGTAT AAAGACAGAC GACACCTGGC GCGCTACTCG CAAGCATTCC AAGCAAGCAA TAGAATACAT TGCGTCGGAA CCAGTTCCTC GACCTTCGAT CGAAACTCTC GTTACTAGAC AAAACGTTAC GGGGGACGTT GCGTGGCTCC TAAATTTGGC CGTGGTGGGA TTCCCCAAGT GCGGCACATC CTTCATGATG CGCTATCTCG GTCGACACGA AGAAATAGCC ATGCTGACTG ACGGTGAACA CTGCGAGTTG TCGACGGGTA CACAAGACTC TGTCCTTATC AAGGCCCTGA TGGATGGGCT TCCCAGCGGT AAGATAGCGC GCGGCTTGAA ATGTCCGCGC CAGTTGGAAA GCCCCAGGGC CATGCTGAGC CTCTCCCGGT ACTTCCCGAA CACAAAGATA ATTGTTGGAG TCCGACATCC TGTCCTTTGG TAAGTAGTAT GCTCGAATAC AAGTATCGAT ATCCGGAAAG GGCTCTATCC TGATTTCTTC TTTTGATGAA ATTGGAACAG GTTTGAATCC TTCTACAACT GTAAGTGAAT ATGTGCTGAA TCCTTCTTTC GGGACTGTCT TTGTACGATA ACTCTCATCC AACATATTTG TAATCAGTTC GCCAGCAACG CACTTGGTTC AACCTGCTGC CAGCACAGGA ACTGATTGGA AAATGTGCGG ATTTGGGGCC CTTTGAAAAC GTGGCTTCGG TCTGTACCGA AAGAGCCAAC TTCCACGAGC CTTTGGCTCG CTTGGGAAAG ACGAGCATGC AGAGCGCAGA TGAGCAACAA TACTTTTCGG CCGACGCGCA GAATGTTTCA GACAACGTTG ATTTCTCCGA CAAGAAGATC TTTTTGTACG ACCTCGCCCA GCTCCAGGAC AAGGACCAAG ATCGTTCTCA ACTGCTTCTA CAAGACTTGC AGAACTTCCT GCAAGTCACA AAGCCCTTCC AGACGATGGT GGCAGAGCGA CAACCAGTCT CAAACGTAAC ACGCATTGAT ATTTGTGACC CCGAGTACAA CCGTTTACGC GAGGTACTTG TGGATACTGG AGTGAAGGCG TCGAGATGGA TTCGGCGATT TTTTTTTCAT GCCAAAGGTG TGACGGTGTC GTCTCCCAAA TTTTTGGACC AGGTGTTGGC CAAGTGGGAA GAAGATCCGT GCGAAGAGCG CCGGGCCAAG AGCAATTCCG CCCTGTCCCC GCCGAATCGG ACTACCTCGT AA
|
Protein sequence | MAFTFVLQVY VSTNEIAFVY QPIPMIGTRF TASESRIKTD DTWRATRKHS KQAIEYIASE PVPRPSIETL VTRQNVTGDV AWLLNLAVVG FPKCGTSFMM RYLGRHEEIA MLTDGEHCEL STGTQDSVLI KALMDGLPSG KIARGLKCPR QLESPRAMLS LSRYFPNTKI IVGVRHPVLW FESFYNFRQQ RTWFNLLPAQ ELIGKCADLG PFENVASVCT ERANFHEPLA RLGKTSMQSA DEQQYFSADA QNVSDNVDFS DKKIFLYDLA QLQDKDQDRS QLLLQDLQNF LQVTKPFQTM VAERQPVSNV TRIDICDPEY NRLREVLVDT GVKASRWIRR FFFHAKGVTV SSPKFLDQVL AKWEEDPCEE RRAKSNSALS PPNRTTS
|
| |