Gene Information |
Locus tag | PHATRDRAFT_33770 |
Symbol | |
ID | 7197831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 313001 |
End bp | 314146 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178205 |
Protein GI | 219114819 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCCGCG CCTACCAGGT CCATCGGACA CCGGCGTCGT CCAACACGTA CCAAAATACC CAACCGTATC GGGCCATCCC ACAACATCAC GCCTGGCGAG AACATGGCAG CCGGAGAAAT CGGTGGAGTA TGGACGTCGA ACGTCCAAGT ATCCAAGACG ACCGGAAATA CGTACAGCGA TTGAACCACG TGCTGCAATA TTGGAACGGA TCACCATCAT CCCATACGCC GAGTAGAGGC GGCACCAGTA CTTTGTCAAA GTCCGAGACT GCCGGCGCGC TTCACGAACA ATCCGCACAC GCCTCCGCAA CTTCGCACGG AATGCCTTGG CGATCTTCAA TCGATGGCAG TTACAGCCAC GAACAACTGT TCTACATGCC CTTTTGGGAA TGGCAGGTAG AGTTTATGAA ATCGACTCTT ACCAATTTTC GAGGTCTGCC GGTGCGTTCC CGGTCGGGGC GGGACATGTC TTACGTGGAA AGCGGGACAA GCGTGCCGAC AAGGTCTTCT GGAAAACCAA TGCGCATGCA CACGTGTTGC TTTGCTTCGG ACGAATATAA ACAGATACGC TTGACGACTC TGGATGCGGG ACCCCGAACC CAAGTGTTTA CATCCTTGTG GTATCCCAAT CCAGAATATG ATTTGCCTGT ATTGGGCATC GACTTGCTAC AGTTCAACGA AAAGAAGCAT TTGTGCGTAG TGGACTTTCA ACCCTTGCAC ACGAGCGAAA ACGACCATAC AGTTGATCGT CGTCACGTAG AACCACGCCA AGAAACGTTG GCGTCGATTC GTTCACAGTA CCCTAGTTTG CAAGGGAGCA TGACGAAACG ATTCTACGAC GAAACGCAAT TCTTCTCGTC CCAAATGCTG CTGGCTCGCG ATCCACCGGA CGGCAGCGAC CCGACCCGCA TGGTGAACGA CGAGTTGTTC CCAGCCTATC AGCGGTACGT CGAAACGCAC GTGGAAATGG TGCAGTCGGC CACGCCGGAC CCGTCCACTG TACCCACCGT GCTGGATCGA CACGCGGCCT ACGACGAATA TTCAGCCGCC CGGGATCCGG CACACGCCTT GCTAGCTCGT GCGTTTGGAC AAGACTGGGC CGACGAGTAC GTGTACGATG TACTCTTTCC ACTCTCGCAA AAGTAG
|
Protein sequence | MVRAYQVHRT PASSNTYQNT QPYRAIPQHH AWREHGSRRN RWSMDVERPS IQDDRKYVQR LNHVLQYWNG SPSSHTPSRG GTSTLSKSET AGALHEQSAH ASATSHGMPW RSSIDGSYSH EQLFYMPFWE WQVEFMKSTL TNFRGLPVRS RSGRDMSYVE SGTSVPTRSS GKPMRMHTCC FASDEYKQIR LTTLDAGPRT QVFTSLWYPN PEYDLPVLGI DLLQFNEKKH LCVVDFQPLH TSENDHTVDR RHVEPRQETL ASIRSQYPSL QGSMTKRFYD ETQFFSSQML LARDPPDGSD PTRMVNDELF PAYQRYVETH VEMVQSATPD PSTVPTVLDR HAAYDEYSAA RDPAHALLAR AFGQDWADEY VYDVLFPLSQ K
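The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon (the gene sequence above ends in TAG). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 313001
end_bp = 314146


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1146 381
```

Under these assumptions the results agree with the Gene Length (1146 bp) and Protein Length (381 aa) fields.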