Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43595 |
Symbol | |
ID | 7197470 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 910955 |
End bp | 912583 |
Gene Length | 1629 bp |
Protein Length | 469 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178031 |
Protein GI | 219112559 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAAGAGAAG ACCATTCCTT TTGGGCAACA CACCACGCAA TAGAAACATT ATCAGAAGAA TGAGGCTGTA AGAAACGATA TGTCGCCCCG TCGTTCAGTT GTCCTCGGTT CCACGTTGAC CACCATTTTT GTTTCCATCG TAAATATGGT GTCGGCCTTG ACCGAATCGT TCCTCTTCGA GCAAAGAAAC GTTTCTCGTC GACCTGCCGG CGAGACCAAA ACAGTGCGAT GGATGGTAGA TCTTCCGGAA GAAACGTCGT CCGCTGCACT ATTACAAAGC TCTTGCGACA GGCTTCCAGC ATCACAAATG GCACGAAGCA TTCGAAAGTA CCGGTCGAGT CGAGGAAGAA CCGATTCTGA TGAAGCCGCT TCAACAGATT GTTTCACGCA AACACGGGAT CAACATGGAT CGGGCAGTAG AAAGAAAAGT GGGCCGCGAG ATCAACCGAA GACTGATGAC TTACTTCGAA GAAGAAGTGG GGTTGATCTG CCTTACTATT CCGCTATCAA GGCGCTACGA GCGTATAGCT CCCTCCATGG CAACCTCGTT ATTCCCCGGC GCTACCGAGT ACCATACACG AAAGATTATG CCAACGAGTG GCATGGTGTC GATTTGAGCA CCATCTATGA CATGAAATGG TGGCAACGGA ACGTCAAGTC GAAGCCTGAT CGCGTTGCTG AGCTAAATCA ATTAGGATTT GTCTGGGAAC GGCTGCAACC CGAGTGGAAT CTGATCCTAG AGGCTCTGAT AACATACAGG ACACTATACG GAAATCTTCT CGTACCAAGC AGCTACGTGG TACCACAAGG GGACAATCGC TGGTCCAAAG CAACTTGGAA GATCCCGTTG GGGAATTGCG TGTATCGCAT TCGATCGCGT AGCGATTTTC TGCGTGATGA CAACGCTGGA TCTCGTAGAG ATCAGCTAGA CGGGTTAGGA TTTGTTTGGG ACGCGCAAGA AAGACGCTAC CGGATTTTCT ATGCGGCATT GCGGCACTAC GCCAAGCTCG AAAAATGTGG GGCGTTTAGC GTTGGTCGGT CGATATCTAT ATCCATCCCA TCGAATTACA TTGTGCCGTC GGAAGATCTG TGGCCAAACG AATTGTGGGG GTACCCGCTA GGTGCAAAGT GTATCGCGGT GCGACAAAAG GATTTGTACG TAAAGGACAA ACCCGAACGA AAACAAATGT TGCAAAAACT AGGCTTTCAC TGGGGTGGAA ACGCTGATCT GGGATGGCTA CGGGTTGTAC ACGCCGCAGC AATCTACTCG CGAATGCACG ATAGATATTT AGACGTTCCC TATCACTTTA AAGTTCCGGC TCCATCAACG ACGGAACATG ATGGACAAGA GTGGCCTTGG CCTGAGCATC TCTGGGGTCT CCCCCTCGGG CAGCGACTGA AAGACGTGCG CACGAAAGAC ACGTACCTGA AAGGCGAGAA TGCGCCAAAA CGGCGCAATC AACTCGACGC TCTCGGTTTC GTGTGGAAAC CGAAACAAGG CCGTAAGCAA TCGAAAAATG GATAGAGTGT ATAGAATAGT CTGGTTTTAG TTTTACATGT AGCCGCGACC ACGCCCTCTG CGCATCGACG GGAAGATTTT GTTCAGATCC CCCCTTTTTT TCTCGCTGA
|
Protein sequence | MSPRRSVVLG STLTTIFVSI VNMVSALTES FLFEQRNVSR RPAGETKTVR WMVDLPEETS SAALLQSSCD RLPASQMARS IRKYRSSRGR TDSDEAASTD CFTQTRDQHG SGSRKKSGPR DQPKTDDLLR RRSGVDLPYY SAIKALRAYS SLHGNLVIPR RYRVPYTKDY ANEWHGVDLS TIYDMKWWQR NVKSKPDRVA ELNQLGFVWE RLQPEWNLIL EALITYRTLY GNLLVPSSYV VPQGDNRWSK ATWKIPLGNC VYRIRSRSDF LRDDNAGSRR DQLDGLGFVW DAQERRYRIF YAALRHYAKL EKCGAFSVGR SISISIPSNY IVPSEDLWPN ELWGYPLGAK CIAVRQKDLY VKDKPERKQM LQKLGFHWGG NADLGWLRVV HAAAIYSRMH DRYLDVPYHF KVPAPSTTEH DGQEWPWPEH LWGLPLGQRL KDVRTKDTRD HALCASTGRF CSDPPFFSR
|
| |