Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47962 |
Symbol | |
ID | 7203145 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 562747 |
End bp | 564027 |
Gene Length | 1281 bp |
Protein Length | 270 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182253 |
Protein GI | 219123898 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.581205 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACCATCGTA CGCGTCGTAC CCACACACAT TCATTCATTC ATTCATCTAT TGTCACCCAA TCTACCGAGA AACGTTGTGA GATCCATTCC GTACGATCTA CAAGCTCGAC CGACCTACAA CCATCGCACG CAATTCACTG TCAGTGACCT CGCAGTCAGA GTCCCACTTT GCTCGTTCGC TACACATACT CGATCGACGC CCAGCCACAA AACGGAACGT TTCTGTTGGT CCTTGGCCCA TTCGATTCTA CAGATATCTC CACTGGTTTG ACAAAGTGTG AATTGCCCAA CATGAGAACA GTATTTACAC TTTTTCTTTC CGCAACCTTG GTACGTAACT AGTTCAGTCC CACGACACCC CCTTCACCAC AAGGACCATC CATACAAATC GCCTAACGTT CGTCTCTACT CTCTGCTACT TTCATTCGGA CAGCTCGGCC GTTGGCTAGC GGCTGCTTCG GCACCCGTCC CGAACGTCCA GGTCACGCTC CGCGGCAAAA AGTACGACGT CACCGACGTG CGCACGGTCC AGGACTTGCA GGATCGCATC GAGGAGGTTT CGGGGATACT GGCGCCGCAG CAGGGACGGG TACTCTTTGA CGGCAAACGA TTGGAGTCGA CCGATGTATT GGCCGATGTG GGTGTCGCGG ACGGCGCCCA ACTCAATATA GTGCCTTCCA GTAAGGCCGC GGGGAAAGTC AAAAAGACCG CGACCACCAC CGAATCCAAA ACCGATTCCG CCGCCATGAT GGAGGATTAC CTGAGACAGG CCGGGCTGGA TGGGGACAAG CTGGATGAAC TCATGAAGGG CATGTCGGGA TCGGATGGGA AAGTACCTTC CATGGAAGAG AGTTTGGGAA TGATGAACGA AATGATGAAT AGCCCCATCT TTCAGGAATA CATGAGCGAT CCCGCGAAGC TCGAAGAGTC CCGGCAGATG ATTCTCAACA ATCCGATGCT TAAATCGATG ATGGCCGGCA TGCCGGGAAT GGAAGACATC CTCAACGATC CCGAGGCTTG GCGAGAAGCC ATGCAAGCAG CAGCCAGCCT CTACAAGAAT ATGGATAAGA ACCAACTGAC ACAAGCAATG ATGGGAATGG GTGGTATGGG CGGTGGTATG CCAGATTTTG GTGGAAACAT GTTTGATGGC ACTCTGGACA ATTCAGCCGC CGCAGCGGCA CTGGACGAGC TGGACGAAGA CGACTAAATC TTTTTCCGAT AAATACTACA ACATTCACAC ACTCATACAT ATACACATAC ATTTATTCCT GCATTGTCTC C
|
Protein sequence | MRTVFTLFLS ATLLGRWLAA ASAPVPNVQV TLRGKKYDVT DVRTVQDLQD RIEEVSGILA PQQGRVLFDG KRLESTDVLA DVGVADGAQL NIVPSSKAAG KVKKTATTTE SKTDSAAMME DYLRQAGLDG DKLDELMKGM SGSDGKVPSM EESLGMMNEM MNSPIFQEYM SDPAKLEESR QMILNNPMLK SMMAGMPGME DILNDPEAWR EAMQAAASLY KNMDKNQLTQ AMMGMGGMGG GMPDFGGNMF DGTLDNSAAA AALDELDEDD
|
| |