Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47954 |
Symbol | |
ID | 7203139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 545569 |
End bp | 546639 |
Gene Length | 1071 bp |
Protein Length | 334 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182415 |
Protein GI | 219124237 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.237863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTGGAGTTT GATCGGTTTA TACAAAGATT CGCGAATACA AAGACCCAGA AATCGTCGTT GTCACCATGA AAGTTCTCCA AGCTTGTCAG CTTGCTATTG TTCTTCCTCT CGGAGTCATC GAGCCTTCGT CGGCCGCATC CTCGACGTTC AGTGCCTTGA AGTCCATCGA TTATCGCTAC TTTGTTGCCG GAGGAACTTG TGCCGCTATT TCCCACGGAA TCACTACACC AATTGATGTC GTCAAAACGC GCATCCAATC CGATCCAAAA AAATACAATC AAGGTCTCCG CAAAGCAGCT ATTAATATTG TCAAAGAAGA TGGTACCGGC GTTCTTCTAG GGGGACTCGG GCCTACTGTT GTTGGATACG GCATCGAAGG GGCCATGAAA TTCGGCGTAT ATGAATTGAT GAAGCCGGTA TTTGCTTTGC TACTAGGCTC GAGCGAGGGA GGGAATACTG CTGTTGCATT TCTATCGGCG TCCGTAGTTG CCGGAGCTGT AGCAGCTCTC CTGCTCTGTC CCATGGAATC TACCCGAATC CGTATCGTGA CTGACCCAGC TTATGCAGGC AAGGGATTGT TAACAGGACT TCCGAAACTG ATTTCGGAGG AAGGACTATG GTCCACATTT TCGGGTCTCT GGGCGATGCT GGCAAAGCAA GTTCCGTACA CTTTTGGCAA GCAAGTTTCG TTTGATGTTT TTGCTGGGTT TCTGTACGTC TTTTTCAGCG CACTTCAAGA AAACGCAACA TGGTTGTCTG ACAGCCAAAC TAAGTGGGCC GTGTCCGTCA TTGCAGCATT TATGGCTTCA ATTATCGCGT GCATTTTCTC TCAACCAGGA GATATGATCC TTACAGAGAC CTACCGGCCC AAGGATCCGA AAGCAAAGGT AGCGGTAGAC GGCAATTTTG CTGATGTCAT CAACAGCATC TACACCAAGG GTGGTGCTTC CGGATTCTTT ACGGGAACCG GTGCGCGTAT TGTTCACGTC GGTTTGATTA TCACCAGTCA ACTTGTTATT TACGACATTG TGAAGCAAAT GTTGGGACTC CCTGCAACTG GATCGCATTA A
|
Protein sequence | MKVLQACQLA IVLPLGVIEP SSAASSTFSA LKSIDYRYFV AGGTCAAISH GITTPIDVVK TRIQSDPKKY NQGLRKAAIN IVKEDGTGVL LGGLGPTVVG YGIEGAMKFG VYELMKPVFA LLLGSSEGGN TAVAFLSASV VAGAVAALLL CPMESTRIRI VTDPAYAGKG LLTGLPKLIS EEGLWSTFSG LWAMLAKQVP YTFGKQVSFD VFAGFLYVFF SALQENATWL SDSQTKWAVS VIAAFMASII ACIFSQPGDM ILTETYRPKD PKAKVAVDGN FADVINSIYT KGGASGFFTG TGARIVHVGL IITSQLVIYD IVKQMLGLPA TGSH
|
| |