Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_16891 |
Symbol | |
ID | 7199152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | - |
Start bp | 186197 |
End bp | 187546 |
Gene Length | 1350 bp |
Protein Length | 450 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185337 |
Protein GI | 219130364 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGG GGGGTATGAG TCTCTCCAAA GAATTCTTCG AACTCCTCAA GGCCATCGGC GAGTCCAAAT CCAAGCAGGA AGAAGACCGG ATCGTTCAGA AAGAAGTGAC GCGCTTGAAG AGCAAACTCG AAAACACACC GGGGAATCCT TACCACTCCA ATACGTTGCT CACCAGCAAG AAGCGCGCCA AGGAGTTCCT GGTGCGACTT TTGTACGTGG AAATGCTCGG TCACGACGGA TCCTTTGGAT ACATCAAGGC CGTCGAAATG GCCGCCTCGG CCTCGCTTTT TCACAAGCGT ACCGGCTATT TGGTCTGTGG CGCCTGTCTC CCGCCCTCGC ACGAATTCCG TTTCATGCTC GTCAACCAAA TGCAACGCGA TCTACAGTCC ACCAACGTAC TTGAATGCAG CGGTGGTCTC CTCGCCTGTA CCAACCTTAT TACGGCTGAT ATGGTCCCCG CCGTCGCCAA CGAAGTTAGT AAACTGCTGC AGCACGATTC AGCGACCATT CGCAAAAAGG CGATTCTCTG TCTGCATCGA TGTCACCAAC TCGCGGATGA CGTTGTTACC AGCGAATCTC TGCACGAATC GCTACGGAAA CTTGTTTGTG ATAAGGACCC TTCCGTGATG GGGAGTTCGC TGAATGTCAT TGAGGCCTTG TCTCTCACGA ATACCGCGCC TTTCAAAGAC CTGGTCCCCT CCCTCGTTTC CATTCTCAAA CAGATTTGTG AACACCGGTT GCCTTCCGAG TTTGACTACC ACCGTGTCCC GGCGCCGTGG ATGCAACTTA AACTCGTACG CATTCTGGGT CTCCTCGGCA AGGCCGACAT GCCCGCGAGC AAGGGGATGT ACGAAATTCT ACACGAAACG CTGCGCAAGG CCGATACCGG GATCAATGCG GGATACGCGA TTGTTTACGA ATGCGTTATT ACCATTATTG CCATTTATCC CAACGCCAAC CTGTTGGACG CCGCAGCCGA AGCCATTGCT CGCTTCATGC AGTCTCGATC GCACAATCTC AAGTACCTAG GAGTTACCGG ATTAGCCATG ATTGTGGAAC AGCATCCACA GTACGCGGCG CAGCATCAGT TGGCCGTGAT GGATTGCTTG GAAGATGACG ACGAAACGCT ACAGCGAAAG ACGCTCGATC TATTGTACCG CATGACGAAC GTAGTTAATG TGGAATTTAT CGCCGAAAAG CTGGTGGAAT TCTTACGCCA CACGACCGAT TTATTCCTCA AACAGACCTT GACGACCCGT GTTTGTTCCA TTGCCGAGCG CTACGCCCCC AACAACGCCT GGTATATTCG TACCATTACC TCTCTGTTGG AAGTATCTGG AGACATGGTT
|
Protein sequence | MATGGMSLSK EFFELLKAIG ESKSKQEEDR IVQKEVTRLK SKLENTPGNP YHSNTLLTSK KRAKEFLVRL LYVEMLGHDG SFGYIKAVEM AASASLFHKR TGYLVCGACL PPSHEFRFML VNQMQRDLQS TNVLECSGGL LACTNLITAD MVPAVANEVS KLLQHDSATI RKKAILCLHR CHQLADDVVT SESLHESLRK LVCDKDPSVM GSSLNVIEAL SLTNTAPFKD LVPSLVSILK QICEHRLPSE FDYHRVPAPW MQLKLVRILG LLGKADMPAS KGMYEILHET LRKADTGINA GYAIVYECVI TIIAIYPNAN LLDAAAEAIA RFMQSRSHNL KYLGVTGLAM IVEQHPQYAA QHQLAVMDCL EDDDETLQRK TLDLLYRMTN VVNVEFIAEK LVEFLRHTTD LFLKQTLTTR VCSIAERYAP NNAWYIRTIT SLLEVSGDMV
|
| |