Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39391 |
Symbol | |
ID | 7195128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 379218 |
End bp | 380585 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183476 |
Protein GI | 219126462 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0155798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCAACA CCAGATCTAA TCCAATTCCA ATGGCAGATC AACCTCCCCA AGAGGGAAAT GTCGAAGAAG CATGGGTTGC TTCGGAGACA ACCTTCGAAT CAATGGGTAA CGTGCGCTTC GCTATCTCAC CCAGTCTTGC GCAAGGACGT CGAGTCATCC TCGACTACTC CATCCCGAAA ATTGCAAAGC TTTATGTAGC AGCCACAAAG CCACTTTCGA AGACAGAGTT TGACATCAAA GCGAGCGAAC TCACTCCTTT GCTCTCTAGC CTTTCGTTAC GCGCGAGAGA ACATGGATGG CATACACACC AGAACGACGG AATCCTCAAT ATTCCGGACA ATCTCAATGC TCCCCACGGC AGTTCCAAGT CGTTACTCAA GGAGTACGGT CAGATTTCCC TCGCTCACAT TCGCAACTAC GTCTCAACGT ATGCGAACAC GGAAACACGC GACGTTCAAG ATGATGAGGC ACTCTATCAG TGCTTAAAGG TGTCACTTAC TCATGAAGCA ATTGCCAAAA TTGACCTTTA TGAAAGCGAG TGGACTGTTG CCGGACAACC ATCAGGAATT GCGATGCTCA AGGTCATCAT TCGCCAAGCG TACGTTGACA CAAACGCAAC TACAATGCAC GTGCGTACCA AACTCAGTAA GCTCGACGCT TACATGGAAT CGCTCGCGGA GCACAACGTG ACGAATTTCA ACGAATACGT ATACGAGCAG CTCCAAGCTC TCACGGCACG AGGAGAACGG ACACTTGACT TACTCCCAAA CCTATTCAAG GGTTACGAAG CAGCCAAGGA CACGCAATTC TTGGAGTACA TCCGCAAGAA GAAAGCAGAA TTCGAAGAAG GAACGCTCAC ATTGCAACCG GAGATCCTAA TGTCGCAGGC ATCGATCAAA TACCGAACAT TGGTGGAAAA GGGGGAATGG GACGCTCCGT CAGAAAGCGA AACGAAGATA ATCGCGTTGA CAACGCAAGT TCAGCAGCTT CAGTCCGCAG CAAAGAAGGC ACCAAAGCCA AAGTCCAAGG AAAAGGGAGA TACAAAGGGA AAGAAGAAGA AAGGAAAGAA ATCCGACAAA CGAAAGATGG ACAAGTACGC TTCTCTCAAG ATCCCAGCGG CGTCGGAACC ACATACGAAA GTTTTCGACG GCGATACGAT TTCGTTCTGC ACCAATCACC AAGCTTGGGG AACGCACCTC GCGAAGGACT GCAAAGGCTA CGGACTCGAA AAGGACTCTG GTGGCAAGCC TATCCCCAAA ACTGCGGGAC AAAAGAAGGA CGACAAGAAC TCCAAAACTC AAGCCGCAAT CATGCGGATG AGCAAAGCTC TGACCGCCGA GATCAGCAAG GCTGGCGAAG AGGAATGA
|
Protein sequence | MVNTRSNPIP MADQPPQEGN VEEAWVASET TFESMGNVRF AISPSLAQGR RVILDYSIPK IAKLYVAATK PLSKTEFDIK ASELTPLLSS LSLRAREHGW HTHQNDGILN IPDNLNAPHG SSKSLLKEYG QISLAHIRNY VSTYANTETR DVQDDEALYQ CLKVSLTHEA IAKIDLYESE WTVAGQPSGI AMLKVIIRQA YVDTNATTMH VRTKLSKLDA YMESLAEHNV TNFNEYVYEQ LQALTARGER TLDLLPNLFK GYEAAKDTQF LEYIRKKKAE FEEGTLTLQP EILMSQASIK YRTLVEKGEW DAPSESETKI IALTTQVQQL QSAAKKAPKP KSKEKGDTKG KKKKGKKSDK RKMDKYASLK IPAASEPHTK VFDGDTISFC TNHQAWGTHL AKDCKGYGLE KDSGGKPIPK TAGQKKDDKN SKTQAAIMRM SKALTAEISK AGEEE
|
| |