Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_10479 |
Symbol | |
ID | 7204041 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1020957 |
End bp | 1022213 |
Gene Length | 1257 bp |
Protein Length | 300 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186170 |
Protein GI | 219113173 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCATA AACTCGATCG ATTTGGCTTC ATTCTGAACA TGGACTCACA CGGCAATGTT CACGAGCACG ACGAGCCGGA GCCCATACGG ACCTTTGCTG CACAAAAGCG CGTCGACGTC CGCACGCGCA AATGGAACGT TATGCTATTC GGGACCGGAA GTAACTCAAC CAAGACCAGC AATAGGAAAT TTATGGGTTC TTTGCATCAT CGTAAGCTCA AGTCCCGATT ACGGAAGGGC GTCCCCGATA CCCAACGAGC CGCCGTATGG TGTCGGCTCG CCGGTGTGGC GGAAAAAATT AAGACGCATC CCGGAACGTA CAAACGATTG GTCCAGCAGT CTGTACTTCA AAATCCTCGT TACCATTTCG TCGTTGGTTC CGGCATGCCG AGCAGTACCT CTCCAACCCC CACAAAAACA TCGTTTCGGA ATATCCAGGA AACCATTGAA CGGGACATTC ACCGCACCTT TCCTCGGCAT TCTATGTTTT TTGAACGTGT TCAGGAAGAA GCAGAAGAAG ATGAGAACGA TCCGGAACGG TCCCCAACCG ACATCTGTGG GACCACCGAG ATTTCGGACA TGATTCGGGA GTTGGAGCGC TCGCAGAGGC TTCTCGACAC CCCTCAGGAA CCATCGGCGC ATCCAGATCA CGTGCCGGCC TCGCGCGTTT TAGAAGGTCG AGGCGGACAA GCAAGTTTAC GCCGTGTCTT GAAAGCCTAC AGTCTATACG ATCGGGAAAT TGGGTACTGT CAAGGGATGA ACTTCATTGC GGGGATGTTT CTGACTCTAA TGACGGAAGA GGAAGCGTTT TGGTTACTTG TTGGTCCGTA CAAATGCTGA GATTGTTGAT TCTGTTTACA TCAACCTGAT GTTGCTTACG TGCATTCTAT TGTTTTTCTT ACAGCGGTGA TGAATGACAA ACCGTGCTGC ATGCGTGGAT TATTTGGGGA AGGCATGCGG GAGACTCACC AGGTACTCTA TGTGGCCGAG AAGCTGATCC ACCAGTTTTT ACCCAAACTT GCCCGACATT TTGACAAGGA GCATTTGCAC ATAACAATGT TTGCAACGCA ATGGCTCTTG ACTCAATTCA CCAGCTCCTT TCCTTTTGAA TTGGTCACTC GTGTATGGGA TTGTTTCCTA CAGGAAGGCT GGAAGATCAC GTACCGCGTC ATGCTTGCGC TCTTATCAAC GAATCAATCG AACATTTTGC AACATGGGTT CGAAGAAATT TTAGCGCTCT TTCGGGAGTT GCCAGAC
|
Protein sequence | MKHKLDRFGF ILNMDSHGNV HEHDEPEPIR TFAAQKRVDV RTRKWNVMLF GTGSNSTKTS NRKFMGSLHH RKLKSRLRKG VPDTQRAAVW CRLAGVAEKI KTHPGTYKRL VQQSETIERD IHRTFPRHSM FFERRGGQAS LRRVLKAYSL YDREIGYCQG MNFIAGMFLT LMTEEEAFWL LVAVMNDKPC CMRGLFGEGM RETHQVLYVA EKLIHQFLPK LARHFDKEHL HITMFATQWL LTQFTSSFPF ELVTRVWDCF LQEGWKITYR VMLALLSTNQ SNILQHGFEE ILALFRELPD
|
| |