Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21207 |
Symbol | |
ID | 7201993 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 2836 |
End bp | 4520 |
Gene Length | 1685 bp |
Protein Length | 517 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181114 |
Protein GI | 219121523 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGATATTGCT GCAAGGACGC AATGAAAAGG ATTCGACCAG TACGAAGGAT TAAAAAATGA CCTACATAAA CTTCTCTATA ATTCTCTGCT CGCTACTTTA TTTGAGGAAG TGCTGCTGTG ATGCGTTTGT GGTGGACAAG CAATCAAAAA GGTGTTTTGT CGACCCTCTA TCGATGGCTT TTTCAGAGAA TGGAGGTAGC GGGAGCCGGC GTGTACAGAA AAGCGTCCGT GATCGCACCC AGGAAGAAAC GTTTAGTCTG ATAAAAGATG TGCTCCGAGC TGCAGTTGAT GCCGGTCCCC GTGCTGGACC TGCCAGAACT ATGCAAGCAT ATCGTGCCTT CGCCACAACA GCCCAGGAGT TCCTGCCCAA GTTGGCCCAA GGGCCTGAAA CCATCTCTCC GGCGGCCGTT CTCCGAACCC TTTTTGAGCG AATGGGTGCT ACCTACATCA AGCTAGGCCA GTTCATTGCA AGCAGTCCGA CTATTTTCCC AAAAGAATAC GTATTGGAAT TCCAGAAATG TTTGGACCAG ACAGAATCTC TACCTTGGCC AATAATCAAG AAAGTTATTG AGGACGAGCT AGGGCCTATT TCAAAGAATT TCCAATACGT CAACAGCAAA CCTCTTGCCT CCGCTAGTAT TGCTCAGGTC CACGTTGCAA GACTAAGAAC TGGAGAAGAC GTCGTCCTCA AGGTACAAAA GCCGCGCATC GACGAAAGTC TCAAAGCTGA CCTTGGTTTC ATTTATGTTG CTGCTAGGAT TCTAGAATTC TTCCTCCCTG ACTGGGAACG AACATCTCTA GCGGCAATTG CTGGAGATAT TCGTTCGTCT ATGCTTGAAG AACTTGATTT TGAAAAAGAG GCACAAAATA CAATAGAGTT CAGAAGATTT ATTCAAGACA AGGGACTAAC AAAGCAGGCA ACAGCTCCGT TTGTGTACCC GGCTTTCACA ACAAAGAAAG TATTGACAAT GGAGAGACTC AATGGTGTTA GTCTTTTGGA TGAAAACACA ATGGGAAAAG TCACCAAGAA TCCTCAAATG GGAACGGACG TAATTATTAC TGCTCTGAAT ATTTGGAGCC TCTCTGTCAC AGCAATGCCT TGGTTTCACG CGGATGTACA TGCTGGAAAT TTGTTACTGC TCAACGACGG ACGGGTCGGA TTCATTGATT TCGGTATTGT AGGGCGGATC AGCGAAAAGG TGTTTCGCTC TGTGAACGAG TTGTCAGCGG CCTTGGTTGT CGGTGATTCT GAGGGGATGG CTATTGCCCT TTGCAACATG GGAGCGGCAG GTAAAGATGT GGATACGAAT CAATTTGGAA AAGACATCCA GCGTGTGCTA GATCGAATGA ATACTGTGCA GCCAGAAATG ACTGTCACTG CACACCAGGA TGGTATTCTT CAAGGGCAGC TCAACGTGGA CGAGGGCGAG GGTACAGATC TGCTCCTTGA ACTAGTTGAA GTAACAGGTG CGAAATGAGA AACCTAAAAG TTACTTTCTT TGCTCTTCTT ACTGTTAGTT TTCTCACTCA TTTTTCTTTT TACAGAGAAG AATGGCCTGA AGCTACCTCG AGAGTTTGGG TTGCTGGTGA AGCAAAGTCT CTATTTTGAC CGGTATCTCA AGATCCTCGC ACCAAATGTA GATGTTATGA ATGATTCACG AGTCATGATT GGGGGTACAA AAAGA
|
Protein sequence | MTYINFSIIL CSLLYLRKCC CDAFVVDKQS KRCFVDPLSM AFSENGGSGS RRVQKSVRDR TQEETFSLIK DVLRAAVDAG PRAGPARTMQ AYRAFATTAQ EFLPKLAQGP ETISPAAVLR TLFERMGATY IKLGQFIASS PTIFPKEYVL EFQKCLDQTE SLPWPIIKKV IEDELGPISK NFQYVNSKPL ASASIAQVHV ARLRTGEDVV LKVQKPRIDE SLKADLGFIY VAARILEFFL PDWERTSLAA IAGDIRSSML EELDFEKEAQ NTIEFRRFIQ DKGLTKQATA PFVYPAFTTK KVLTMERLNG VSLLDENTMG KVTKNPQMGT DVIITALNIW SLSVTAMPWF HADVHAGNLL LLNDGRVGFI DFGIVGRISE KVFRSVNELS AALVVGDSEG MAIALCNMGA AGKDVDTNQF GKDIQRVLDR MNTVQPEMTV TAHQDGILQG QLNVDEGEGT DLLLELVEVT EKNGLKLPRE FGLLVKQSLY FDRYLKILAP NVDVMNDSRV MIGGTKR
|
| |