Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35627 |
Symbol | |
ID | 7200940 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 391756 |
End bp | 392906 |
Gene Length | 1151 bp |
Protein Length | 345 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180034 |
Protein GI | 219118527 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0500584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGAAC GGGACAACGA AGACGAAGCC ACTCGTCTGC AACAACAAGC CGCCAAACTT CGTGAACAAA TACGTCAAAT GGAAGCTAAC TTGGGCGACC AGCGTCCGCG CAATTACGAA AGGCCTCCGC CACCGCAGTC ACAGCCCGAC CCCACCGATA CCAATCCATC CCTCAAAGGC AAACGGGTTT TGGTGACGGG CGCCAACGGA CGTCTCGGCA GTATGGTGTG CCGCTATTTG TTACGGAACC ATCCACAAAC CGAAGTGGTT GCTGCCGTGC ATGTCGTGGG AGAAAACAGT TCCACCAGTC GTGGTTATGG ACGATTGTCT TACGAAGTCG GAGCCGAAGA TGGGGTGGGG CGGATTGGGC CAGCCTGGTC CTCCGAAGAC CGGACGGCAA CGTTTGAATG GGACATTTCC ATGAAAGATT ACAATCTGCA AAATCTACGT CTCGTCGAAG TGGAATTACT AGATCCGGTA CAGTGTCGGA CTGTGGCGGA AGGTTGTGAT GCCGTCATTT GGTGGTACGT CTGCAAGGAT GTTTCTCTCG CGTTTGCTGC TTCCACAAAA CAAAGTACAG CCTCTCACCC ATAGTGCGTA CCGGCTTTCT TCTAATCGAC TTTTTTGACT CCATACACAG CGCCACGGAT TTCAACGGCA ATCGTCCGCG AGCAATTTCC GGATTGAACG TGGCTTTTCT TTTCCGTGCG GTGGCATCCC CCACCAAAGG ACGGGTCGAA GTGGAAGGAT TGGAGAATAT GCTGGGGGCC CTCAAAACCG CCAGACAAGA CAAGCAGCGA GCCACCGGAC GGGTACCGAC GAACGATCCC GTAAACGTTG TGTTGGTATC CACGGCTCCG GACGCCTACG ACGATTTTGA AACGCCGTTC GGTTCTTTTC GAGGTATAAA GCGCCAGGGG GAACAAATGC TGCAAAGTGA CTTTCCCAGT TTGAGCCACA CCATATTACA ATTGAGCCGA TTCGAAGACA ATTTTGTAGA GGAAGGTTTG GATGTTTCCA CGGAGCCGTC CCGGGCGAAC GATATGGAGG CTCCGGGCGA TGCGGACAAG GCCCGGCGGC GCATTAACCG AAGAGATGCT GCCAAGGTAG CGGTAGATGC ACTTCTGGAC GAAGAGCTTA AGGACAAGAC C
|
Protein sequence | MAERDNEDEA TRLQQQAAKL REQIRQMEAN LGDQRPRNYE RPPPPQSQPD PTDTNPSLKG KRVLVTGANG RLGSMVCRYL LRNHPQTEVV AAVHVVGENS STSRGYGRLS YEVGAEDGVG RIGPAWSSED RTATFEWDIS MKDYNLQNLR LVEVELLDPV QCRTVAEGCD AVIWCATDFN GNRPRAISGL NVAFLFRAVA SPTKGRVEVE GLENMLGALK TARQDKQRAT GRVPTNDPVN VVLVSTAPDA YDDFETPFGS FRGIKRQGEQ MLQSDFPSLS HTILQLSRFE DNFVEEGLDV STEPSRANDM EAPGDADKAR RRINRRDAAK VAVDALLDEE LKDKT
|
| |