Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_13606 |
Symbol | |
ID | 7202097 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 133017 |
End bp | 134171 |
Gene Length | 1155 bp |
Protein Length | 338 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181142 |
Protein GI | 219121581 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.336999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAC AGAACAATCA GCAAGTGAGT CGTTTGCAAC AACTGGTGCC GACAATCGGT ATGTTTCACA CCCACCTACC TCTCCGTCGG GCGTTTGAAG TATATAACGA AAAGCACAAA TTGACGAAAA GACAGCATAT CCAGATTTCA TTCAATGAGA TTCGTCATAT TTTGAACCTA GCTCAGATTA TGGCTCTCCG CAAAAATGTC TCTGGAGAAA GGCGGCATCC TTTGGATATC TTTGCGCCGG AGCTAGAGCC AAATTCTTTT GTCAGCTGCA GAAAGTCTGA CGATGTCGAG ATCGCAGACG GAGATTCGAC CGTGGACAAG GATATTCCTC CGGGTCAGCT CAATGGTCCT CGTTTGATAA CTTTCGATGG AGATCAGACT CTTTATGCTG ACGGCGCAAA CTTTGACAGT AATCCTCGAC TCGCCAATTA CCTGTATTTG CTACTTCGCC ATGGAGTATC TGTTGCTGTT GTTACTGCCG CCGGATACGA GTACAATGTC GAAAAGTATG AATATCGTCT TTCGGGACTT CTGCATTTTT TCAGACAACG AGGGCTTTCA AATGCCGAAT GTGCGCGATT CTACTTGTTT GGAGGAGAGT GCAACTACTT ATTTCAGCTA GGACATGGGT ATAGACTGCA GCCTGTGAAG GAATATGGGC CGGGTGGGTG GATTACGTCT ACTTCATTCA TCAAAGAAAG CCCCGGGAAC TGGTCCGAAG CCCATATCAA CACGGTTTTG GACTTGGCTG AATCCAACGC CAACGAGACC TTGAAGGAGC TGAACCTTCG AGGACGCATT GTCCGCAAAC GGCGATCAGT TGGGCTGTGT CCGAATCATG GACAAGAAAT ACCTAGAGAG AGTCTAGACG AACTAGTTCT CCGCTCTCAC GAGAAGCTCA ACCGTATGAA TGAAGGTACT GGCCCTGGAA TACCTTACTG CGCCTTCAAT GGTGGAACTG ATGCTTGGGT CGATGTTGGC AATAAAAGGG TTGGAGTTCA GGTTCTGCAA TCCTATCTTG GAATTCCAGT GCAAGAAACA CTGCACATTG GTGATCAATT TCTGAACACC GGCAACGACT ACGCAGCGCG CGACGTCAGC TGCTGTGTAT GGATAATTAG TCCCCAGGAA ACTACGTATA TTCTT
|
Protein sequence | MSEQNNQQVS RLQQLVPTIG MFHTHLPLRR AFEVYNEKHK LTKRQHIQIS FNEIRHILNL AQIMALRKNL NGPRLITFDG DQTLYADGAN FDSNPRLANY LYLLLRHGVS VAVVTAAGYE YNVEKYEYRL SGLLHFFRQR GLSNAECARF YLFGGECNYL FQLGHGYRLQ PVKEYGPGGW ITSTSFIKES PGNWSEAHIN TVLDLAESNA NETLKELNLR GRIVRKRRSV GLCPNHGQEI PRESLDELVL RSHEKLNRMN EGTGPGIPYC AFNGGTDAWV DVGNKRVGVQ VLQSYLGIPV QETLHIGDQF LNTGNDYAAR DVSCCVWIIS PQETTYIL
|
| |