Gene Information |
Locus tag | PHATRDRAFT_40919 |
Symbol | |
ID | 7198748 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 268093 |
End bp | 269441 |
Gene Length | 1349 bp |
Protein Length | 422 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184849 |
Protein GI | 219129340 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
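The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the listed gene length of 1349 bp) and that the gene span runs from the start codon through the stop codon. The function names `span_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding nucleotides implied by a protein length, optionally with the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the record: Start bp, End bp, Protein Length.
gene_len = span_length(268093, 269441)
cds_len = coding_length(422)
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)  # 1349 1269 80
```

Under these assumptions, the 80 bp difference between the genomic span and the coding length is consistent with the record marking the gene as spliced.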
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAGTC GCGAGCCCAA CAACGATCCG GACAACCGTC TTGGAGAAAA ACCGATGGAA TCCCTCGAAA TTGAAGACAT ACATCAGTCG AGTCGAAGGG CCAACGCCTA CGACAGTGAT TCGTATCATC ACGATGCTTA CACAGATTCC GAAGAAGAAT CTCCCATCGC GCCCTACGAG TACCGTCATC GGAAAAGCGA ACATAACGAT ACGACTTTAA TGTTGGATCC CGACGATGCT CGTCCGCTGG TGGGAAGCAG CATGATTGGA ATCAACAATA CGCGATTGAC AGCCTTGCAG ACCGTGGTCA AAAGTCGTGC CTGCTGGCCC GTGTACATTG GGATAGCCCT CGTGGTCTCA CTCTTGATTG CGGCCATTGC TTACGTACCC CCGGCTCGAC GCAAAGTCAC GCGACCCGAT TTTATTTGTC CCACCGAACC GGCGTCGTCA CTGTTTTGGA CACACTTCAC GCACAAGGTA CAAACGTGGG TGGGGCCAGA ACGGTGTCGG ACTGGACGAA ATAATGAGGT GTGCTCGTGC GAGGACCCTA CCCAGCCGTC AATTCCGCAA AGTCCTGATT CTTGGCGGGA GGGATGGCAG AGAGCAACGT TACGGAATGC GGCTCTCATT CAAGATCGGA AACACCTCGC GCTGGACGTT GTCTTGCTGG GAGATTCCAT TACTGAACAC TGGTTGGGCA CAGGTTTCGC CGAACCAAAC AACGACTATC AAGCAAATGT GCCGGTCTAT CAGTCACTCT TTTCCAAAGA ACACGGCGCC GTGATTGAAG GCCTGGCTTT GGGTATTATT GGAGATCGTT GTCCAAACTT GTTGGCACGG CTGCAGAACA ACGAAACGGT ACAGGGCTTG TCCGTCAAAG TTTTGTGGGT TTTGGTAAGT TGTGTCGGGT TCAACGGTAT TTTTTTCGTA GCGACACAAT GCGCTGACGC GCTGAAATTC CTATATTGTA ACAGATCGGG ACCAACGACT ACGCGAGCAG CTTTTGTCGT GTGGATTGTA TCGTGGCGGG CAACTTGGCC ATTGTCCGAG AGTTGCGACT CCAAAAGCCC GAGGCCACCA TTGTAATCAA CGGCCTACTG CCTCGCAGTA AATCGCGTAC CGACGTGGCT TTTGCCGATG ACTTTGCTGA GATTAATCGG CGACTCTCAT GCATTGCCGA TACCTTGGAC GATGTTGTCT TTTTCGACGC TGCGTATCTA TTCTTGACCG AAGATGGTGG CCTAAATCGA ACCATGCTGC CGGACGGTTT GCATCCTGGT GAAGTAGGAT CACGTGTATG GGGGCAAGCA ATTGTGGATC GAGTTTTGAA GATTGATGGT GGACTGTAG
|
Protein sequence | MRSREPNNDP DNRLGEKPME SLEIEDIHQS SRRANAYDSD SYHHDAYTDS EEESPIAPYE YRHRKSEHND TTLMLDPDDA RPLVGSSMIG INNTRLTALQ TVVKSRACWP VYIGIALVVS LLIAAIAYVP PARRKVTRPD FICPTEPASS LFWTHFTHKV QTWVGPERCR TGRNNEVCSC EDPTQPSIPQ SPDSWREGWQ RATLRNAALI QDRKHLALDV VLLGDSITEH WLGTGFAEPN NDYQANVPVY QSLFSKEHGA VIEGLALGII GDRCPNLLAR LQNNETVQGL SVKVLWVLIG TNDYASSFCR VDCIVAGNLA IVRELRLQKP EATIVINGLL PRSKSRTDVA FADDFAEINR RLSCIADTLD DVVFFDAAYL FLTEDGGLNR TMLPDGLHPG EVGSRVWGQA IVDRVLKIDG GL
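The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before lengths or GC content can be compared with the reported fields. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence from the record.
first_blocks = "ATGCGCAGTC GCGAGCCCAA"

print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 65.0
```

Applying `gc_percent` to the full gene sequence and `len(clean_sequence(...))` to both sequences gives values that can be compared against the GC content, Gene Length, and Protein Length fields.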