Gene Information |
Locus tag | PHATRDRAFT_36398 |
Symbol | |
ID | 7201797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 353679 |
End bp | 354839 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180809 |
Protein GI | 219120127 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
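The coordinate and length fields above agree with each other if Start bp and End bp are read as inclusive 1-based positions and the CDS ends in a single stop codon. Both points are assumptions inferred from the numbers, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(353679, 354839)
print(length, protein_length(length))
```

With the values from this record the sketch reproduces the listed Gene Length (1161 bp) and Protein Length (386 aa).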
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCTT CACAGTCTCT TTCGCGAATT CGAATGCCTG CGGAGTGGGA ACGGCACGCC GCGTGCTTAA TTTTATTTCC TCACAATGCT GCAACCTTTC GACTCTCGTT GGCCCAGCCT CAAGTCTTAA GAGTAGCGCG AACGATTGCC ACCGTCGGCC AAGAGCCTGT GATATTGTTC GCCAATGATG AAATGGAAAC ATTCCGGTTA CGTGAATTGC TGAAGCTGGA CGAAAATATC CGGGTCTTGA CTTGTCCCAG CAACGATACT TGGGCTCGTG ATACGGCTCC GACTTTCGTC ACTCTAAACG ATGGCGACGG GCAAAACAAT GAGTTATTGC TCAGAGGTTT GGACTGGGAT TTCAATGCCT ACGGAGGTGC CGAGGAAGGA TGTTACTGGC CCTGCTGTCT TGATCAGAAA GTTGCGGCAA CAATGTGCCG ACAAATAAGT GACGTAGGAA TTTTGGCGGA GCCGATTGAG TCGCTCCCGA TTTCCTTGGT GCTAGAAGGA GGATCCATCC ATACCGATGG TGAAGGAACT ATTTTGACAA CCAGAGAATG CCTTTTGAAT AACAACCGGA ACCCCAGCAT GTCGCGGCAA GAAATCGAGG AAATCATTTT ATGTAACACG GGCTGTACAA AGATGATTTG GCTAAGCGAT GGGCTGGCCA ACGACGATGA TACGAACGGC CACGTCGACA ACTTTGCCTG CTTTATCAGA CCAGGACACG TTTTGTTGGC TTGGACGGAT GATGAAGTTT ATGACACCGA AAATTACGTC CGATGCCGCG CCGCTCTGCA AATATTACAG AAGGAGCGAG ACGCCCGTGA ACGCAACTTG ACGGTGGACA AATTATACCT ACCGACGCCA ATGACGTACT CCCAAGAAGT AGTTGATTCT CTCAATTCTT GTATATCTGG TCCAAATATC GCTGCTAGAC ATGCTGGTGA GAGACTTGCT GCTTCTTACA TCAACTTTTA TATTGCGAAC GGTGCCGTAA TTGTTCCTCA ATTTGATGAC GATGTTTATG ATTCCAAGGC TATCGAGACT CTTGAGGAAC TCTTCCCTGC GCATAAAGTA GTCGGTGTTT CCAGTAAAGA AATTCTTATT GGCGGTGGGA ATATTCACTG CATCACACAA CAAGTTCCTT CACTACTTTA G
|
Protein sequence | MKASQSLSRI RMPAEWERHA ACLILFPHNA ATFRLSLAQP QVLRVARTIA TVGQEPVILF ANDEMETFRL RELLKLDENI RVLTCPSNDT WARDTAPTFV TLNDGDGQNN ELLLRGLDWD FNAYGGAEEG CYWPCCLDQK VAATMCRQIS DVGILAEPIE SLPISLVLEG GSIHTDGEGT ILTTRECLLN NNRNPSMSRQ EIEEIILCNT GCTKMIWLSD GLANDDDTNG HVDNFACFIR PGHVLLAWTD DEVYDTENYV RCRAALQILQ KERDARERNL TVDKLYLPTP MTYSQEVVDS LNSCISGPNI AARHAGERLA ASYINFYIAN GAVIVPQFDD DVYDSKAIET LEELFPAHKV VGVSSKEILI GGGNIHCITQ QVPSLL
|
| |
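The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing the GC fraction. The sketch below is illustrative only. It assumes the standard genetic code, because the Translation table field of this record is empty, and it assumes the sequence is pasted as shown here, in whitespace-separated blocks. The helper names are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAAGGCTT CACAGTCTCT TTCGCGAATT"
print(translate(first_blocks))  # MKASQSLSRI
```

The ten residues produced from the first 30 bases match the start of the listed protein sequence. Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the same comparison against the Protein sequence and GC content rows.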