Gene Information |
Locus tag | PHATR_43976 |
Symbol | |
ID | 7204192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 595077 |
End bp | 596805 |
Gene Length | 1729 bp |
Protein Length | 492 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186379 |
Protein GI | 219113591 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
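The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed Gene Length) and that the coding region ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 595077
END_BP = 596805
PROTEIN_LENGTH_AA = 492


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def cds_length(protein_aa: int) -> int:
    """Coding bases needed for a protein, counting one stop codon (assumption)."""
    return (protein_aa + 1) * 3


gene_len = span_length(START_BP, END_BP)
cds_len = cds_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 1729, matches the Gene Length field
print(cds_len)             # 1479
print(gene_len - cds_len)  # 250 bases outside the coding region
```

Under these assumptions the listed gene span is longer than the coding region implied by the protein length, which suggests the Gene sequence field includes non-coding flanking bases.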
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTTGGGTT GATTGTACCG ACGTCGTTAC TTTCCTCTAC TCGCATACCG ATACTTCGAC ATATCCGGTG GTTTCTACAG TCAATCGGTC ACTTCACTTT TTGGTTTTCG TGATTTCCTG TTCTCATCCA TCCCGCTTCG GCGAAATTGA ATGGAGGCTA TTCTCATCCG TGCAGCCCGT AACCGAGCTC TATCCCTTCA TCATTGCAAG CTACCTTCGA TTGCAACTGC TGCTCTTTGC TTAAGCAACT CACTCTTTCA CACGGCAACT GCCATCACTA TGCCGAGTCC TCCATACGCC TATCGCTCTT CAGATTCGAA ACCACTAGAC CTTTCGGCTT TAGAAGCACA AATCACAACC CTTGAAGATC AGCGACAACG TGCATTTGAG CTTTCTCGAG ATATGCAGGT TGCCATCCTA CAAGCCAAAA CGGCGCTGGA ATACAACGAG GACACATCAT CCGTGACCAC CAAACTCGAT ACTTTGCTAG AAAATCCGCT TTTGAAGCAG CTCGATGCGT CAGATCGGGC TCCTCGCCAT GCGAATTTGT CCTTCAAAAT GGAGGACTAC CTCCGGTTTT TGGCCTTCCA ACGTTTTCTG CAGTCCGCCG ACTTGCTTCC TCCTGCGGCA CCCTTCACGG ACGAAGAATA CTTGGGAGCA TGCATGGGGC TGGCGCAAGA CTTACAGCGC TACGGACTAG GTCGGGCTAC CGTTCGCGAT GTCGCGTCCG TCCAGGCAGC CGCTGATTTG GTCGGTGACC TCTTGGACTT TCTGCTGCAG CTGGATTTTC GCAATGGGCC ACTCCGACGT AAATACGATG GGACCAAGTA CAGTCTGAAG GCACTAGAAA CCTTGTTGTA CGAGCTGGCG GTAACGGATA GTTCCCGTGC GGTCGAGGGG ACATCACCAG CGAAGCGTTC AAAGGTGGAA AAAGCTTCTA CTATGCTGCT GCCGCTGGAC AGTTTGCAAG CGCTTAAATC CCGCATGGTC TATCGCGATG ACCTGCGTGA ATCCTTGATC AAAAAGTGCC GCGATGGTCA AAAGGCTGCC AAACAGTCGA TCTTTGCACT CCATCGCGGT GACAAAGAAA AAGCTTTGGA ACTTTTGACT GAATGTCACA ACGGTATTGT CAATGAGCTA CTACCGATAG TTGTCGAAGA ACCACTTCTT CGGAATGGAT CGTTCGCCAA CGTTTTGGAG GAGTATGTGG AAGGCAAGCT CTTTTGTGCT TGGCTCTACG GAAAAGACTA CGGTAGGGAT GTTGAATCGG ACCAGCCAAG CGGCACTGTC CTCAAACCCG AAGACTTTGA TATCGCCTTA GAACCAGCGG AATACTTGGG AGGGCTTTGC GATTTGACGG GTGAAGTCGG CCGATATGCC GTGCAACGTG GAACAGCGCG TGATGTCAGG GGAGTGCAGC TATGTTTGGA AACCAACACG AGTATTTATA CTGCGCTTCA AGCGATTGGT CGCCTTCCCC AAGGCATTCC AAAAAAGATG GATCAACTCC GATACAGCGT GGAAAAAATA GAACGTATGC TATACGAAAT GAGCCTTTCC GAGGCCGCGG GTGGACGGAA TGTTCGCAGT GAGGTTGAAG AGTCCTCTGC TATGAACGAA GAGAACTAAG ATTCCTTGTG TTTGGATTTA TATCAGCACG AAGAGAATTT TATCTCGACA TCGTAGCTTC TAACCATTTT ACACGCTACC TAGTTTTATA AGTGGCGTT
|
Protein sequence | MEAILIRAAR NRALSLHHCK LPSIATAALC LSNSLFHTAT AITMPSPPYA YRSSDSKPLD LSALEAQITT LEDQRQRAFE LSRDMQVAIL QAKTALEYNE DTSSVTTKLD TLLENPLLKQ LDASDRAPRH ANLSFKMEDY LRFLAFQRFL QSADLLPPAA PFTDEEYLGA CMGLAQDLQR YGLGRATVRD VASVQAAADL VGDLLDFLLQ LDFRNGPLRR KYDGTKYSLK ALETLLYELA VTDSSRAVEG TSPAKRSKVE KASTMLLPLD SLQALKSRMV YRDDLRESLI KKCRDGQKAA KQSIFALHRG DKEKALELLT ECHNGIVNEL LPIVVEEPLL RNGSFANVLE EYVEGKLFCA WLYGKDYGRD VESDQPSGTV LKPEDFDIAL EPAEYLGGLC DLTGEVGRYA VQRGTARDVR GVQLCLETNT SIYTALQAIG RLPQGIPKKM DQLRYSVEKI ERMLYEMSLS EAAGGRNVRS EVEESSAMNE EN
|
| |
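The two sequence fields can be compared by translating the gene sequence and checking it against the listed protein. The sketch below is a minimal illustration, assuming the standard genetic code (the Translation table field is empty in this record, so this is an assumption) and using only the three 10-base blocks that begin with the first `ATG` matching the protein start. The helper names `strip_blocks` and `translate` are hypothetical.

```python
# Standard genetic code (NCBI table 1), built from the conventional TCAG ordering.
# Using table 1 is an assumption: the record leaves "Translation table" blank.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def strip_blocks(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Three consecutive blocks copied from the Gene sequence field, starting at the ATG.
gene_fragment = strip_blocks("ATGGAGGCTA TTCTCATCCG TGCAGCCCGT")
# First block of the Protein sequence field.
protein_start = strip_blocks("MEAILIRAAR")

print(translate(gene_fragment))                   # MEAILIRAAR
print(translate(gene_fragment) == protein_start)  # True
```

To check the whole record, the same functions could be applied to the full Gene sequence field after slicing from that `ATG`; the sequence is listed in coding orientation even though the Strand field is `-`, so no reverse complement is needed for this fragment.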