Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46381 |
Symbol | |
ID | 7201763 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 166999 |
End bp | 168199 |
Gene Length | 1201 bp |
Protein Length | 201 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180769 |
Protein GI | 219120043 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.38852 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAGACCGACC TACAAACTAG AAACATCGAC CATGCCGCCT TACGGAGAAC CAGACTGGGC CACCCCCGGA AACACATCCA ATGTTGCTAC GCAGAATGCA GGAACACCTA CTGCAGCGAC AGCTTCTTCA GGAATGAACG GCAACAGCAG TGAATCGCGG TACGTTCGAC TGTCATTGTG CAGTGTTCCG TTGAGCTTTG GTTGAGTACC GGATTCCGTT TTTGTCAGAT TCGACAAGGT TTTGCCGCCA GACGAGGGAA TTTTGCTCTA CCGCAAGGAG ATCTATATCT CTGTGGTCCA TTCCACCACC GTTTTACTCA TCCTGCTGAC TTTTGTCACT TCTTTCTGCA GGCAAAAGGC TCGATGGGCG ATTTCATTGC TGTCGTTTCT TAATTTCGGG CTGGCTGCTA TGATGGGAAC TCTAGGTGTT CTCTCCCTCA TCCATTTCAA CCCTGGGAGT TCTTCGGACT ATTCAGCAGC ATTTCTTTCG TCCTACATGG TCATTTTTGC TGTGATCTTG TTCCTCTACG AACTTATTTG GTGGACACCA ATTGCCGCAT TGAACAAAAT GTTCCGAATG AATTTCGGTT TCATGTATGG ATTGCGAGGG AAAGGTCTTT TCTTGGTTTT TATTGCGTTT CTTTGCCTAG GTCTTCGAGA TGAAAATGCC TCTGGGGTGA AAGGATTGGA CTGGGCAACC GGTCTCGCTT GGTTGGGCGC AGGATGTTTC AATATTTTTA TTTGGATGAC CTGGTCGGAA GCGTCTGCGG CTTACAAGCC ACCGACAGCT GGTCTGACTG GACCCAGCGA CAGTAACACC GTTGTGTAGA TCAAGTGAAA GGATCAAGAG AAGGGATGGT CAGCCAACCG CACTACATCT TAAAGCCGAT TTTCGTGTAC TTAGCTCTAC TCAAAGTCTA CCTACAGTCT ATGTATGTCG GGCTCACGAC GTCGTCCCCA TCATTGTTTA CAGGCGGGGA TCCGAGCCAT ACAATGTAGA TCACCCTTTA GTTCAAAATC GATGAAGGAG TTAGCAATTC ACAGTGATGC GGTACACCGT TTCTGGCAAG CTGGAGCGAG CATAGCGAAT GTGAATGTAC CAATTTTATG TTGGGTTGTA ATCCTACAAT GCGGCTCCTA CAGCTGCAAA TTGTTGACCA TGCACATCGA TAACTTATCT AGACGTGGGA CGACGTCTAT T
|
Protein sequence | MPPYGEPDWA TPGNTSNVAT QNAGTPTAAT ASSGMNGNSS ESRQKARWAI SLLSFLNFGL AAMMGTLGVL SLIHFNPGSS SDYSAAFLSS YMVIFAVILF LYELIWWTPI AALNKMFRMN FGFMYGLRGK GLFLVFIAFL CLGLRDENAS GVKGLDWATG LAWLGAGCFN IFIWMTWSEA SAAYKPPTAG LTGPSDSNTV V
|
| |