Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36301 |
Symbol | |
ID | 7201904 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 115782 |
End bp | 116862 |
Gene Length | 1081 bp |
Protein Length | 328 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180757 |
Protein GI | 219120018 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.208439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGACTA AAAGCGAGGG ACGTCTTGGG ACTGACGAGA TGGTGCCTTT ACACGCTTGC CTTGCACAGT TTGACCTTTG GATTGCCGCC CTGCATGACG ATACTCACGA TTGGTCGCAG ACATCACGCT CACATTGTAC CAGCGCCATA CTTTCCGTCC CAGCTTTGCT ATCTCTCATT CTTTCCAATA GCAGAACTAG CACAATTTTA AAGGACAAAG AGCAAAGCAA TCTGGACCCG GTTCTAAAGC TAACCAATGT TTGCTACGCT GGCAATACTA TTTCAAACAA GAAAACGCTC ACCACGAGCT TGACTGCTGC CGTGGGTGTT GGTGGCTCCT CTGTTTTGGT TGCCATTGCA GGATTGGGCT ACACCTCGGC CGGTATTACC GCCGGCAGTT GGGCTGCTTG GATCATGTCG GCTGAGGCTG GCATGGCTGG TGGCGGCGTA GCTACCGGGG GTCTGAGCGC CACTCTACAG AGCGCAGGAG CAGTTGGCCT CATGGGAGCC GGCTTGGGCC TCACTACATG TCTCTTTGCG GTAGGAGCTG TAGTGGGTGG TACCGCGGCG ATGTATACCG TACGACACAA CCGGATGAAT GCAATTCGAC TCGGTACAAT TGCTACATTG GATCAGGGCC TCGCAGTATT CGCCCATGGG AATGTCGTTG CGTTGGTGTC GTCCAAGCAT AACCGCATTT TGAGAGTGGG TGATCAGTCG CTGATAGATG CGTATGGAGA ATGTAGCGAT CAACCGCCTA GTTTACCGCT CGAATGGGAT AGGGAACGAT TTTTGGTAGT CCGTGTTGGG GAAAGAAATT GTGCGCTTTA GCCTGTCTGC GAGGCGTTTT ATACGCGTGG TGGATGAGAA CGGTCTTTCG TTGAGTGGAT TGGAGCGAGC CTCGAATCTG GAACCGACGG GAGGAGAGAT CTTCACCATT CAAAAAGGCA TCAATGGCAA TGTTTCCTTC TTTTCACAAA TGACGGGAAG CTTCGTCAGT ATCAATCAGA AGGGTGTTGT TTCCGCTTTG GCTACTGAGG CTGGAGAATC CGAGCACTTC AAGGTATTGG TCTTGATTTA G
|
Protein sequence | MLTKSEGRLG TDEMVPLHAC LAQFDLWIAA LHDDTHDWSQ TSRSHCTSAI LSVPALLSLI LSNSRTSTIL KDKEQSNLDP VLKLTNVCYA GNTISNKKTL TTSLTAAVGV GGSSVLVAIA GLGYTSAGIT AGSWAAWIMS AEAGMAGGGV ATGGLSATLQ SAGAVGLMGA GLGLTTCLFA VGAVVGGTAA MYTVRHNRMN AIRLGTIATL DQGLAVFAHG NVVALVSSKH NRILRSVLGK EIVRFSLSAR RFIRVVDENG LSLSGLERAS NLEPTGGEIF TIQKGINGNV SFFSQMTGSF VSINQKGVVS ALATEAGESE HFKVLVLI
|
| |