Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_33371 |
Symbol | |
ID | 7204217 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 719170 |
End bp | 720190 |
Gene Length | 1021 bp |
Protein Length | 333 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186116 |
Protein GI | 219113065 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGACA AAGTGTGCCC GGCAATGATT CTTCCTCAAG TATCAAAATT CGCTTGGAAT GGTAGCATCC TAATATTGTT GCCTCTGGCT GCGTTGGCTC AGGTCAAAGT GTTTCTGCTG GCAGGACAGT CAAATATGGT CGGTATGGCA TCAGTACAGC ATTTAGAGAT ACTGATAAAC GATCATAACA TTACCCATAA CGATTTTCGG GAAGATCTTT GGAACGGGAC TGGCTTTCGA TCGCGTGACG ATGTTTTTGT TAAATACAAT GATCGCGTCG GGAAATTGGA GCCCGGATAC GGAGCCTCTG TGAGCAAGTT TGGTCCTGAA TTGGGATTTG GGTGGACAGT CGGGGACGCC TTCACAGATA ATCCTGTCAT CCTTATCAAG ACAGCTTGGG GGGGGAAGAA ATCTTGCTGT TGATTTTCGG CCCCCACTGT CGGGTGAAGG TCAATTCCCT GACGTCAAAC CATCAAAGTA TGGATGGGAA TACCGACAGA TGATTCATGC TATTTTGGAT GGGCTTGAGG CTATCCAGGA AATTTATCCA GACTACTGTG AAGACCAAGG CTACCAACTT TGCGGTTTTG TGTGGTTTCA GGGATGGAAT GACATGCTTT CATGGCCTTT TGTTAGAGAA TATGGCTTCA ACCTTGCGAA TCTTATCCGG GACATACGCC GAGAAACGGA CGAGCCGTCC CTTCCTTTCG TTGTCGGGGA ATTAGGTATG CATGGAAACT TGACTGGCGA TCACAGCACA GCAGCAACGC GCGTCAAAAC GATTCGGGCC ATGGAGCAAG GCGTCACTTT GCTGAGCGAA TTTCAAAATA ACACTATCTT TGTGAAAACG TCACCGTACG TTATCAACAA CGGAACCAAA TACAACAAAA TATATCACTA TAATGGACGC GCTGACACTT ACTATCACAT GGGAAAAGCT TTTGGAAGGG GGCTTTTGCA GATTCTAAAC AATTCGGCTG CCACGCGACA AAGAGTTCGC AAAGCACGAC CAAAAAACTG A
|
Protein sequence | MIDKVCPAMI LPQVSKFAWN GSILILLPLA ALAQVKVFLL AGQSNMVGMA SVQHLEILIN DHNITHNDFR EDLWNGTGFR SRDDVFVKYN DRVGKLEPGY GASVSKFGPE LGFGWTVGDA FTDNPLGGGR NLAVDFRPPL SGEGQFPDVK PSKYGWEYRQ MIHAILDGLE AIQEIYPDYC EDQGYQLCGF VWFQGWNDML SWPFVREYGF NLANLIRDIR RETDEPSLPF VVGELGMHGN LTGDHSTAAT RVKTIRAMEQ GVTLLSEFQN NTIFVKTSPY VINNGTKYNK IYHYNGRADT YYHMGKAFGR GLLQILNNSA ATRQRVRKAR PKN
|
| |