Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42934 |
Symbol | |
ID | 7196188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1563209 |
End bp | 1564903 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | arylsulfatase |
Protein accession | XP_002176810 |
Protein GI | 219110117 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGTAA GGCATGATCG GCTAAGGCTG ACTCGCCCGT GGATCTCGCT GCGTACCTTC TACCTTGCGG CATCGCTATG TTCGGGAAAG ACTTGGTGCC TACCGGAGAC GCAAGGCTCG ACCACGTCGG ACAACCATGC CTCCGCCGTC GAGGGGATCA CCAACACCAA TAGCTCGTCC TTCTTCAGAC CACATATACT CATGATTATC ATGGATGACT TGGGTTCGCA CGATCTCGGT ATACACGAAA ACAGCGGCAT CCAAACCCCG CACGCGGACC AACTCGCTCG GGATGGGTTG TATCTGGACC AGTACTACGT CCTACCCTAC TGCTCCCCGA CCCGCGCTTC GCTTTTGTCG GGGCGGTATC CGCTACACAC TGGTTGTCAC ACGATCGTCA ACGACTGGGA AACGCAAGGT TTGCCCTTGG ACGAGGAAAC CTTGCCGCAA GTATTGCGCC GTGCCGGGTA CCAAGCCCAC GCCGTAGGCA AGTGGCACGT TGGACATTCA CGGTGGACGC AAACCCCAAC TTTTCGCGGC TTTCAATCCT TTTTTGGATT TTATTTGGGC GCGCAGGACT ATAATACCCA CATCAAGCAA GGGGAGCGAG GAAATGCCTA CGAAATGCAC TGGGATGCAC GGGGAAAATG TGGACGGGAC TGTTCGAGGC TCGTCGACGA AAGGGGAAAC TATTCGACCC ACGTCTTTAC ACGAGAAGCC ATTCGTGTTA TTGAAAACCA TCCGCAGCGA CCGCATGAAC CTCTCTTTCT TTATCTGGCA CACCAAGCGG TACATTGGCC AGACCAAGTG CCGGAAACCT ACCGAAAGTT TTACGAGGGT GCAACGTATT CAAACTGGAC GGATCAGCGC AAAACGTATG CAGGTATGCT GAGTGCAGCG GATGAGTCAA TAGGAAACGT TACCAAAGCT CTACAGGACG CTGGTATGTG GGAAAACACT CTTGTCGTCT TTACCACGGA CAACGGCGGA CCGACAGCCG TGTGCGCTGC TCAAGGATCG TCGAATTATC CAAAGCGAGG TGGAAAGTGC ACCGTTTACG AAGGCGGAAC GACGGGTGAC GGCTTTGTCA GCGGACCGGC CTGGAATAAG GTTGCTAGGT CAAGAAAGAA AGAATATTCG GAAACGTTGG AGCTGTATTC CAAAGTGTTT CACGTTGTGG ATTGGTTGCC AACATTAGCC CGCATGACGG GTGCGACACC CAATGGCAAG CCGCTGGACG GTGTCAATCA ATGGGACTCC ATGCTTCAGA GAGAACCGAG TGCCCCACCG CCTCGCGAAG AAGTATTTGT CGGTTACGCC TACTTTGGAA ACCAATGGTA TGGACCCGCG ATTCGGTACA AGCACTGGAA ACTCATTCAA GGACAGTCTG GGGGACCGGA AACATCCCAC GATTTACCAC CTGGATCGTT TCTACCCGCA CCAGGCGGTG CTCCTGGAGA GTATCAACTA TACGATTTGC AGAGTGATCC TTCCGAGACG CAGAACATTG CATCGAGTTA CCCCCTCATT GTACAAATAC TACAGGGCAA ACTTATCGAG TATCACGCGT CCTTCGTACC GCCTATTTCG AACGATCCGA CCTGTCCCTT TACCGGAACA ACCAACACGA GTACATTTGG TCCAACCTGG TTGCCTTGGT GTGAGGGGTC GTCTGAGCTA CTGGTATACA CATAA
|
Protein sequence | MRVRHDRLRL TRPWISLRTF YLAASLCSGK TWCLPETQGS TTSDNHASAV EGITNTNSSS FFRPHILMII MDDLGSHDLG IHENSGIQTP HADQLARDGL YLDQYYVLPY CSPTRASLLS GRYPLHTGCH TIVNDWETQG LPLDEETLPQ VLRRAGYQAH AVGKWHVGHS RWTQTPTFRG FQSFFGFYLG AQDYNTHIKQ GERGNAYEMH WDARGKCGRD CSRLVDERGN YSTHVFTREA IRVIENHPQR PHEPLFLYLA HQAVHWPDQV PETYRKFYEG ATYSNWTDQR KTYAGMLSAA DESIGNVTKA LQDAGMWENT LVVFTTDNGG PTAVCAAQGS SNYPKRGGKC TVYEGGTTGD GFVSGPAWNK VARSRKKEYS ETLELYSKVF HVVDWLPTLA RMTGATPNGK PLDGVNQWDS MLQREPSAPP PREEVFVGYA YFGNQWYGPA IRYKHWKLIQ GQSGGPETSH DLPPGSFLPA PGGAPGEYQL YDLQSDPSET QNIASSYPLI VQILQGKLIE YHASFVPPIS NDPTCPFTGT TNTSTFGPTW LPWCEGSSEL LVYT
|
| |