Gene PHATRDRAFT_42934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42934 
Symbol 
ID7196188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1563209 
End bp1564903 
Gene Length1695 bp 
Protein Length564 aa 
Translation table 
GC content53% 
IMG OID 
Productarylsulfatase 
Protein accessionXP_002176810 
Protein GI219110117 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGTAA GGCATGATCG GCTAAGGCTG ACTCGCCCGT GGATCTCGCT GCGTACCTTC 
TACCTTGCGG CATCGCTATG TTCGGGAAAG ACTTGGTGCC TACCGGAGAC GCAAGGCTCG
ACCACGTCGG ACAACCATGC CTCCGCCGTC GAGGGGATCA CCAACACCAA TAGCTCGTCC
TTCTTCAGAC CACATATACT CATGATTATC ATGGATGACT TGGGTTCGCA CGATCTCGGT
ATACACGAAA ACAGCGGCAT CCAAACCCCG CACGCGGACC AACTCGCTCG GGATGGGTTG
TATCTGGACC AGTACTACGT CCTACCCTAC TGCTCCCCGA CCCGCGCTTC GCTTTTGTCG
GGGCGGTATC CGCTACACAC TGGTTGTCAC ACGATCGTCA ACGACTGGGA AACGCAAGGT
TTGCCCTTGG ACGAGGAAAC CTTGCCGCAA GTATTGCGCC GTGCCGGGTA CCAAGCCCAC
GCCGTAGGCA AGTGGCACGT TGGACATTCA CGGTGGACGC AAACCCCAAC TTTTCGCGGC
TTTCAATCCT TTTTTGGATT TTATTTGGGC GCGCAGGACT ATAATACCCA CATCAAGCAA
GGGGAGCGAG GAAATGCCTA CGAAATGCAC TGGGATGCAC GGGGAAAATG TGGACGGGAC
TGTTCGAGGC TCGTCGACGA AAGGGGAAAC TATTCGACCC ACGTCTTTAC ACGAGAAGCC
ATTCGTGTTA TTGAAAACCA TCCGCAGCGA CCGCATGAAC CTCTCTTTCT TTATCTGGCA
CACCAAGCGG TACATTGGCC AGACCAAGTG CCGGAAACCT ACCGAAAGTT TTACGAGGGT
GCAACGTATT CAAACTGGAC GGATCAGCGC AAAACGTATG CAGGTATGCT GAGTGCAGCG
GATGAGTCAA TAGGAAACGT TACCAAAGCT CTACAGGACG CTGGTATGTG GGAAAACACT
CTTGTCGTCT TTACCACGGA CAACGGCGGA CCGACAGCCG TGTGCGCTGC TCAAGGATCG
TCGAATTATC CAAAGCGAGG TGGAAAGTGC ACCGTTTACG AAGGCGGAAC GACGGGTGAC
GGCTTTGTCA GCGGACCGGC CTGGAATAAG GTTGCTAGGT CAAGAAAGAA AGAATATTCG
GAAACGTTGG AGCTGTATTC CAAAGTGTTT CACGTTGTGG ATTGGTTGCC AACATTAGCC
CGCATGACGG GTGCGACACC CAATGGCAAG CCGCTGGACG GTGTCAATCA ATGGGACTCC
ATGCTTCAGA GAGAACCGAG TGCCCCACCG CCTCGCGAAG AAGTATTTGT CGGTTACGCC
TACTTTGGAA ACCAATGGTA TGGACCCGCG ATTCGGTACA AGCACTGGAA ACTCATTCAA
GGACAGTCTG GGGGACCGGA AACATCCCAC GATTTACCAC CTGGATCGTT TCTACCCGCA
CCAGGCGGTG CTCCTGGAGA GTATCAACTA TACGATTTGC AGAGTGATCC TTCCGAGACG
CAGAACATTG CATCGAGTTA CCCCCTCATT GTACAAATAC TACAGGGCAA ACTTATCGAG
TATCACGCGT CCTTCGTACC GCCTATTTCG AACGATCCGA CCTGTCCCTT TACCGGAACA
ACCAACACGA GTACATTTGG TCCAACCTGG TTGCCTTGGT GTGAGGGGTC GTCTGAGCTA
CTGGTATACA CATAA
 
Protein sequence
MRVRHDRLRL TRPWISLRTF YLAASLCSGK TWCLPETQGS TTSDNHASAV EGITNTNSSS 
FFRPHILMII MDDLGSHDLG IHENSGIQTP HADQLARDGL YLDQYYVLPY CSPTRASLLS
GRYPLHTGCH TIVNDWETQG LPLDEETLPQ VLRRAGYQAH AVGKWHVGHS RWTQTPTFRG
FQSFFGFYLG AQDYNTHIKQ GERGNAYEMH WDARGKCGRD CSRLVDERGN YSTHVFTREA
IRVIENHPQR PHEPLFLYLA HQAVHWPDQV PETYRKFYEG ATYSNWTDQR KTYAGMLSAA
DESIGNVTKA LQDAGMWENT LVVFTTDNGG PTAVCAAQGS SNYPKRGGKC TVYEGGTTGD
GFVSGPAWNK VARSRKKEYS ETLELYSKVF HVVDWLPTLA RMTGATPNGK PLDGVNQWDS
MLQREPSAPP PREEVFVGYA YFGNQWYGPA IRYKHWKLIQ GQSGGPETSH DLPPGSFLPA
PGGAPGEYQL YDLQSDPSET QNIASSYPLI VQILQGKLIE YHASFVPPIS NDPTCPFTGT
TNTSTFGPTW LPWCEGSSEL LVYT