Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47749 |
Symbol | |
ID | 7202733 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 719781 |
End bp | 720948 |
Gene Length | 1168 bp |
Protein Length | 327 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181962 |
Protein GI | 219123294 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGCCG CTTCGTTGCA TGCACCGAAC GACGCTTTGG CGGAGCTTCT CGCTCCGGAC GGGTATTACA AGTACCTAGG CGTTTGCAAG CCTTCACCGG CAGCGGAAAG CTCTGGTAGA TCGTCGGAAA TGGAAGGTCC TTCGGGGTCT TCTGCGAAAG AAGACACACT CGACGAAGAT ACTGTAAAGA AAGCATACCG TAGACTGAGT CGAAAACACC ATCCCGACAA GCCTGGCGGC GATGCCGATA CGTTTCGCAT GCTCAACCGA GCGCAGAAGG TTCTTCTCAA TCCGAAGCTC CGCCAACAAT ACGATATTTT GGGAATCGAT TTGGATGATG ACGAAGAGGA GCATGCCGAC AACAATCATC ACGATCATCC TGACGATAAA AAAGATGGGA ATACGGCGCA AGGCATTGTT CATGAAATCG CTAGTATGGC GTTGACAACT ATCGTACAGC TCGGAGTTCG AACCCGTACG TACTTGTTTT CGTCAGACGA AGAGACGGTA ACTGACACCA TCGCTCACGC GCTTCATCCC TCTTCCGTTT CCGGAATTCT TTGGCAGTCA TGCTGGCTGG AGTATCAATC CTCGTTACCC GGTATCGCTG GACCGTGTAT CCAGCCATAC TTTTTCTAGC ATACATTGCT TTCACTATTC TGAAACAAGC CCGGTTGCCG GGGCATTCCC TGCTTGATAT GTTACCTCCG TTGCTGATTG CAACTGGTCT TTTGTGCATG TTCTACGGTC GCGTAGTATC GGTCGGCGAC AGTGATTCGC CGGATGCCGG TACAACGACC GCACCTTCAT GGACCTGGCT GTTCTGGAGC GGCGAAGTGC TGGTTATTGC CATGTTCACC TTCAATTCCA TGAGTGCGAT ACCCAAGACT CCCCTTGTTT TGTCGTTACT CGGTATATTC TCGGCCCTTA CGGCACTTTG GTTTCGCGGA AAGTTTTGGA ATTACGTAAT TGTCCTCGTC ATGGAAGGCT TGCTTGGTGT ATTCGTGGCC TTGGCCTTCC CCGTAATGGA ACTCATTCTG GAAGCGGTTC TGAATGAGAA ACTGAAACGG GTGGGCGACA AGGTTCGTGC GCATCATCGA CAGTTAGAAG CCTACTACGC GGCCAAACTG CAACAAAGGG ATCACTAAAT TAAGAAGTGG TATCTTTG
|
Protein sequence | MPAASLHAPN DALAELLAPD GYYKYLGVCK PSPAAESSGR SSEMEGPSGS SAKEDTLDED TVKKAYRRLS RKHHPDKPGG DADTFRMLNR AQKVLLNPKL RQQYDILGID LDDDEEEHAD NNHHDHPDDK KDGNTAQGIV HEIASMALTT IVQLGVRTPY IAFTILKQAR LPGHSLLDML PPLLIATGLL CMFYGRVVSV GDSDSPDAGT TTAPSWTWLF WSGEVLVIAM FTFNSMSAIP KTPLVLSLLG IFSALTALWF RGKFWNYVIV LVMEGLLGVF VALAFPVMEL ILEAVLNEKL KRVGDKVRAH HRQLEAYYAA KLQQRDH
|
| |