Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48554 |
Symbol | |
ID | 7194723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 199459 |
End bp | 201128 |
Gene Length | 1670 bp |
Protein Length | 356 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | DNA-binding transcription factor |
Protein accession | XP_002183175 |
Protein GI | 219125831 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.911244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCTGTTTTC GTAAAACGAA GTGCCTCCAT GTCCTCATCC CACAAAAGTG TTTTGCGGTT CGCGTCGGGT TTCATAAAAG GTTGATCGCA AGGTTTCTTG TCGCCAATTG TAGCTCCATA AAGAAATCGC CGACCAGAAG GAACCAGTCT CAAGACGTTG CCTCACGATC GACCATGATC AGCAACAGCA ACAGCAACGG TACAAAATCA CACTCTATTC TATCGAAACC CCTCAGCACG TTGTTGAAGC AGATGAGCGA AGAGCCAGAT TTTCTTCGAA TATTCGAAGA CGAAGATTTC GGGGGAATGT TGCAGCAGTC CATAGAGCAC ATAGGGACGT TTTCAAGAGA TGTCCCATCC GTTCCCAAAG TTACAGGAAC GAATATGGAG AGATCAATCA AGACTGTCAC TTCTGGCCGA TCTTGCTCTG CTTCCACTTG CTGTGAATTT GTGAACGTCG TTTTGATAGT CAATCCAATC ACACGCCAAG TATGTTGTCC ACAACAAACA CTAATCCACA ATTCACAAAA GGCTGTGATA GGGCCAGCTG TCTTGGACGG CTTTCCGATC GACACAATCG ATGGGCTATC ACAGGAGTTC CAACGAATTG GACCGGACGG ACAAGATAGA GCCCCTGGTT GTGCAACGTC AGAAACGGGT GGTTCTCAAG AAATCCCTTT GCAAACGCCT TCGCCCGTTA CGCAGTCGCA ATACCATGTA AAGCATCCTT CGCCTTGCAT ATTGCACTCG AAGAAATCGA GAGTCTCTCA AACGCTTTCC CTGGCAAAAT TCGACGAATG CGTTTCAAGT GGTACCCGAT CGCTTTCTCC GTTGCATATC GATCACATAA CTATACAGAG TGCTTCCTCC GTCAACGCTG CACCACTACG AGCCTTGTCC GCCTACAATT TTTTCTTTCG TGACGAGCGC CACCGACTTT TACATGGCGG CGACCATGAT TGGAGCAACC AGAAACGTCA AGCTCTACTC AATTTGCACT GGCATCGAGA TCGGGCGAAA AAACGAAAGC ATCGTAGAAC GCACGGCAAA ATCGATTTTA CAACTCTCTC TAAGTGTATC TCGCAACGAT GGAGGGATCT TTCGGAAGAA AGTAAGGACT TTTACCGAGA CATAGCGACA GCCGACTGGC AACGCTACCA ACAACAAGTC AACCACATCG CGAGCTTTCG CATGCAAAAC GATGACTCTT TTCTGCCTGC TGATTTTTCG AGTGTAGTGG GATGAGGAAT TCAAGAAACT GTACGCGAAG CAGAAGCCCC TGTACCAGAA AGCCGAGAGC CCAAGAGTTT TCGCTTATGT TTATGTTTCA ATTTTTAAGA ACATCACAAC GATCAATGTC TTGCCTCCGC ATTAGAATAA CCTACCTTGC TCCGGAAAGA AAAGCAAATC GAAAGGTTTG TACGACCGGT ACAACGGTTG TCGCTGGGTC GCTATGAGGA ATAAACAACC CCAAACCAGG GAAAGGTAGA ACTACCACCT GTAAGATTGG CTTACTGTTC GTTTCTACAT TACAGCCTCG CAAGTTACGA GAATACTTAG TGGCACAAAT GTTTTTATTT ACTTGAATTG TTTGAACGAG GCGAAAGTCT TTCTATATGT TAGGCATCGT TCATTTGGTT GCCTGGATTC AACTACTCGT TTCTTTGAGG
|
Protein sequence | MISNSNSNGT KSHSILSKPL STLLKQMSEE PDFLRIFEDE DFGGMLQQSI EHIGTFSRDV PSVPKVTGTN MERSIKTVTS GRSCSASTCC EFVNVVLIVN PITRQVCCPQ QTLIHNSQKA VIGPAVLDGF PIDTIDGLSQ EFQRIGPDGQ DRAPGCATSE TGGSQEIPLQ TPSPVTQSQY HVKHPSPCIL HSKKSRVSQT LSLAKFDECV SSGTRSLSPL HIDHITIQSA SSVNAAPLRA LSAYNFFFRD ERHRLLHGGD HDWSNQKRQA LLNLHWHRDR AKKRKHRRTH GKIDFTTLSK CISQRWRDLS EESKDFYRDI ATADWQRYQQ QVNHIASFRM QNDDSFLPAD FSSVVG
|
| |