Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_52094 |
Symbol | RNAP-II_1 |
ID | 7204582 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 111632 |
End bp | 112673 |
Gene Length | 1042 bp |
Protein Length | 319 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185825 |
Protein GI | 219121193 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.830096 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCATAC TGGAACGTCC ATCCGAGCAC GAACTGGTCT TGGAATTTCT CCACGTTGAC ACGTCTTTCG TCAACGCGTT GCGGCGAATC CTTTTGGCTG AAGTACCGAC GGTGGCGTTG GAAAACATTT ACATGTGGGA AAACTCTAGC CTAATACACG ACGAAGTGTT GTCGCACCGG CTGGGACTGA TTCCCTTGAA GATTGACGCT CGCTTATTGG ACGAACAAGA CGACGACGAT CCCAGTCCGA CCGATCGCAA CACGGTCGTT TTTCGCTTGG GAGTCTCGTG CGGCTCAGAT CCCAACAAGG GCAAAGCCAA GCACAAAGCC GATTCTCACG AGAATGCCAA TGACGCTGCG GAAGCGGATC AGGATGGTGT AGTCCGTGAC ACGGAACTAG AACAAGCTGC TGCCGAAGCC GCACAGTCGA GCAAGGCTCC TAAAGTCCGC TTTCCTCGAG AGCGACCCTA CACCAAACAT GTTTATTCCA AAGACCTAGT ATGGGTCCCG CAAGGCGATC AGGAAGAACG ATTGCAGGAC ATTTGTGCCA TGCACGACGA TATCTTGATT GCCAAACTCC GACCGGGACA ATCAATTGAA TTGGAAGTCC ACGGCCGGGT AGGGGTGGGC AAGGACCACG CAAAATTCTC TCCCGTCGCC ACAGCGTCCT ACCGTCTCAT GCCGCACATT GAACTGTTGC GGGACGTCTA CGATGAGGTG GCAGACGAAC TCGTGCACGT TTACGAGCCC GGCGTCTTTG AAATCGTACC GACCGACGCA ACGGATCCGG CAGGAACGTC CCGCAAAGCT CGTGTGTGCA ATCCCTATGC TTGCACCATG AGTCGCAACT ATATGCGACA CACCGAACTC GCTCAAGCCG TACGGATGAG CCGCATCCCG GATCACTTTA TTTTCTCCGT CGAAAGTGTC GGCATGTACG CACCCGGGGT GCTAGTGGCC GAAGCCTTGC GCATTCTGCA ACGCAAGTGT AGAAAAGTCA TGGATTATGC GGACGAAGCA AACCTGAAAC AGGATCATGT GT
|
Protein sequence | MRILERPSEH ELVLEFLHVD TSFVNALRRI LLAEVPTVAL ENIYMWENSS LIHDEVLSHR LGLIPLKIDA RLLDEQDDDD PSPTDRNTVV FRLGVSCGSD PNKGKAKHKA DSHENANDAA EADQDERPYT KHVYSKDLVW VPQGDQEERL QDICAMHDDI LIAKLRPGQS IELEVHGRVG VGKDHAKFSP VATASYRLMP HIELLRDVYD EVADELVHVY EPGVFEIVPT DATDPAGTSR KARVCNPYAC TMSRNYMRHT ELAQAVRMSR IPDHFIFSVE SVGMYAPGVL VAEALRILQR KCRKVMDYAD EANLKQDHV
|
| |