Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54289 |
Symbol | RNAP-II_2 |
ID | 7199698 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 2690 |
End bp | 6619 |
Gene Length | 3930 bp |
Protein Length | 1211 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178910 |
Protein GI | 219116230 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGGT TTGACCACGT TCATGGACAA GACGTGATCG ACGCAGTCGA TCAGCACCCT TTAGGTTCCG CAGGCCGAAA GCTTTTGGAA GACGACGACC ACGGCAAGGG AGGCAAGCAG CACATCCAAC AACCTCAAGT TCACGACTAT GGATCTACCA AGACACCCGA TGACGCCCCA GTTGGAAGTC TTCGGGATAA ATGGAGATTG CTTCCGTACT TTCTAAGACT ACGATCGCTT CTCCGCCAGC ACATTGACTC CTTCGACCAT TTTGTCGATG TCGAAATGCA GCAAATTGTC CAAAGTCCTT CGGCGTGCGA GATTCGTTCC GAACACGATC CAAAATTTTA TTTGCGTTAT GAGGCTTGCT GGGTCGGAGA ACCCTCTGTG GAGGAAGATT CTTATTCGGT GAAATCCGCC ACGCCCTTCC AATGCCGACT TCGCGACTGT ACTTACTCGG CACCTATCTA CGTCAACGTT CGCTACACCC GGGGACGGCA GATTGTCGTC AAAAGAAAGG TCATGATTGG GCGAATGCCA ATCATGTTGC GCTCCAAGAA ATGTCTTCTC CGGGACAGGT CGGAAGACGA TCTGGCGGGT ATGAAGGAAT GCCCTTACGA TCCTGGCGGC TACTTCGTTA TTAAGGGCGT GGAGAAGGTA ATACTTATTC AGGAGCAATT GTCCAAGAAT CGAGTCATCT TAGAAGAAGA CAATAAAGGT ATCATGGCGT CGATCACGTC CTCCACCCAT GAAAGGAAAT CGAAGGCCTA TATTCTGATT AAGAACGGCC GCGTGTATCT TAAAAACAAT ACTCTCGGCG ACGATATTCC AGTTGCTGTT GTACTGAAGG CAATGGGCAT AGAGTCTGAT TTGGAGCTTG TGCAGCTTGT TGGTAGCGAG TCACCGATCA TCAACGCACT GGCCCTTTCG CTGGAAGAGC CTAGCCGTCT CGGTATACAC ACACAAGCTC AGGCACTACG TTTCATTGGT GCCAAAATTC GTGGCCGATC GGGGCCCAGT GGGCCAAGCA GTAGCTATAG AAAGAATGTG TCTCCCGAAG ACGAGGCGAG AGAGGTGCTG GCGAACGTTG TACTGAGCCA CGTTCCCGTT ACACATTTTG ATTTTCGTGA GAAGGCTCTC TATATTGGCC ATATTGTTCG GCGAGTATTG TTGGTACATC TTGGCAAGAT GCCGTTGGAT GACAAGGATT ACTACGGAAA CAAACGCCTC GAACTAGCTG GAAATCTATT GAGCTTACTT TTTGAGGATT TGTTCAAGAT TTTCAACAAA GATTTGAAGC GACAGGCCGA TCTCGTCCTG TCAAAGCCAA ATCGCACACA AGCCTTTGAT GTTGTCAAAA CGATTCGCCC TGATACAATA ACAAACGGAA TGATCAACGC TATTGCAACA GGAAATTGGG TGTTAAAGCG ATTTCGTATG GACCGAGCCG GGGTCACGCA AGTTCTGTCT CGTCTTTCGT ATATGTCTGC CCTTGGCATG ATGACCCGAA TCAACAGCCA ATTTGAGAAA ACTCGGAAAG TGAGCGGCCC GAGATCATTG CAGCCTTCGC AATGGGGTAT GTTGTGTCCC GCAGATACTC CCGAAGGTGA AGCCTGCGGT TTGGTGAAAA ATTTGGCGTT GCTTGGACAC ATTACAACCG ATGAAGCTGA CACAAGACCT ATTGAACGGA TTTGCAGAGA CCTTGGAGTT GAAGACGTCA AGCGGATGAC GGGCCACGAA ATTAATTCGC ACCAAGCCTT TCTAGTGTTT TTGAACGGAC TGATACTAGG GGTGCACACT CGGCCGAGAG AACTTGTTCG GAACCTCCGA AAGATGCGAC GTAGAGGGCT GGCGGGAGAG TTTGTCTCCG TCTATCTCCA CGACGAACAG TGCGCGGTTC ACATCGCCAC TGATGGTGGT AGAGTTTGCC GGCCACTTTT GATCGTGGAC GAAGAAACTG GCCTACCCAA ACTACAACAA ATTCATATGG AATCGTTGGC ATTGGGTACG ATGGACATCA CCGACTTGAT GCGGCAAGGA ATCGTTGAAT ATGTAGACTG CAACGAAGAA AATAACACGC TGATCGCGGT CACAGAACGT GATTTGGAAG TCGCTATCTT GCACGGGTTG GAAACGCGTA AAATGCGTTA TACACATTTG GAGGTAGATC CATTCACCAT TTTGGGTGTC GTTGGGGGGA TCATTCCGGT ATGTATGATT GATACGGTCG AATGATGATG GATCGCACCA TCCAAACGAA TGTCTCACCC GAAGGATTCT TCGCTTCAGT TTCCGCACCA CAACCAGTCT CCGCGAAACA CCTATACTGT TGCTATGGCA AAACAGGCTA TGGGTTCTGT GTCAATGAAC CAATACGAAC GAATGGACGG TCTTATCTAT ACCCTAATTT ATCCACAAAA ACCAATGGTG AAAAGTAGAA CCCTGGATTT GGTAAATTTC GACAATATAC CAGGTGGTCA TAACGCGTGC ATTGCCGTGA TGAGTTACAG TGGCTATGAT ATTGAAGATG CAATCATTCT GAACAAGGCA GCAGTTGATC GAGGGTTTGG GCGTTGTATG GTTCTTCGAA AACACCAAAC AAGTGTTCGG CGCTACGCCA ATGGAACTAT GGACCGGACT TGTGCTCCTC CTGATCCAGG TAGTTTTCCG GATGGCGAAG ACGATAAGCG TTTCGCTCGA TACATGGCAA TTGACAAAGA TGGAATTTGC CGAGTCGGGG AAAAGATTGA AAATGGCTCA GTAATGGTTA ACAAGGAATC CCCTGCAGAC ACCACTAGCA ATATGGCTGG CGTCGATTTC GGATTTAACA GCGGCATGTC GATGGCCCAC TTGAATTACA AACCCTCTGG AGTATCTTAT CGCGGAAGTG CGTCTACCTA TGTCGATAAA GTGCTGATAA CATCAAACGA AAACGAGCAA TTTCTGATCA AAGTTATGCT TCGTCAAGTT CGCCGGCCAG AAATCGGGGA TAAATTTGCT TCACGGCATG GGCAAAAAGG AGTCTGTGGT TTAATTGTAC CGGAAGAAGA CCTTCCTTTC AACGAATTTG GGCACGTCCC CGATCTAATA ATGAACCCAC ATGGCTTTCC TTCTCGGATG ACTGTCGGGA AGCTTTTAGA GCTTTTGGTA GGAAAGGCAG GCCTGTACGA AGGTCGTCAA GGATATTCCT CGGCTTTCGG AGAGGAATTT GGTTCTGCTG ACACTGCACA ATCTGCGTCG GAGGCTTTGC TGAGAAATGG TCTCAACTAT ACCGGTAAAG ATATTCTCTA TAGTGGCACT AACGGCGAGC CATTGGATGC ATACATTTTC TCCGGCCCCG TGTTTTATCA AAAGCTCAAG CACATGGTTT TGGACAAGGC ACATGCGCGG GCCCGGGGTC CTCGCGCCGT GCTAACACGT CAGCCGACAG AAGGTAGATC TCGTGATGGT GGGTTGCGTT TGGGTGAAAT GGAACGCGAT TGTTTAATTG CGTATGGAGT ATCAAACTTG ATCATGGAGC GCCTTATGCA TTCTTCGGAT GCCTTCAGTG CCAACGTTTG CCTCACTTGT GGGCTACTAC AATACGAAGG TTGGTGTCAA TACTGCCGGT CCGGCGAAAA AGTGGCTGAT ATCCGTTTAC CCTATGCTTG TAAGCTTCTA TTCCAGGAGC TCCAATCGAT GAATGTCTTA CCTCGTCTCC GACTGCAAGA TAAATGAATA CGCTGCCAAT TGCCGTCGGG GAAGGTAGTG TTTACTGTTA ATTGTGTTGG TGTTGACCGT GATAAATAAT AGGTGGGCAC AATCGGACAG TAACCCCTTA TTCAAAATTC ACTGTCACGT CATCTGTTTC CTGTCTGTTA TGCCAGCAAA TCGGTTTTTC AAAAAAAGGA ATATCCTAGA CGACAAATAA TTTATGTTTC TTATAAATTT
|
Protein sequence | MEGFDHVHGQ DVIDAVDQHP LGSAGRKLLE DDDHGKGGKQ HIQQPQVHDY GSTKTPDDAP VGSLRDKWRL LPYFLRLRSL LRQHIDSFDH FVDVEMQQIV QSPSACEIRS EHDPKFYLRY EACWVGEPSV EEDSYSVKSA TPFQCRLRDC TYSAPIYVNV RYTRGRQIVV KRKVMIGRMP IMLRSKKCLL RDRSEDDLAG MKECPYDPGG YFVIKGVEKV ILIQEQLSKN RVILEEDNKG IMASITSSTH ERKSKAYILI KNGRVYLKNN TLGDDIPVAV VLKAMGIESD LELVQLVGSE SPIINALALS LEEPSRLGIH TQAQALRFIG AKIRGRSGPS GPSSSYRKNV SPEDEAREVL ANVVLSHVPV THFDFREKAL YIGHIVRRVL LVHLGKMPLD DKDYYGNKRL ELAGNLLSLL FEDLFKIFNK DLKRQADLVL SKPNRTQAFD VVKTIRPDTI TNGMINAIAT GNWVLKRFRM DRAGVTQVLS RLSYMSALGM MTRINSQFEK TRKVSGPRSL QPSQWGMLCP ADTPEGEACG LVKNLALLGH ITTDEADTRP IERICRDLGV EDVKRMTGHE INSHQAFLVF LNGLILGVHT RPRELVRNLR KMRRRGLAGE FVSVYLHDEQ CAVHIATDGG RVCRPLLIVD EETGLPKLQQ IHMESLALGT MDITDLMRQG IVEYVDCNEE NNTLIAVTER DLEVAILHGL ETRKMRYTHL EVDPFTILGV VGGIIPFPHH NQSPRNTYTV AMAKQAMGSV SMNQYERMDG LIYTLIYPQK PMVKSRTLDL VNFDNIPGGH NACIAVMSYS GYDIEDAIIL NKAAVDRGFG RCMVLRKHQT SVRRYANGTM DRTCAPPDPG SFPDGEDDKR FARYMAIDKD GICRVGEKIE NGSVMVNKES PADTTSNMAG VDFGFNSGMS MAHLNYKPSG VSYRGSASTY VDKVLITSNE NEQFLIKVML RQVRRPEIGD KFASRHGQKG VCGLIVPEED LPFNEFGHVP DLIMNPHGFP SRMTVGKLLE LLVGKAGLYE GRQGYSSAFG EEFGSADTAQ SASEALLRNG LNYTGKDILY SGTNGEPLDA YIFSGPVFYQ KLKHMVLDKA HARARGPRAV LTRQPTEGRS RDGGLRLGEM ERDCLIAYGV SNLIMERLMH SSDAFSANVC LTCGLLQYEG WCQYCRSGEK VADIRLPYAC KLLFQELQSM NVLPRLRLQD K
|
| |