Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49483 |
Symbol | RpoT |
ID | 7195832 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 384999 |
End bp | 389478 |
Gene Length | 4480 bp |
Protein Length | 1188 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | rpot-like RNA polymerase |
Protein accession | XP_002184121 |
Protein GI | 219127811 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.088117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTGCAGTGT ACACTCTTCC TGGCGTAGTC GGTCGAACTG ACAGTGAGAC CCTTCGTTCT CGACCTTACC GAAAGAAAGA TTGGCGAGAC TAGAGCGCTT GCTATCAATC GTATAGTTAG TGTATCAATT GTCCATTTAT AGATCGGTGT GTGCCCGAGC GATACCTTGG CAAAACGCCA AAGAACACAA AACACAGCCA ACACCGCACA CATACAGATC CTGTACAAAA CCATAACAGC ACAAACCCCC TTTGCCGACA ACGATGCAAT CGCACTTATT CCGCCGAAGT CGCCCCTGGG GTCGGCCCTA TCCACTCCAC TGCGGAGCAC GAGCATCTTC TGCACAACCC TGTTTCACTG TCACAATTGC TGCACGCCTA CGATATTCCT CGGTATCCTT TGGAAACATT GGCGGTATGC CTTCAACCCC CTGGCAATCG CGACCTCGGT ACTACCGATC TTACTCGTTG ACATTTCATA CTGGCAACAC CAACTTCAAC AGCAACAACT ACCACCAGGC TTTGGCGGGA GTGCGCAGTT TCCGTTCGAC GGCGGGTGCG TGGGCCCCAA CACGAACCGC CATTCAGGCG CTTTCGACGG ACGCCCTCCA CGACGAACAA AATGACGAGG CAATGTCGGA ATCGGTTGAG GATCTGTTCC GAGCCGAACT CGAACGATAC ACCTTTCATG CCAAGGCGCG TATGATGCCA GGGTACGAAC ACTACAACCC AACGAAATAC GACGACATCG ACAACGATGC CTCGGTATCC GCTTCCCAGG ATCTACAGGT AGAGTTGGAT ACGGGAAACG AATCCCTTTT TGCGAGTAAC AACGAACACG CTCGCAAACT CGGCAGCGAA GTAACGTGGG AGGAAGAACT TGTGGATCTC GAGCAAATCG ACTTGGACGA GGCGGTCCGT TCCCCGGAAG CGGCCAACGC ACATCAACCC AATCCACCTA CATCGAACCA CAAAACCCAA AAATCCGCAG TGGAACTCTT GCTTGCGTTT GACCCCCAGA ATCCGCCCAG TTGTGACGAT TTGGAGGAAC TGCAACTGTG GCTGGAATGC GAGGCTCAGC AAGAATCGAT TACGCGGTAC CAACACGTTA TAGATTCGGC TCGCACGCGG AAGGACTACG CCTCCCTTTC ACTGGTTCAA CGCCATGTCT TGCAATGGTA TCAACCACTC AAGCATGAAA TTGAGTCGTT GCAAAAGGCG TATATATTCA AGGAGCAGGA TCCCGAAGGG GTCGTATTAA CCAAGCGCGC CGCCAACGTG TACGGACCCT TTATCTGTTC GTTGCCACCG GAAAAATTGG CTGTCATATG TGCCCACGAG GCCATCATGA GTGGGCTCTC GAATCCCGGT GTCGATGGCA GGGATGGCAC CGCCTTTACT GCTGTGGCCA GACGGATTGG TGACGCTGTG GAGCAAGAAG TCGTCATTCA ACGGATGCTG CATCGACGCT TCAAAGAGTC ACAAGCACGA CATTCGACGC AAGACCCGAA TAGAATTTTG CCGAAACCGG AGGAAGAAGT TGAAGGGGAT GTTGAGCTCT CCACGGACAA GATTGACGAG GACATGAACG ACGGCAACGC TACAACTCGG TCCAAGTCTA GCACTGGAAC CGAAGACGTG CATAAATGGG CGTACGCCGC ATCACACCTA AAAAATTACT TGGACGAAAT CAGTTACCGC AAACCTTCCG TCAAGAAACG ACTGGTTATC GAATACGCTA TTCGTCGAGC CCGCACCATA TTGGAAAATG ACGAAGCATG GACCGAATCT CAGAAAATTC AACTAGGTGC GGTACTCTTT CAAGTTTTGC TTAACAAAGC GAACGTTCGC ATCGACGGTA AGCAAGAAAA GGCATTTACG TACGAAAAGT TTTGGATCAG TAAGCAAAAA GTCCAATCGA TTGTCTCCAT GAATGATCGG TTGCACAAAA TGCTAGTTTC GGACAAGCTA CAATCGTTTG GAACCACGAC CACACGGCAA AAACCTATGA TTGTGCCGCC CAAACCATGG TCTAGGCCGG ACGAAGGTGG CTACAAGATG CTCAGAACAG AGTTGCTCCG CTATCACGGA TGCAATATGC AAAAGGTACG CAGACTACAT CCATGTCCAT CGTGACACGC TAGAGGGCAG TCTTTCTCAT TTTGCTTTTA ATCTACAGGA AGTCTTGCAG AATGTGGATT TAAGCATTGT TTTCGATGGT TTAAATGCTC TTGGACGAGT ACCATGGAGA ATCAATCAAC GAGTATTGAC TGTTGCGCAA CGCTGCTGGG ACAACAATAT ACCACTAGGT GACATTCCGT CAAAAGATGA CAACCCGTTA CCCGAAGAAC CATTACACCC AAAACGACAG GAACAATTTT GGGAACCGGA AACACCGGGG TACGAGTCAT ACGTCGAGGA ATATCGTCTG TACCGTGAAT CATGGAACAA GTTTAGGAGA GTCAAACAAA GAAACATGGT ACGTGACTTA GGGAGGGAAT ATGGTGACCC TGTATATATT ACGATCTCAT GTAAATTGCT CACAATTGTT GCTCATTGTT TTGAATCCAG GATCTACGCT CGCTGCGATG CTCGGCGATG TTGAAACTGG ACCAAGCCGA AAAGTTTAAA GAATTTGAGG AGATTTTCTT TCCTTACAAT ATGGATTTTC GTGGGCGGGC GTACCCCGTT CCTCCGCACT TGTCGAACGT CGGCTCTGAT TTATGTAGAG GAGTGCTTAA ATTTGCGGAA GCCAAACCGC TCGGGCCGCG TGGCCTCTAC TGGCTCAAGG TGCACTTGGC GAATTTTGCG GGAAAAGACA AAATGTCGTT TGATGATCGT GTAAAGTTTG TTGACGATAA TTTGGAACAA GTACGATTAT CTGCGGAAGA CCCGTTTGCA GGAGAACGTT GGTGGATGAG CCTTGAGGAC CCTTTTCAAG GTCTAGCTAC CTGTTTAGAA ATAATCGATG CAATTGATTG CGGTAATCCC GAGTCGTTTT TGTGCTCACT GCCAGTGCAC ATGGATGGCT CGTGCAACGG CCTCCAACAC TATGCTGCTC TTGGGCGTGA CAGCGTCGGT GGCAAAGCGG TAAATCTGTG CGCGTACGAT GAACCTCAAG ACGTATATGT CGGAGTTATG CACGAAGTAG TTCGTCGTGT CGCTGCAGAA GCCGAGCGAC AGCTCGACTT TGATACATCT GACTTGGAGT CTCTGAGCCG AAAACAGAAG AAGGAATTGT CTTATAACCG CGCAGCAAAA CTTGTGAACG GACTGATAGA CCGAGGAGTC GTGAAACGTA CCGTAATGAC GAGCGTTTAC GGGGTAACCT ATATTGGAGC CCGGCAGCAA ATCCAAGAAA AGATTATTTC AAAGGTGCGT CACGGCAGCG TAGGGTTTTC GGACGTTTCA TTTCTTCAAG AAGCAGCCAT GAATTAACCT TTTTGTTTAC TAATAGCTGG AAGCACAGGG CCACGATGTC GATGAAATGA GCAATGAAAT ATTTCAGGCT TGTGGTTACG TAGCAAGTCT CACCATGGAA GTTATGGGCG ACCTTTTTAC CGGTGCTCGA GAAACCATGA ATTGGCTCAC CACTTGCGCT AGGATGATCA CCCAGCACGG TTTTCCTTTA GCTTGGGTAT CACCTATCGG TTTACCGGCG ATCCAGCCCT ATCGCCAGAA GAAAGCGGCT ACCATTGTTA CACTGCTGCA AACGGTTACA CTCATTAATG AAAGCGACGA CCTGCCTATT CACAAGCAGC GACAAGTATC GGCTTTTCCA CCAAATTTTA TTCATTCGTT GGATTCATCC CATATGCTTT TGACAGCTTT GGAAATGGAT CGTCGTGGGT TGACTTTTTC AGCCGTACAC GACTCTTTCT GGACACACGC ATGCGACATC GACGAAATGA ATGAAGCTCT ACGAGACGTA TTTGTTGATT TATACAGTCA GCCATTGCTG GAGCGACTCA AGCAGACATG GGAAATGCGT TACCCTGAAC TCGAGTTCCC CGACATCCCG AAACGCGGAG ACCTCAATCT TGCCGAGGTC AAACAGGCAA AGTACTTCTT CCAATAGAAA ACGCAAAAGG GATATAACTT GTACAGACCA ATCAGAAGGC TAGAAAAATA CGAATGTTTC CGTATTCAAG TGCTGCTTGA CACAACAGTG CGAGTCTTGA AACGTTGTAT TGATGAACAG CCAACTCCAG AAATTGATGA GTTCATTGGA TCCAATCTCT CATATGCTTC AAAGTCAGAG AAACATTCCC GGAAACCTGC TTCGTTCGGA TTTTATCCGG AAGTGTCTTG TCTATACGCT ACTAGAGGGG TCAAACAATC TTTTGAAAGA CTGTTTACAG TTAATGTCTT CTTGAGAATA GAGAGATGTC GGCAAGTTTC CGCTAATCCA TCTTGTAAAT TGTGGCGGAA CATTCCGAAA CGTTGGAGAA GACATCATGA AAAAGTGCAC TATGTCGTAT
|
Protein sequence | MQSHLFRRSR PWGRPYPLHC GARASSAQPC FTVTIAARLR YSSVSFGNIG GMPSTPWQSR PRYYRSYSLT FHTGNTNFNS NNYHQALAGV RSFRSTAGAW APTRTAIQAL STDALHDEQN DEAMSESVED LFRAELERYT FHAKARMMPG YEHYNPTKYD DIDNDASVSA SQDLQVELDT GNESLFASNN EHARKLGSEV TWEEELVDLE QIDLDEAVRS PEAANAHQPN PPTSNHKTQK SAVELLLAFD PQNPPSCDDL EELQLWLECE AQQESITRYQ HVIDSARTRK DYASLSLVQR HVLQWYQPLK HEIESLQKAY IFKEQDPEGV VLTKRAANVY GPFICSLPPE KLAVICAHEA IMSGLSNPGV DGRDGTAFTA VARRIGDAVE QEVVIQRMLH RRFKESQARH STQDPNRILP KPEEEVEGDV ELSTDKIDED MNDGNATTRS KSSTGTEDVH KWAYAASHLK NYLDEISYRK PSVKKRLVIE YAIRRARTIL ENDEAWTESQ KIQLGAVLFQ VLLNKANVRI DGKQEKAFTY EKFWISKQKV QSIVSMNDRL HKMLVSDKLQ SFGTTTTRQK PMIVPPKPWS RPDEGGYKML RTELLRYHGC NMQKEVLQNV DLSIVFDGLN ALGRVPWRIN QRVLTVAQRC WDNNIPLGDI PSKDDNPLPE EPLHPKRQEQ FWEPETPGYE SYVEEYRLYR ESWNKFRRVK QRNMDLRSLR CSAMLKLDQA EKFKEFEEIF FPYNMDFRGR AYPVPPHLSN VGSDLCRGVL KFAEAKPLGP RGLYWLKVHL ANFAGKDKMS FDDRVKFVDD NLEQVRLSAE DPFAGERWWM SLEDPFQGLA TCLEIIDAID CGNPESFLCS LPVHMDGSCN GLQHYAALGR DSVGGKAVNL CAYDEPQDVY VGVMHEVVRR VAAEAERQLD FDTSDLESLS RKQKKELSYN RAAKLVNGLI DRGVVKRTVM TSVYGVTYIG ARQQIQEKII SKLEAQGHDV DEMSNEIFQA CGYVASLTME VMGDLFTGAR ETMNWLTTCA RMITQHGFPL AWVSPIGLPA IQPYRQKKAA TIVTLLQTVT LINESDDLPI HKQRQVSAFP PNFIHSLDSS HMLLTALEMD RRGLTFSAVH DSFWTHACDI DEMNEALRDV FVDLYSQPLL ERLKQTWEMR YPELEFPDIP KRGDLNLAEV KQAKYFFQ
|
| |