Gene Information |
Locus tag | Tpen_1551 |
Symbol | |
ID | 4600900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1499991 |
End bp | 1501019 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774325 |
Product | transcriptional regulator, TrmB |
Protein accession | YP_920950 |
Protein GI | 119720455 |
COG category | [K] Transcription |
COG ID | [COG1378] Predicted transcriptional regulators |
TIGRFAM ID | |
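The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1499991, 1501019)
print(length)                     # 1029
print(protein_length_aa(length))  # 342
```

Under these assumptions the computed values agree with the Gene Length (1029 bp) and Protein Length (342 aa) fields.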
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCTCAG ACAGCCTAGT CGAGATGTTC CGAGTGCTCG GTGCAGGCAG ACCCGAGGCA GAGATCTACC AGGCGTTACT GAAGTACGGT CCCTCCACTC TTCGCGAGCT TAGCGACAAG GTGGACATGG TTGGATCGCA GCTCCATCAA TACCTCAAGA GGCTGGTAAG GCTTGGGCTG GTAGAGGTCA GCAGGGGCAA GCCCAGCATA TACAGGGCTG TCTCCGTGGA AACGTTTGAC GCCATATACA GGAGCAGGGT CGAGAGCCTC AGGAAGAATG CTCTCGAGAG GCTGAAATCC ATTTCCGCGC AGGCGCACCC AGCCTCGCGG GAGTACGGGG TCTACGTGCT ACGTAGCTGG AGAGCGTTCC GCATTCGGGG GCTCGAGTAC ATACGGTCCG CTAAGTGCGA CGTCATAGTG TGCGGCGACA GCTCGTTCGT GAAGCCGTAC TGGGAGGAGC TGAGAAAGAA GGAGGAAGAA GGGGTTAACG TCTTCGTCAT TCTCTACGAG CTACCAGGCA TACCGGTCAG GGAGGACGAG GTGGTCGTCA GGAAGGCTAG GAAGGCTGTC TCCGGGGATA TGATGGTGGT CGTGGACTCC CGGGTTGCGC TTGTGGCGCA AAGGAGGCTC GGTCCCGGCG AGAGACCCGA GTACGGTCTA TCCGTGGAAG AGCCCGTGCT GATAGACTAC TTGGAGCAGG ATTTCTTCAA CCGCTGGCTG AGGGGTAGCG TTATAAGGGA TGAGCCTGTG CGCCTGCCAT CCTGCTTCAC CGTGAACAGG CTCGCACTCA TCGAGGCACA GAGGCTCCTC TCAGAGGGAA GGAGGCTGAG CCTAACGGCG TACGGGCGGT ACACGGGCTC CCGGGGAGAA GCCATCGTCG AGGGGATAGT GCGGGACGCC GTCGTAGACC AGGCAACGGG CGTGGCGCAC TTCGTGGTGG ACACGCGCGC CGGGAGCATC AGGATAGGGG GGCCCGACGC CGTAGTAGAG GAGTTTGCCG CGTCGAGAAT CGAGTTAAGG GGTGCGTGA
|
Protein sequence | MSSDSLVEMF RVLGAGRPEA EIYQALLKYG PSTLRELSDK VDMVGSQLHQ YLKRLVRLGL VEVSRGKPSI YRAVSVETFD AIYRSRVESL RKNALERLKS ISAQAHPASR EYGVYVLRSW RAFRIRGLEY IRSAKCDVIV CGDSSFVKPY WEELRKKEEE GVNVFVILYE LPGIPVREDE VVVRKARKAV SGDMMVVVDS RVALVAQRRL GPGERPEYGL SVEEPVLIDY LEQDFFNRWL RGSVIRDEPV RLPSCFTVNR LALIEAQRLL SEGRRLSLTA YGRYTGSRGE AIVEGIVRDA VVDQATGVAH FVVDTRAGSI RIGGPDAVVE EFAASRIELR GA
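The relationship between the gene and protein sequences can be illustrated with a short translation. This is a minimal sketch, assuming translation table 11 (which shares the standard codon assignments) and treating an initiating GTG as methionine. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, the start-codon set is only a subset of those the table allows, and only the first 30 bases of the gene sequence are used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
# Subset of table 11 initiation codons; any of these is read as Met at position 0.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


prefix = "GTGAGCTCAG ACAGCCTAGT CGAGATGTTC"
print(translate(prefix))  # MSSDSLVEMF
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would be the natural way to compare against the Protein Length and GC content fields.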