Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0696 |
Symbol | |
ID | 4602011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 646891 |
End bp | 647841 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639773470 |
Product | ribonuclease Z |
Protein accession | YP_920101 |
Protein GI | 119719606 |
COG category | [R] General function prediction only |
COG ID | [COG1234] Metal-dependent hydrolases of the beta-lactamase superfamily III |
TIGRFAM ID | [TIGR02651] ribonuclease Z |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.275565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATCT CGAGCATTCG CATTATGTTC TGGGGAACCG GAGGAGCGGT GACCCCGAAG AGTGCCGAGC AACCATGCAT AGCGGTGAAG ATGGCCGACA CAGTCTTCCT GTTAGACGTG GGAGAGCGTT GCCAGAAGGC CTTGGAGTGC TTCCACCTAG GAGTAAACTC GCCCCTTTAC ATTTTCGTGA CTCACCTTCA CAGCGACCAC TACAGTGGGT TAATCCCTCT CCTAGAGACG CTGTCCTTGC TCGGAAGAGA GAGGGCTCTT AGCGTTTACG GACCCCCGGG TCTTGGCTCG GTGGTTACCG GTAGGAGGAC TACAGGCTAC CCGGTAAGCC TGACGGAGCT CTACGGGTGG GAAGGAATCC TTCGTTTTCC GGGCCTGCCG CTTACAGTAA GCTACGTGGC GGCGCCGCAC AGCTACGGGG CTTTGTCCTA CATTATCAGG GTTGAAGATA AAGTGAAGCT AGACGAGGAG AAGCTGGAAC AAGAGAAGAT TCCTCCTGCG CTACGCAAGG AGCTTCTCGA AAAGGGAAAA GTGAGGGTAG GCTCTAAGGA GTACAGTCTC AACAGCTTCG TTAAGCAAGT GGTCCGCGGC GTAAAGATCT CGTACACGGG AGACTCGCTT CCTAGCCACA GGTTCGCGTC TAAAGCTATG GAGTCGGATG TCCTTATACA TGATTCAACG CTCCTTAAGA GAGACTGGTA CAGGAAGCCT TACATGGCGC ACTCGACTGT CGAGGACGCG GTTGCGGTGT TTAAGGCGAC GCGGTCGAGG CTTCTCGTCT TGACGCATTT CAGCGCGATG TACAGAAACC CGGCAGTCGA CCTTTCAGAG GCTGTGGGCG AATACCCGAG GGTACTAATA GCCGAGAAGG GGCTAACTAT AGACGTAGCA TTCACCGAGC CACGCGTTTT CAGCGTGCGC ATTAATACCG ATTGCATCTA A
|
Protein sequence | MNISSIRIMF WGTGGAVTPK SAEQPCIAVK MADTVFLLDV GERCQKALEC FHLGVNSPLY IFVTHLHSDH YSGLIPLLET LSLLGRERAL SVYGPPGLGS VVTGRRTTGY PVSLTELYGW EGILRFPGLP LTVSYVAAPH SYGALSYIIR VEDKVKLDEE KLEQEKIPPA LRKELLEKGK VRVGSKEYSL NSFVKQVVRG VKISYTGDSL PSHRFASKAM ESDVLIHDST LLKRDWYRKP YMAHSTVEDA VAVFKATRSR LLVLTHFSAM YRNPAVDLSE AVGEYPRVLI AEKGLTIDVA FTEPRVFSVR INTDCI
|
| |