Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0009 |
Symbol | |
ID | 4601061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 7764 |
End bp | 9167 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639772762 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_919422 |
Protein GI | 119718927 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.906543 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGGGG GATCCTGCTA CACGAACACG CTGGAGTACT TGAAGCACCC GTTGCCGGCT GAGGTCTCGG AGCTTTTGAG GAGAGGGGAG TTGAGGAAGG CGGAGGGGGA GTTGAGGAGG CTTGCAGAGA CGTCTCCGCA ACCGTTGGCA AAGCGCTTCG AATTCGAGCT TGAAAGGATT AGAAGGATGG GCTACGAGTA CCCCTACACC GTCGAGGAGG CATTCAGGCA GCTCAGGAAG CGCGTAGAGG ATCTCACGCT GGAGGAGTTC GCCTCGCTGA TATCCAGGGG GTGCGTGGAC CACGCGGTTA TAGAGGGGGA GGTAAGAGTG CACAGACGCT TCGAGCCGAA CGCTTTCTGG CTCTGCGGGG ACCTCTCAAG CCGTAGGAAA AGCCTGGGCG ACGAGCGGAG CGACCTGAAC GAGCTGACGT TGAAGTGGAG AGCCGAGAGG GTTCTCAGCG CCGCACGCGA GAAGGGCGGG GGCTACGTCC TACCTTTAAG GTACCGCGTA AGGGCAAGGG TGAGCCTGAA GAGGAGCCCG AGGGAGCTCG GAGAGCCCGT GAGGGTCTGG ATCCCGCTAC CGAGGGTCGA GGGTATTCAC TCGGAGTTCA GGCTTCTCGA CTACAGCGTG AAGCCTGTAC ACGTAGCGCC CCCGGATCAC CCTCAGAGGA CAGCTTACTT CGAGCTTTAC GGGGACACTA GAGAGGTGTG GGTGGAATAC GAGTTCACCT CTAGGGGCTT CCACGTAGAG GTAGACCCCA GGGAGGCGAG CGTCGACCCG GACTCCGAGG TCGCTAGAAG GTACCTGTCC GAGAGGCCGC CGCACATAGC GTTCACCGGC GAGCTCAGGG AGTTCGTGGA CCGGTTGACT AGGGGCGCCG CGTCCCCCTA CGAGAAGGTT CAGCGGATAT GGGGCTGGAT AACCGAGAAC GTCAGGTACA CCTACGCCAA GGACTACATA TTCTACGACA ACATACCGGA GTACGTTTTC CGCGAGAAGA GGGGCGACTG CGGCATGCAG GCCCTGCTCT TCATAACGAT GTGCCGTATC GCCGGGGTAC CCGCCAGGTG GGAGTCAGGC TGGTACATGC ACCCGTTATC GCCCGGCATG CACGACTGGG CGCAATTCTA CCTCGAGCCC TACGGCTGGC TGTACGCGGA TCCAAGCTTC GGGAACAAGA GGAAGGGCGG CGAGTGGCGC AACACCTTCT ACCTGGGGTC AACCGAGGGG TACCGCCTAG CCTCGAACAT AGAGGTGTAC GCAGAGTTCG ACCCGCCTAA ACGCTACATA CGCTCAGACC CCGTGGACAG CCAGCGCGGA GAGGTGGAGA CACCGTCAAG GAACCTGTAC TACGACGAGT GGGACTTCGC TCTGGAGATA GTGTCCGTCG AAAAGCTTGC GTAG
|
Protein sequence | MSGGSCYTNT LEYLKHPLPA EVSELLRRGE LRKAEGELRR LAETSPQPLA KRFEFELERI RRMGYEYPYT VEEAFRQLRK RVEDLTLEEF ASLISRGCVD HAVIEGEVRV HRRFEPNAFW LCGDLSSRRK SLGDERSDLN ELTLKWRAER VLSAAREKGG GYVLPLRYRV RARVSLKRSP RELGEPVRVW IPLPRVEGIH SEFRLLDYSV KPVHVAPPDH PQRTAYFELY GDTREVWVEY EFTSRGFHVE VDPREASVDP DSEVARRYLS ERPPHIAFTG ELREFVDRLT RGAASPYEKV QRIWGWITEN VRYTYAKDYI FYDNIPEYVF REKRGDCGMQ ALLFITMCRI AGVPARWESG WYMHPLSPGM HDWAQFYLEP YGWLYADPSF GNKRKGGEWR NTFYLGSTEG YRLASNIEVY AEFDPPKRYI RSDPVDSQRG EVETPSRNLY YDEWDFALEI VSVEKLA
|
| |