Gene Information |
Locus tag | Tpen_0171 |
Symbol | |
ID | 4601426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 146300 |
End bp | 147322 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639772925 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_919584 |
Protein GI | 119719089 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
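The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminating stop codon. The record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(146300, 147322)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1023 340"
```

Under these assumptions the computed values agree with the Gene Length (1023 bp) and Protein Length (340 aa) fields.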
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTGAGT TCGTGAAGAT CAACGTACGC CTGGTAGCTT CTCTACTGCT ACTGCTCGCC TTATTGGCTC CGGGTTTTTC CCGCGCGACC GGCGAAGCCG CTGTCACAGG CGTGTTTGTC ACAGGCAAGG TGACTTACAG GATTACTGGG GGCTTCCTCC TGAAAAACGA GAACAACGTC TCCGTGAACG ACTACGTATA CGTGGCGCTT CCTCAGAACA CAACTTTCCA GAAGAGCTAC GTGGTCTCAA TAAACCCGAA GCCCCTGCGG TTCGTGGTGG ATGAGGACGG CAATACCTAC GCCGTTGTGC TCGTTAGGGC GGAGCCCGGC CGCAAGTACT GGGTAAACGT GTCCTACGTA GTCGTGGTGT ACAGCTACCA GATAGACGAG TCGGCGTCTA AAGGCGACTG GCCGCCACTA AGCTTCGTAC GCAGGTATAC CGTGTCGTCC GGCTACTGGA ATGTCTATAA CGTGACGCTG ATTAGGCTGG CAAGGGAGGT TGCCTTCGCG CAGACCCCTC TCTCGACGGT GAAGAAGCTG GCGTCCTGGG TCGAGCAAGC GAACAGGGGG AACTACCGGG TCTCTTTCGG AAGAGCCGGG AGCGACCACG CAGTCACGTA CGGGTATAGA GGCTACGCGA TCACCGGGGA CTGCGTCGAG GTAGCGGACG TCTTCGTGAC GATGGCTAGG ATTCTCGGGG TACCCGCCAG GACGGCGTTC GGAATCCTCC TAACGGACAG TGAGCGCATG TGGCTCAACT TCTCCACCAT AGAGGCCGAG GGCGAGAACA TCTTGACGCA CTGGGGTGGG CATATGTGGC CGCAGGTCTA CGTGGAGCCA TATGGGTGGA TAGACGTTGA CATGCTCGAC GGCATGGCGC CTAACGTTGG AGTATACAGT GCGAGGCACA TACTGTTTGG GGTAGAGGAG ACGAAGTACT ACGGTTCCTC CCTCTCCAGT AGCGCGATAC CAAGCTACCT AACGCTGGAG TACGTTGAGT ACTACTTCGG GAGGGGTGAC TAG
|
Protein sequence | MGEFVKINVR LVASLLLLLA LLAPGFSRAT GEAAVTGVFV TGKVTYRITG GFLLKNENNV SVNDYVYVAL PQNTTFQKSY VVSINPKPLR FVVDEDGNTY AVVLVRAEPG RKYWVNVSYV VVVYSYQIDE SASKGDWPPL SFVRRYTVSS GYWNVYNVTL IRLAREVAFA QTPLSTVKKL ASWVEQANRG NYRVSFGRAG SDHAVTYGYR GYAITGDCVE VADVFVTMAR ILGVPARTAF GILLTDSERM WLNFSTIEAE GENILTHWGG HMWPQVYVEP YGWIDVDMLD GMAPNVGVYS ARHILFGVEE TKYYGSSLSS SAIPSYLTLE YVEYYFGRGD
|
| |
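The relationship between the two sequence fields can be sketched as a translation under the genetic code given in the Translation table field (11). This is a minimal, hedged Python sketch, not the tool the database uses. It assumes the Gene sequence field is already written 5' to 3' in the coding orientation, which the GTG start and TAG stop suggest even though the Strand field is "-". The start-codon set is a simplification (only ATG, GTG and TTG are treated as initiators), and the names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical.

```python
# Codon-to-residue map in the conventional TCAG ordering.
# Table 11 assigns the same residues as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}

# Simplified initiator set (assumption): these are read as Met in first position.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still initiate with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the Gene sequence field above.
print(translate("GTGGGTGAGT TCGTGAAGAT CAACGTACGC"))  # prints "MGEFVKINVR"
```

The output matches the first ten residues of the Protein sequence field. Passing the full Gene sequence value to `translate` gives the same kind of check for the whole record.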