Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1168 |
Symbol | |
ID | 4601182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1108637 |
End bp | 1110541 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639773944 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_920569 |
Protein GI | 119720074 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGAGAA GGCTTTCCGC GGTTGTGCTC CTCCTCTTGG TGGTTTTCTC GGTAGCTGAG AGCTCCGGGT CTCCGGGGGT CGAGGTTACA GGCTTAACAG TCAAGGTCGG GACTCTAGTA GGCGGCGTGT TTGTCGAGAG GCCTAACAGG ACTGTGAACG CCGGGGAGAA GGTCGCCGTC AAGGTCGTAG TGGAGGTGCG CGGGGATGCG GGCACGAGGT ACGAGGGGAG CCTCAAGCTG TCCGTAGCCA ACCCCCTCGG AGTAGTCGTG GGCGAAGACT CCAGGAACTG GTCCTCCACC CTCCGGGGCG CTGGCGAGGA GTGGGAGTTC ACGTTCGTCT ACGAGTTTCC CCCCTCCGCC CTGAGGGGGT ACTACAACGT CGACGTCTCG GTGAAGGTTT TGAACAGCGT TTCCCGCGGG AGGGTCTCCT TCTTCTACAG GGGGCTCGTC GATAGGAGGA ACGTCGTGAA CGTTACCTAC GTGCTCGTGC TAGAGGGCAC GGGGGAAGTC GGGGAGCTGA GGGTAGCGCT CCCCCAGGCC GACTCCATGA CCTTCGCCGC CGGCCCGGTA GTGTCGCCTA GGCCTTCGAG GGTAGAGAAG GACGAGCTCG GCAACGTGTA CGCGGTTTAC GAGAAGGTAG CCGAGGGGGC GTTCAGGAAG GAGTTCAAAG TCAGCTTCGT AGGGGTGCAG GAAGTCAGTC TGGTGAGCGC CGACGCGCCT ATAGACTCTC TCAGAAGCCT CCCGCCCGGG CTCGAGGAGT TCCTGCGCCC GTCCCCGTAC ATAGAGAGCG ATAGCCCGGA GATAGTCGAG GTTGCCAGGA GGCTTTCGTC GGGCGTCTCC ACGGTGCGCC AGCTAGCGTC GAGGATAGCG GACTACGTCT CCTCGACTCT CAGGTACAAC GACGCCCTCA GGAGCATAAG GGACTCCTGG AGCCTAGGAG CCCTCTGGGC TCTCCACGCG AAGCAGGGCA TGTGCCTCCA GTTCGCCCGG CTGTACGTCG CCATAGCGCG TGCCGCGGGG CTACCAGCCA GGGTCGTCGA GGGGCTCGTG GTTACGCCGC CCGGAGGCTC CTCCTCGTAC CTCCACGCGT ACGCCGAGTT CTACCTGCCG GGCTACGGGT GGGTGCCCGT AGAGCCGCAA CTCCCCGGGA GGTACGTGGG GCTCGTCCCT CCTGTCCCCG GGTACGTCCC GCTGGTCAAG GGGCTTGGGG AGGAGAGGGC GGGCTCCCGG GACTCTGTGA GCACGCAGTT TACCTACTCG TACCGCTTGT ACCCCTACGA GGGGTTGTCG GGGAGCGTGA AGCTATCAGT GAAGTACCCG AGAGAGCTAC TCTACGGCGA CCTGATCAAG GTCAACGTCA GCGTGGAGCC TAGGGACGCG GTATCGGAGG TCGCCGTCAC CGCCCCTAAC GGCTCGCGCT ACGAGTACAG GCTCGTAGGC CCCGGGAGCG TAGTCCTCGC GGCGAGCGAC GCGGGCAACT GGACCGTCGA GGTGTTCTCG ATGAGGCAGG GCTACCTGCC AGCCTACACC GTTGCAGTCG TACCGGTAAG GCCGCGTCCC ATCAAGCTTT CCGTCGAGGT CGAAGGACTA CCCCTCCTCG GCAGGCCAGC TTTCATAGTC AGAGTGAGCC CCCCTGTACC CGGCATAACC GTAGCTGTGA ACTCCTCCAA CTGCCTCTAC TACGAGGCTC GGACCCTGGA GACGAACTCC TCCGGCGTCG CCGTCTACGA GCCCCCGGTA CTCCTGTGCC CGGCTACCAT CGAGTTCCGG GCGAGGGGGA GGGGATACAC GGAGGCGGTA GAGGTTTACC AGTACGACTA CTCGCGCCTA GTCCCCCTAT ACGCGCTAAT CCTGCTCGTC GCAGTCCTGC TCGTAGCCGT GCGTGCAATA CGCAAAAAGC ATTAA
|
Protein sequence | MARRLSAVVL LLLVVFSVAE SSGSPGVEVT GLTVKVGTLV GGVFVERPNR TVNAGEKVAV KVVVEVRGDA GTRYEGSLKL SVANPLGVVV GEDSRNWSST LRGAGEEWEF TFVYEFPPSA LRGYYNVDVS VKVLNSVSRG RVSFFYRGLV DRRNVVNVTY VLVLEGTGEV GELRVALPQA DSMTFAAGPV VSPRPSRVEK DELGNVYAVY EKVAEGAFRK EFKVSFVGVQ EVSLVSADAP IDSLRSLPPG LEEFLRPSPY IESDSPEIVE VARRLSSGVS TVRQLASRIA DYVSSTLRYN DALRSIRDSW SLGALWALHA KQGMCLQFAR LYVAIARAAG LPARVVEGLV VTPPGGSSSY LHAYAEFYLP GYGWVPVEPQ LPGRYVGLVP PVPGYVPLVK GLGEERAGSR DSVSTQFTYS YRLYPYEGLS GSVKLSVKYP RELLYGDLIK VNVSVEPRDA VSEVAVTAPN GSRYEYRLVG PGSVVLAASD AGNWTVEVFS MRQGYLPAYT VAVVPVRPRP IKLSVEVEGL PLLGRPAFIV RVSPPVPGIT VAVNSSNCLY YEARTLETNS SGVAVYEPPV LLCPATIEFR ARGRGYTEAV EVYQYDYSRL VPLYALILLV AVLLVAVRAI RKKH
|
| |