Gene Tpen_1168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1168 
Symbol 
ID4601182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1108637 
End bp1110541 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content64% 
IMG OID639773944 
Producttransglutaminase domain-containing protein 
Protein accessionYP_920569 
Protein GI119720074 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGAGAA GGCTTTCCGC GGTTGTGCTC CTCCTCTTGG TGGTTTTCTC GGTAGCTGAG 
AGCTCCGGGT CTCCGGGGGT CGAGGTTACA GGCTTAACAG TCAAGGTCGG GACTCTAGTA
GGCGGCGTGT TTGTCGAGAG GCCTAACAGG ACTGTGAACG CCGGGGAGAA GGTCGCCGTC
AAGGTCGTAG TGGAGGTGCG CGGGGATGCG GGCACGAGGT ACGAGGGGAG CCTCAAGCTG
TCCGTAGCCA ACCCCCTCGG AGTAGTCGTG GGCGAAGACT CCAGGAACTG GTCCTCCACC
CTCCGGGGCG CTGGCGAGGA GTGGGAGTTC ACGTTCGTCT ACGAGTTTCC CCCCTCCGCC
CTGAGGGGGT ACTACAACGT CGACGTCTCG GTGAAGGTTT TGAACAGCGT TTCCCGCGGG
AGGGTCTCCT TCTTCTACAG GGGGCTCGTC GATAGGAGGA ACGTCGTGAA CGTTACCTAC
GTGCTCGTGC TAGAGGGCAC GGGGGAAGTC GGGGAGCTGA GGGTAGCGCT CCCCCAGGCC
GACTCCATGA CCTTCGCCGC CGGCCCGGTA GTGTCGCCTA GGCCTTCGAG GGTAGAGAAG
GACGAGCTCG GCAACGTGTA CGCGGTTTAC GAGAAGGTAG CCGAGGGGGC GTTCAGGAAG
GAGTTCAAAG TCAGCTTCGT AGGGGTGCAG GAAGTCAGTC TGGTGAGCGC CGACGCGCCT
ATAGACTCTC TCAGAAGCCT CCCGCCCGGG CTCGAGGAGT TCCTGCGCCC GTCCCCGTAC
ATAGAGAGCG ATAGCCCGGA GATAGTCGAG GTTGCCAGGA GGCTTTCGTC GGGCGTCTCC
ACGGTGCGCC AGCTAGCGTC GAGGATAGCG GACTACGTCT CCTCGACTCT CAGGTACAAC
GACGCCCTCA GGAGCATAAG GGACTCCTGG AGCCTAGGAG CCCTCTGGGC TCTCCACGCG
AAGCAGGGCA TGTGCCTCCA GTTCGCCCGG CTGTACGTCG CCATAGCGCG TGCCGCGGGG
CTACCAGCCA GGGTCGTCGA GGGGCTCGTG GTTACGCCGC CCGGAGGCTC CTCCTCGTAC
CTCCACGCGT ACGCCGAGTT CTACCTGCCG GGCTACGGGT GGGTGCCCGT AGAGCCGCAA
CTCCCCGGGA GGTACGTGGG GCTCGTCCCT CCTGTCCCCG GGTACGTCCC GCTGGTCAAG
GGGCTTGGGG AGGAGAGGGC GGGCTCCCGG GACTCTGTGA GCACGCAGTT TACCTACTCG
TACCGCTTGT ACCCCTACGA GGGGTTGTCG GGGAGCGTGA AGCTATCAGT GAAGTACCCG
AGAGAGCTAC TCTACGGCGA CCTGATCAAG GTCAACGTCA GCGTGGAGCC TAGGGACGCG
GTATCGGAGG TCGCCGTCAC CGCCCCTAAC GGCTCGCGCT ACGAGTACAG GCTCGTAGGC
CCCGGGAGCG TAGTCCTCGC GGCGAGCGAC GCGGGCAACT GGACCGTCGA GGTGTTCTCG
ATGAGGCAGG GCTACCTGCC AGCCTACACC GTTGCAGTCG TACCGGTAAG GCCGCGTCCC
ATCAAGCTTT CCGTCGAGGT CGAAGGACTA CCCCTCCTCG GCAGGCCAGC TTTCATAGTC
AGAGTGAGCC CCCCTGTACC CGGCATAACC GTAGCTGTGA ACTCCTCCAA CTGCCTCTAC
TACGAGGCTC GGACCCTGGA GACGAACTCC TCCGGCGTCG CCGTCTACGA GCCCCCGGTA
CTCCTGTGCC CGGCTACCAT CGAGTTCCGG GCGAGGGGGA GGGGATACAC GGAGGCGGTA
GAGGTTTACC AGTACGACTA CTCGCGCCTA GTCCCCCTAT ACGCGCTAAT CCTGCTCGTC
GCAGTCCTGC TCGTAGCCGT GCGTGCAATA CGCAAAAAGC ATTAA
 
Protein sequence
MARRLSAVVL LLLVVFSVAE SSGSPGVEVT GLTVKVGTLV GGVFVERPNR TVNAGEKVAV 
KVVVEVRGDA GTRYEGSLKL SVANPLGVVV GEDSRNWSST LRGAGEEWEF TFVYEFPPSA
LRGYYNVDVS VKVLNSVSRG RVSFFYRGLV DRRNVVNVTY VLVLEGTGEV GELRVALPQA
DSMTFAAGPV VSPRPSRVEK DELGNVYAVY EKVAEGAFRK EFKVSFVGVQ EVSLVSADAP
IDSLRSLPPG LEEFLRPSPY IESDSPEIVE VARRLSSGVS TVRQLASRIA DYVSSTLRYN
DALRSIRDSW SLGALWALHA KQGMCLQFAR LYVAIARAAG LPARVVEGLV VTPPGGSSSY
LHAYAEFYLP GYGWVPVEPQ LPGRYVGLVP PVPGYVPLVK GLGEERAGSR DSVSTQFTYS
YRLYPYEGLS GSVKLSVKYP RELLYGDLIK VNVSVEPRDA VSEVAVTAPN GSRYEYRLVG
PGSVVLAASD AGNWTVEVFS MRQGYLPAYT VAVVPVRPRP IKLSVEVEGL PLLGRPAFIV
RVSPPVPGIT VAVNSSNCLY YEARTLETNS SGVAVYEPPV LLCPATIEFR ARGRGYTEAV
EVYQYDYSRL VPLYALILLV AVLLVAVRAI RKKH