Gene Tpen_0009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0009 
Symbol 
ID4601061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp7764 
End bp9167 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content61% 
IMG OID639772762 
Producttransglutaminase domain-containing protein 
Protein accessionYP_919422 
Protein GI119718927 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.906543 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGGGG GATCCTGCTA CACGAACACG CTGGAGTACT TGAAGCACCC GTTGCCGGCT 
GAGGTCTCGG AGCTTTTGAG GAGAGGGGAG TTGAGGAAGG CGGAGGGGGA GTTGAGGAGG
CTTGCAGAGA CGTCTCCGCA ACCGTTGGCA AAGCGCTTCG AATTCGAGCT TGAAAGGATT
AGAAGGATGG GCTACGAGTA CCCCTACACC GTCGAGGAGG CATTCAGGCA GCTCAGGAAG
CGCGTAGAGG ATCTCACGCT GGAGGAGTTC GCCTCGCTGA TATCCAGGGG GTGCGTGGAC
CACGCGGTTA TAGAGGGGGA GGTAAGAGTG CACAGACGCT TCGAGCCGAA CGCTTTCTGG
CTCTGCGGGG ACCTCTCAAG CCGTAGGAAA AGCCTGGGCG ACGAGCGGAG CGACCTGAAC
GAGCTGACGT TGAAGTGGAG AGCCGAGAGG GTTCTCAGCG CCGCACGCGA GAAGGGCGGG
GGCTACGTCC TACCTTTAAG GTACCGCGTA AGGGCAAGGG TGAGCCTGAA GAGGAGCCCG
AGGGAGCTCG GAGAGCCCGT GAGGGTCTGG ATCCCGCTAC CGAGGGTCGA GGGTATTCAC
TCGGAGTTCA GGCTTCTCGA CTACAGCGTG AAGCCTGTAC ACGTAGCGCC CCCGGATCAC
CCTCAGAGGA CAGCTTACTT CGAGCTTTAC GGGGACACTA GAGAGGTGTG GGTGGAATAC
GAGTTCACCT CTAGGGGCTT CCACGTAGAG GTAGACCCCA GGGAGGCGAG CGTCGACCCG
GACTCCGAGG TCGCTAGAAG GTACCTGTCC GAGAGGCCGC CGCACATAGC GTTCACCGGC
GAGCTCAGGG AGTTCGTGGA CCGGTTGACT AGGGGCGCCG CGTCCCCCTA CGAGAAGGTT
CAGCGGATAT GGGGCTGGAT AACCGAGAAC GTCAGGTACA CCTACGCCAA GGACTACATA
TTCTACGACA ACATACCGGA GTACGTTTTC CGCGAGAAGA GGGGCGACTG CGGCATGCAG
GCCCTGCTCT TCATAACGAT GTGCCGTATC GCCGGGGTAC CCGCCAGGTG GGAGTCAGGC
TGGTACATGC ACCCGTTATC GCCCGGCATG CACGACTGGG CGCAATTCTA CCTCGAGCCC
TACGGCTGGC TGTACGCGGA TCCAAGCTTC GGGAACAAGA GGAAGGGCGG CGAGTGGCGC
AACACCTTCT ACCTGGGGTC AACCGAGGGG TACCGCCTAG CCTCGAACAT AGAGGTGTAC
GCAGAGTTCG ACCCGCCTAA ACGCTACATA CGCTCAGACC CCGTGGACAG CCAGCGCGGA
GAGGTGGAGA CACCGTCAAG GAACCTGTAC TACGACGAGT GGGACTTCGC TCTGGAGATA
GTGTCCGTCG AAAAGCTTGC GTAG
 
Protein sequence
MSGGSCYTNT LEYLKHPLPA EVSELLRRGE LRKAEGELRR LAETSPQPLA KRFEFELERI 
RRMGYEYPYT VEEAFRQLRK RVEDLTLEEF ASLISRGCVD HAVIEGEVRV HRRFEPNAFW
LCGDLSSRRK SLGDERSDLN ELTLKWRAER VLSAAREKGG GYVLPLRYRV RARVSLKRSP
RELGEPVRVW IPLPRVEGIH SEFRLLDYSV KPVHVAPPDH PQRTAYFELY GDTREVWVEY
EFTSRGFHVE VDPREASVDP DSEVARRYLS ERPPHIAFTG ELREFVDRLT RGAASPYEKV
QRIWGWITEN VRYTYAKDYI FYDNIPEYVF REKRGDCGMQ ALLFITMCRI AGVPARWESG
WYMHPLSPGM HDWAQFYLEP YGWLYADPSF GNKRKGGEWR NTFYLGSTEG YRLASNIEVY
AEFDPPKRYI RSDPVDSQRG EVETPSRNLY YDEWDFALEI VSVEKLA