Gene Tpen_0044 details

Gene Information

Locus tag: Tpen_0044
Symbol:
ID: 4600465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 33323
End bp: 34651
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 64%
IMG OID: 639772797
Product: peptidase U62, modulator of DNA gyrase
Protein accession: YP_919457
Protein GI: 119718962
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
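
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the CDS ends in a stop codon that is not counted as a residue. Both readings are assumptions, since the record does not state its coordinate convention. A minimal Python check under those assumptions:

```python
# Values copied from the record above.
start_bp = 33323
end_bp = 34651

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1329 442
```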


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGGAG AGATACCGTA CTTGGCGGAG TGGCTCGTAA AACGCGCCCT CGAGCTAGGG 
GCGGACGAGG CGGAGGCTAG CTTCTCGGTG TACAGGGAGA GGAGCGTCAA GTCGGAGGGC
GCCTACCCCA AGGCGCTCTC GCGGGCATCC GTGGACGTCT GGATACGCGT GGCCAAGGGC
AAGAGGGTCG CCATCGTCAC GGCGACCTCC CTGGACAGGG ACGTCCTCTC CGAAATCCTC
GAGAGGGGGG TGGCGATGGC GAGGGTAGCC GAGGAGGACC CACACTGGGG AGGGCTACCC
GACCCCGAGG GGCCGACGCA CGGCTGGGTG GGCTTCGACG AGGGGGTAGC GACTTTGGGC
GCGGATTGGC TTATCGCCGC GGTTAGAGAG CTGGCGGAGG AGGTGCGGGG AAGATTCCCG
GACGTGAAGG TCACCGGGTC GGGGGCGAGC GCGGCCTCCG GCTACAGCTA CGTCTACAAC
AGCAGGGGGC TGAGAGCGGA GGACAGGGGG ACGTCGATGG GCGTGTACCT CTCCACGAAG
GCTTCGCACG CGGGCGGAGA GGGGACGGGC TTCGCGTTCA TCCACTCGAG GAGCATGGTC
GCGGAGCTGG AGGGGCTAGC GGAGAGAGCC AGCAGGCTGG CGCTCGACGC GGCTAGGGGT
GAGAAGCTGG GATCCTCGGT AACCGGGAAC GTGCTCTTCA AGCCCTACCC CTTGGCGGAG
CTCCTGGACT ACCTGCTCGT ACCCGCGCTC AACGCTATGA ACGTCCTCGA GGGGCTCAGC
CCGCTGAGAG ACAAGGTTGG CGAAAAGGTC CTCGGAGAGA TGACGCTCCT AGACGACGGA
ACCCTCCCCG GAGGCATTGA GACCTCGCTC TTCGACGCCG AGGGTGTGCC GCGCAGGCGG
ACCTTGCTGG TCGAAAGAGG CGTGCTTAGG GGTTACCTGC ACAACACGTA CACCGCCAGG
AGGATGTCGA CGAAGAGCAC CGGTAACGCC GGTAGGGCTA GGGGCTCGTA CACGGTGTCG
CGCTCGAACA TGGTAGTGGA GGGAGGAGAC GAGTCCGAGG AGGAGCTAGC GCGAGACGCC
GCCGTCGTTG TGGACGGAAG CCTCCTAAGC GTTCACACGG TGAACTACGT CACGGGTAAC
TTCAGCGTTG TAGCCACGAA CCCATACCTC GTCAAGAACG GGGAACTTAA ACCCCTCAAG
CCCGTAACGA TAGCCGGGAA CATATACCAG TCGGCCCCCA CGTTGAGGTT CTCCAGAACC
CCCAGGAATA CCTACACCGG CTTCTACCTG CCCGAAACGC TCGTAGGAAA AGTCACGGTA
TCAGGCTAG
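
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any statistic such as the GC content field can be recomputed. The sketch below shows one way to do that in Python. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the sequence is used for illustration, so the printed value is not expected to equal the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGATCGGAG AGATACCGTA CTTGGCGGAG TGGCTCGTAA AACGCGCCCT CGAGCTAGGG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` in place of `first_row` gives the value to compare against the GC content field.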
 
Protein sequence
MIGEIPYLAE WLVKRALELG ADEAEASFSV YRERSVKSEG AYPKALSRAS VDVWIRVAKG 
KRVAIVTATS LDRDVLSEIL ERGVAMARVA EEDPHWGGLP DPEGPTHGWV GFDEGVATLG
ADWLIAAVRE LAEEVRGRFP DVKVTGSGAS AASGYSYVYN SRGLRAEDRG TSMGVYLSTK
ASHAGGEGTG FAFIHSRSMV AELEGLAERA SRLALDAARG EKLGSSVTGN VLFKPYPLAE
LLDYLLVPAL NAMNVLEGLS PLRDKVGEKV LGEMTLLDDG TLPGGIETSL FDAEGVPRRR
TLLVERGVLR GYLHNTYTAR RMSTKSTGNA GRARGSYTVS RSNMVVEGGD ESEEELARDA
AVVVDGSLLS VHTVNYVTGN FSVVATNPYL VKNGELKPLK PVTIAGNIYQ SAPTLRFSRT
PRNTYTGFYL PETLVGKVTV SG
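
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal Python illustration, not the database's own method. It relies on the fact that translation table 11 assigns sense codons the same amino acids as the standard code. It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. The function name `translate` and the use of "X" for unrecognized codons are choices made for this example.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGATCGGAG AGATACCGTA CTTGGCGGAG"))  # prints: MIGEIPYLAE
print(translate("TCAGGCTAG"))                         # prints: SG
```

The two printed fragments match the first ten and last two residues of the protein sequence above.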