Gene Tpen_0043 details

Gene Information

Locus tag: Tpen_0043
Symbol:
ID: 4600578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 31935
End bp: 33326
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 64%
IMG OID: 639772796
Product: peptidase U62, modulator of DNA gyrase
Protein accession: YP_919456
Protein GI: 119718961
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
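
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 31935
END_BP = 33326
GENE_LENGTH_BP = 1392
PROTEIN_LENGTH_AA = 463


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of translated codons, assuming one untranslated terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1392
print(coding_codons(GENE_LENGTH_BP))   # 463
```

Under those assumptions both results agree with the Gene Length and Protein Length fields.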


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACGAAG ACGACGTGAA GCTGGTACTC GGGAAAGCGG GTTCTCTGGG CGTACAGTTC 
GCCGAGGTCC GGTTCGAGGA GAGCACCCGC GAGATCATCT CCTACGTTAA CGGCAGGATA
GCTTCTATGA GCAACCAGCG CCTGAGGGGA GTCGGCGTAA GGGTGCTCTA CAACGGCAAC
TTCGGCTTCG CATCCACCTT CGACTTGAGC AGGGATGGGT TGCTCTCGGC GCTCGAAGAG
GCCTTCAAGG CGGCTAGGGC TCTGGGGCCC GGCAACAAGA GGCTAGCGGA GGTGGAGGCC
TCCAAGAGGC GGTACAGCAT AGAGGGCGTG AAGAGGCACC CGGCGCGCGC AGAGCTCGCG
GAGAAGCTCG ACCTCGTCAA GAGGGCTTAC GAGACCCTAA AGTCCGGGGG CGCCTCCTCG
GCTGTCGTGA GGTACGGGGC GTACTACGGG AGGGTGGAGG TCTATACGAG CGAGGGGCTG
GAGGTGTCCT CGGAGAGGCT CGTTACGGGC GTCTCGGCTA CCGCCGTGCT TAGGGAGAAC
GGCAGGGTGG GCGACGGGAC CGAGACTTAC GGAGCGTCGA AGGGCCTCGA GGCGTTCACG
GGGGACCGCG CGCCCGAGGC TATAGCCGAG AAGGCACTCG AGGTCGCTAG GGCCGCGCTG
AACGCGTCTA GGCCTCCCGC CGGGCTACAG ACCGTCATCA CCCGCCCCGA GCTCACGGGG
GTGTTCGCCC ACGAGAGCTT CGGGCACTTG ACGGAAGGCG ACGGGGTGTT CGCCGGGTCC
AGCCCGCTGG TCGGGAGGCT GGGAGAGGTA CTCGCAAGCG AGCAGGTAAC AATAGTCGAC
TCCGGGTTCG ACGAGAGGGG AGGCTACGTG TTGCTCGCCG ACGACGAGGG GGTTCCCACG
GAGCGCACGA TCCTAGTGGA GAAGGGGGTG CTCAAGGGCT ACCTGCACAG CAGGGAGAGC
GCGGCACTCA CGGGGATGAA GCCTACAGGC AACGGGCGCG CGCAGAGCTT TGCCCACGAC
GTGATCGTCC GCATGCGCAA CACGTTCTTC GAGGCCGGCG ACTGGACCGA GGAAGAGATA
ATCAGGGAGA CTAGGCACGG CATACTGCTG GACAAGCCGG CGGGCGGGCA GGTGGAGGAG
GACGGCACGT TCACGTTCAA CGCCAGGATA GGGTATATAG TGGAGAACGG GGAGCTGAAG
CAGCCAGTCA GGGACGTCGT ACTGGCGGGG AACATACTCG AGATGCTGAA GTACGTAGAT
GCCGCCGGTA AAAACGTAGA AATATCCACG AGCCCCTTCG GCGGTTGCGG CAAGTGGGGA
CAGATGGTCC ACGTAGGCGA CGGTGGACCG ACGCTCAGAG TTTCCCGCTT GCTCGTGGGT
GGTGAGAGAT GA
 
Protein sequence
MHEDDVKLVL GKAGSLGVQF AEVRFEESTR EIISYVNGRI ASMSNQRLRG VGVRVLYNGN 
FGFASTFDLS RDGLLSALEE AFKAARALGP GNKRLAEVEA SKRRYSIEGV KRHPARAELA
EKLDLVKRAY ETLKSGGASS AVVRYGAYYG RVEVYTSEGL EVSSERLVTG VSATAVLREN
GRVGDGTETY GASKGLEAFT GDRAPEAIAE KALEVARAAL NASRPPAGLQ TVITRPELTG
VFAHESFGHL TEGDGVFAGS SPLVGRLGEV LASEQVTIVD SGFDERGGYV LLADDEGVPT
ERTILVEKGV LKGYLHSRES AALTGMKPTG NGRAQSFAHD VIVRMRNTFF EAGDWTEEEI
IRETRHGILL DKPAGGQVEE DGTFTFNARI GYIVENGELK QPVRDVVLAG NILEMLKYVD
AAGKNVEIST SPFGGCGKWG QMVHVGDGGP TLRVSRLLVG GER
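
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last blocks of the gene. This is a minimal sketch, not the database's own method: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is the standard one, whose sense-codon assignments are shared by translation table 11 (the two tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Standard codon table in NCBI "TCAG" order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGCACGAAG ACGACGTGAA GCTGGTACTC"))  # MHEDDVKLVL
print(translate("GGTGAGAGAT GA"))                     # GER
```

The two outputs match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.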