Gene Ent638_4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4042 
SymbolhslU 
ID5110796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4396222 
End bp4397553 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content52% 
IMG OID640494267 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_001178748 
Protein GI146313674 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA TGACCCCACG CGAAATTGTC AGCGAGCTGA ACAAACACAT TATCGGCCAG 
GACAATGCCA AGCGTTCCGT GGCAATCGCC CTGCGTAACC GCTGGCGTCG TATGCAGCTT
GATGAAGAGC TGCGCCACGA AGTTACCCCA AAAAATATTC TGATGATCGG CCCGACCGGC
GTCGGTAAAA CCGAAATCGC CCGTCGTCTG GCGAAGCTGG CAAACGCACC GTTCATCAAA
GTTGAAGCCA CCAAGTTCAC TGAAGTGGGC TACGTCGGTA AAGAAGTGGA CTCGATTATC
CGCGATCTGG CCGATTCAGC GATGAAAATG GTGCGCGTAC AGGCCATCGA GAAAAACCGT
TATCGTGCTG AAGAAATGGC CGAAGAACGT ATTCTCGACG TGCTGATCCC ACCGGCAAAA
AACAACTGGG GTCAGAATGA ACAACCTCAG GAGCCGTCCG CCGCGCGTCA GGCATTCCGC
AAAAAACTGC GTGAAGGCCA ACTGGACGAC AAAGAGATTG AAATTGATCT GGCTGCGGCA
CCAATGGGTG TAGAAATTAT GTCTCCTCCG GGTATGGAAG AGATGACAAG CCAGCTGCAG
TCGATGTTCC AGAACCTGGG CGGTCAGAAG CAAAAGCCGC GCAAGCTGAA AATCAAAGAC
GCAATGAAGC TGCTGATTGA AGAAGAAGCG GCCAAACTGG TGAATCCAGA AGAGTTGAAA
CAAGACGCAA TCGACGCGGT TGAGCAACAC GGCATCGTGT TTATCGATGA GATCGACAAA
ATCTGTAAGC GCGGTGAATC TAACGGTCCG GACGTGTCTC GTGAAGGCGT TCAGCGTGAC
CTGTTGCCGC TGGTCGAGGG CTGTACCGTT TCGACCAAGC ACGGCATGGT CAAAACCGAT
CACATTCTGT TTATTGCTTC TGGCGCGTTC CAGGTGGCTA AGCCTTCCGA TCTCATTCCT
GAATTGCAGG GCCGTCTGCC AATTCGCGTG GAGCTGCAGG CGCTGACCAC CGACGATTTC
GAACGCATTC TGACTGAGCC AAATGCCTCG ATCACCGTGC AGTACAAAGC GCTGATGGCC
ACCGAAGGTG TGACCATTGA GTTCACCGCA GACGGTATCA AGCGTATCGC TCAGGCCGCA
TGGCAGGTTA ACGAAACCAC CGAAAACATC GGAGCGCGTC GTTTGCACAC CGTGCTGGAA
CGTCTGGTCG AAGATATCTC TTATGAAGCA AGCGATCTGA ACGGCCAAAG TATTACCATT
GACGCAGATT ATGTGAGTAA ACATCTGGAT GCGTTAGTGG CAGATGAAGA TCTAAGCCGT
TTTATCTTAT AA
 
Protein sequence
MSEMTPREIV SELNKHIIGQ DNAKRSVAIA LRNRWRRMQL DEELRHEVTP KNILMIGPTG 
VGKTEIARRL AKLANAPFIK VEATKFTEVG YVGKEVDSII RDLADSAMKM VRVQAIEKNR
YRAEEMAEER ILDVLIPPAK NNWGQNEQPQ EPSAARQAFR KKLREGQLDD KEIEIDLAAA
PMGVEIMSPP GMEEMTSQLQ SMFQNLGGQK QKPRKLKIKD AMKLLIEEEA AKLVNPEELK
QDAIDAVEQH GIVFIDEIDK ICKRGESNGP DVSREGVQRD LLPLVEGCTV STKHGMVKTD
HILFIASGAF QVAKPSDLIP ELQGRLPIRV ELQALTTDDF ERILTEPNAS ITVQYKALMA
TEGVTIEFTA DGIKRIAQAA WQVNETTENI GARRLHTVLE RLVEDISYEA SDLNGQSITI
DADYVSKHLD ALVADEDLSR FIL