Gene Sala_1595 details

Gene Information

Locus tag: Sala_1595
Symbol: hslU
ID: 4083032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1670905
End bp: 1672206
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 66%
IMG OID: 638009964
Product: ATP-dependent protease ATP-binding subunit HslU
Protein accession: YP_616641
Protein GI: 103487080
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit
TIGRFAM ID: [TIGR00390] ATP-dependent protease HslVU, ATPase subunit
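
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1670905
end_bp = 1672206
gene_length_bp = 1302
protein_length_aa = 433

# Assumption: coordinates are 1-based and inclusive,
# so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so protein length = number of codons - 1.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```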


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.472132
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.884744
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAAG ATTTGACCCC GAAGGCGATC GTCGCCGCAC TGGACACGCA TATCATCGGC 
CAGGATGCGG CGAAGCGCGC GGTCGCGGTG GCGCTGCGCA ACCGCTGGCG CCGCCAGCAG
CTGCCCGCCG AGCTGCGCGA CGAGGTGACG CCGAAGAATA TCCTGATGAT CGGCCCCACC
GGCTGCGGCA AGACCGAGAT TTCGCGCCGC CTGGCCAAGC TCGCCGATGC GCCCTTCATC
AAGGTCGAGG CGACCAAGTT CACCGAGGTC GGCTATGTCG GCCGCGACGT CGAGCAGATC
GCGCGCGACC TCGTCGAAGA GGCGGTGCGG CTGGAAAAGG ACCGCCGCCG CGACGCCGTG
CGCGCCGCCG CCGAAGAGGC CGCGATGGAA CGGCTGCTCG ACGCGCTCAC CGGCAAGGGC
GCGAGCGAGG CGACGCGCCA GAGCTTTCGC CAACGCATCC GCGAAGGCCA TCTCGACGAC
AGCGAGGTGG AGATCGAGGT CGCCGACGCG CCCGGCATGA GTTTCGAACT GCCCGGCCAG
CCGGGGCAGA TGAGCATGAT CAACCTGTCC GACATGCTCG GCAAGGCGAT GGGCGGCCTG
CCCAGGAAAC GCCGCAAGAT GAAGGTGATC GACGCCGCGA CGCGGTTGAT CGAGGAGGAG
CAGGACAAAA GGCTCGACCA GGACGATGTC GCCCGCGTCG CGCTCGCCGA TGCCGAGGCC
AACGGCATCG TCTTCCTCGA CGAGATCGAC AAGATCGCGG TCAGCGACGT GCGCGGCGGA
TCGGTCAGTC GCGAGGGGGT GCAGCGCGAC CTCTTGCCGC TCATCGAGGG CACGACCGTC
GCGACCAAAT ATGGCCCGAT GAAGACCGAT CATATCCTCT TCATCGCGTC GGGCGCCTTT
CACGTCGCCA AGCCCAGCGA CCTGCTCCCC GAACTCCAGG GCCGCCTGCC GATCCGCGTC
GAACTCGGTG CGCTCACCGA GGAGGATTTC GTCCGCATCC TGAGCGAGAC GAAGGCGGGG
CTGCCCGAAC AATATGTCGC GCTGCTCGGC ACCGAGGGCG TGACGCTGAA CTTCGCCCCC
GACGCGATCG CGCGCGTCGC GAAACTCGCC GCCGAAGTGA ACGAAAAGGT CGAGAATATC
GGCGCGCGCC GACTTCAGAC GATCATGGAA CGGCTGGTCG AGGAAATCAG CTTCACCGCC
GAGGATGCTC CCGGCGCGAC GATCGACATC GACGCCGCCT ATGTCGACCG CCAGCTTGCC
GATGTCGTGG GCGACACCGA TCTCAGCAAA TATGTGCTTT AG
 
Protein sequence
MNKDLTPKAI VAALDTHIIG QDAAKRAVAV ALRNRWRRQQ LPAELRDEVT PKNILMIGPT 
GCGKTEISRR LAKLADAPFI KVEATKFTEV GYVGRDVEQI ARDLVEEAVR LEKDRRRDAV
RAAAEEAAME RLLDALTGKG ASEATRQSFR QRIREGHLDD SEVEIEVADA PGMSFELPGQ
PGQMSMINLS DMLGKAMGGL PRKRRKMKVI DAATRLIEEE QDKRLDQDDV ARVALADAEA
NGIVFLDEID KIAVSDVRGG SVSREGVQRD LLPLIEGTTV ATKYGPMKTD HILFIASGAF
HVAKPSDLLP ELQGRLPIRV ELGALTEEDF VRILSETKAG LPEQYVALLG TEGVTLNFAP
DAIARVAKLA AEVNEKVENI GARRLQTIME RLVEEISFTA EDAPGATIDI DAAYVDRQLA
DVVGDTDLSK YVL
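
The sequences in this record are printed in blocks of 10 residues, 60 per line. A minimal sketch for turning such text back into one contiguous string, and for computing the GC fraction of a nucleotide sequence, might look like the following. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the last line of the gene sequence is used here as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence in this record, used as a small sample.
sample = "GATGTCGTGG GCGACACCGA TCTCAGCAAA TATGTGCTTT AG"
seq = join_blocks(sample)

print(len(seq))             # 42
print(seq.endswith("TAG"))  # True: the sequence ends with a TAG stop codon
print(gc_fraction(seq))     # GC fraction of this sample line only
```

Applied to the full gene sequence, `join_blocks` should give a string whose length can be compared with the Gene Length field, and `gc_fraction` a value that can be compared with the reported GC content.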