Gene Saro_2634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2634 
SymbolhslU 
ID3917067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2870737 
End bp2872038 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content65% 
IMG OID640445411 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_497904 
Protein GI87200647 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.206684 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACT CCCTTACCCC CAAGGCCATC GTCCGCGCGC TGGACGAACA TATCGTCGGC 
CAGACCGCCG CGAAGAAGGC CGTGGCCGTG GCACTGCGCA ATCGTTGGCG CCGTCAGCGG
CTGTCCGCCG ACCTGCGCGA CGAGGTTTCC CCCAAGAACA TCCTGATGAT CGGGCCCACC
GGCTGCGGCA AGACCGAGAT CAGCCGCCGC CTTGCCAAGC TTGCCGATGC GCCGTTCGTG
AAGGTCGAGG CGACGAAGTT CACCGAAGTC GGCTATGTCG GCCGCGACGT TGAGCAGATC
GCGCGCGACC TCGTGGAAGA GGCGATCCGG CTGGAGAAGG AGCGCCGCCG CGACGCGGTG
CGGGAAGCCG CCAGCAAGGC GGCGATGGAC CGGCTGCTCA AGGCGCTGGT CGGCGATGGC
GCAAGCGAGG CGACGCGGGA AAGCTTCAAG GCGCGGCTTT CGGACGGCTC GATGAACGAC
GTCGAAGTGG AAATCGAGGT CGAGGATGCG CCATCGATGC CGATGGAAAT ACCGGGCATG
GGCGGTGGGA TCGGCATGAT CAACCTCAGC GACATGATGG GCAAGGCTTT CGGCAAGCAG
AACCTCAAGC GTCGCAAGAT GCGCGTGGTC GATGCCTGGG ACAAGCTGGT CGACGAGGAA
GCCGAAAAGC GCATGGACCA GGACGATGTC GCGCGCGAGG CGATCCGCAA CGCCGAGACC
AACGGCATCG TCTTCCTTGA CGAGATCGAC AAGATCGCAG TTTCCGACGT GCGCGGCGGT
TCGGTGAGCC GCGAGGGCGT GCAGCGCGAT CTCCTGCCGC TGATCGAGGG CACGACGGTC
GCCACCAAGT ACGGCCCGAT GAAGACCGAC CACGTGCTGT TCATCGCGAG CGGGGCGTTC
CATGTCGCCA AGCCTTCGGA CATGCTGCCC GAACTCCAGG GGCGCCTGCC GATCCGGGTC
GAGCTGAATG CGCTGTCGGA AGACGATTTC GTGCGCATCC TGTCGGAAAC GCGGGCCAAT
CTCGTCGAGC AATACCGCGC GCTGATCGCG ACCGAGAACG TTACGCTGGA CATCACCCCC
GCAGCGATCC GCGCGATTGC CCGCACCGCC GCGCAGGTCA ACGAAAGCGT CGAGAACATC
GGCGCACGGC GCTTGCAGAC GGTGATGGAA AAGCTGCTGG AGGAAGTGAG CTTCGACGCC
GAGGATCGCG CGGGCGAGAC CGTCATGGTG GACGAGGCCT ACGTGGCCGA CAAGCTGGCC
AACCTTGCCG GCAACGCGGA TCTTTCGAAG TACATCCTGT GA
 
Protein sequence
MNDSLTPKAI VRALDEHIVG QTAAKKAVAV ALRNRWRRQR LSADLRDEVS PKNILMIGPT 
GCGKTEISRR LAKLADAPFV KVEATKFTEV GYVGRDVEQI ARDLVEEAIR LEKERRRDAV
REAASKAAMD RLLKALVGDG ASEATRESFK ARLSDGSMND VEVEIEVEDA PSMPMEIPGM
GGGIGMINLS DMMGKAFGKQ NLKRRKMRVV DAWDKLVDEE AEKRMDQDDV AREAIRNAET
NGIVFLDEID KIAVSDVRGG SVSREGVQRD LLPLIEGTTV ATKYGPMKTD HVLFIASGAF
HVAKPSDMLP ELQGRLPIRV ELNALSEDDF VRILSETRAN LVEQYRALIA TENVTLDITP
AAIRAIARTA AQVNESVENI GARRLQTVME KLLEEVSFDA EDRAGETVMV DEAYVADKLA
NLAGNADLSK YIL