Gene Saro_3054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3054 
SymbolclpX 
ID3916667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3269818 
End bp3271074 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content62% 
IMG OID640445835 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_498323 
Protein GI87201066 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0723606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAAGC TTTCCGGATC GGATACCAAG AGCACCCTCT ACTGCAGCTT CTGCGGGAAA 
TCGCAGCACG AGGTGCGCAA GCTGATCGCC GGGCCGACCG TGTTCATCTG CGATGAATGC
GTCGAACTGT GCAACGACAT CATCCGTGAA GAGACCAAGG CCGGCATCGC CGGGAAGAAG
GACGGTGGCG TACCCACGCC GCGCGACATC TTCGAGACTC TGAACGATTA TGTGATCGGC
CAGGACCGCG CCAAGCGCGT GCTCTCGGTT GCAGTCCACA ACCACTACAA GCGCCTCAAG
CACAGCGGCA AGGGCGGCGA CGTCGAACTG TCGAAGTCGA ACATCCTGCT CGTCGGTCCG
ACCGGCTCGG GCAAGACCCT GCTTGCGCAG ACGCTGGCCA AGACCTTCGA CGTGCCGTTC
ACGATGGCGG ATGCGACGAC GCTGACCGAA GCCGGCTACG TGGGCGAGGA CGTCGAGAAC
ATCATTCTCA AGCTGCTCCA GGCCTCGGAC TACAACGTCG AGAAGGCCCA GCACGGCATC
GTCTACATCG ACGAGATCGA CAAGATCAGC CGCAAGGCGG AAAATCCGTC GATCACGCGC
GACGTGTCGG GCGAAGGCGT GCAGCAGGCC CTGCTCAAGC TGATGGAAGG GACGACCGCC
TCGGTTCCGC CGCAGGGCGG GCGCAAGCAT CCGCAGCAGG AATTCCTCCA GGTCGACACC
ACGAACATCC TGTTCATCTG TGGCGGCGCC TTCGCGGGCC TCGAAAAGAT CATCGCCGAC
CGCCTGCAGA AGCGTTCGAT CGGTTTTGGC GCGCATGTTG CCGATCCCGA CAAGCGCAAG
GTCGGCGAAC TGCTCCAGAA GGCAGAGCCC GAGGATCTCC TCAAGTTCGG CCTGATCCCG
GAATTCGTCG GCCGTCTCCC GGTGATCGCG ACGCTCAACG ACCTCGACAT CGAGGCGCTG
GTCAAGATCC TCAAGGAGCC CAAGAACGCA CTGGTAAAGC AGTACGCCAA GCTCTTCGAG
CTTGAAGACG TCACCCTGAC CTTCACCGAC GACGCTCTCG AGGCGATCGC CAAGAAGGCC
ATTGAACGCA AGACCGGCGC GCGCGGTCTG CGTTCGATCG TCGAAGGCCT TCTGCTCGAC
ACCATGTTCG ACGTGCCGAC CGAAAGCGAC ATTGCCGAAA TCGTCGTCGA CAAGGATGTC
GTCGAAGGTC GCAAGGAACC CGTCCGTGTC CTCAAGGGCA AGGAAGAAGC GGCCTGA
 
Protein sequence
MTKLSGSDTK STLYCSFCGK SQHEVRKLIA GPTVFICDEC VELCNDIIRE ETKAGIAGKK 
DGGVPTPRDI FETLNDYVIG QDRAKRVLSV AVHNHYKRLK HSGKGGDVEL SKSNILLVGP
TGSGKTLLAQ TLAKTFDVPF TMADATTLTE AGYVGEDVEN IILKLLQASD YNVEKAQHGI
VYIDEIDKIS RKAENPSITR DVSGEGVQQA LLKLMEGTTA SVPPQGGRKH PQQEFLQVDT
TNILFICGGA FAGLEKIIAD RLQKRSIGFG AHVADPDKRK VGELLQKAEP EDLLKFGLIP
EFVGRLPVIA TLNDLDIEAL VKILKEPKNA LVKQYAKLFE LEDVTLTFTD DALEAIAKKA
IERKTGARGL RSIVEGLLLD TMFDVPTESD IAEIVVDKDV VEGRKEPVRV LKGKEEAA