Gene Plav_3231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_3231 
SymbolclpX 
ID5455069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3449102 
End bp3450367 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content60% 
IMG OID640878821 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_001414493 
Protein GI154253669 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.911182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGG TGAGCGGCGG CGACTCCAAG AACACGCTCT ACTGTTCCTT TTGCGGCAAA 
AGCCAGCATG AGGTGCGGAA GCTGATCGCG GGACCGACCG TCTTTATCTG TGACGAATGC
GTCGAACTCT GCATGGACAT CATCCGCGAG GAGAACAAGA GCTCGCTGGT GAAGTCGCGT
GACGGTGTTC CCTCGCCGCA GGAAATCTGT GGAGTGTTGG ACGATTACGT GATTGGGCAG
CAGCACGCGA AGCGCGTGCT GTCGGTCGCC GTTCACAACC ACTACAAGCG CCTCAACCAC
GCTGCAAAAA ACAACGACGT CGAACTTGCC AAGTCCAACA TCCTGCTGAT CGGCCCGACC
GGTTGCGGCA AGACACTGCT CGCGCAGACG CTTGCCCGTA TCCTGGACGT TCCCTTTACG
ATGGCGGATG CGACGACGCT GACGGAAGCC GGCTATGTCG GTGAGGACGT TGAAAACATC
ATTCTGAAGC TGCTGCAATC GGCCGACTAC AATGTCGAGC GCGCGCAGCG TGGCATCGTT
TATATCGACG AAGTCGACAA GATCAGCCGC AAGTCCGATA ACCCGTCCAT CACGCGTGAC
GTGTCAGGCG AGGGTGTCCA GCAGGCGCTG TTGAAGATCA TGGAAGGCAC TGTGGCTTCC
GTGCCGCCGC AGGGCGGACG AAAGCATCCG CAGCAGGAAT TCCTGCAGGT CGACACGACG
AACATCCTGT TCATCTGCGG CGGCGCCTTT GCGGGCCTTG AAAAGATCAT CGGTCAGCGC
GGCAAGGGCG CAGGCATCGG TTTCGGTGCA AAAGTGCAGT CGGTCGAAGA CCGGCGTACC
GGCGACATTC TGAAGGACCT GGAGCCGGAA GATCTGTTGA AATTCGGCCT CATCCCCGAA
TTCGTTGGCC GTATGCCGGT GCTCGCGACG CTGGAAGATC TCGATGAGGA AGCGCTGCTG
ACCATTCTCA CCCAGCCCAA AAACGCGCTG GTAAAGCAGT ATGAGCGCCT TTTCGAGATG
GAAAATGTGC GGCTCACCTT CTCCGAGGAA GCGCTTCGCG CAGTCTCGCG CAAGGCAATC
GAGCGCAAAA CGGGTGCCCG CGGCCTCCGC TCGATCCTCG AATCGATCCT GCTCGACACG
ATGTTCGAGC TGCCGACGCT CGAAGGGGTC GAGGAAGTGG TCATCAGCGC CGAAGTGGTG
GAGGGCAAGG CTCGCCCGCT GTATATTTAT GCGGAGCGCC AGGGCGACGT CGGGACCGGC
GCCTGA
 
Protein sequence
MSKVSGGDSK NTLYCSFCGK SQHEVRKLIA GPTVFICDEC VELCMDIIRE ENKSSLVKSR 
DGVPSPQEIC GVLDDYVIGQ QHAKRVLSVA VHNHYKRLNH AAKNNDVELA KSNILLIGPT
GCGKTLLAQT LARILDVPFT MADATTLTEA GYVGEDVENI ILKLLQSADY NVERAQRGIV
YIDEVDKISR KSDNPSITRD VSGEGVQQAL LKIMEGTVAS VPPQGGRKHP QQEFLQVDTT
NILFICGGAF AGLEKIIGQR GKGAGIGFGA KVQSVEDRRT GDILKDLEPE DLLKFGLIPE
FVGRMPVLAT LEDLDEEALL TILTQPKNAL VKQYERLFEM ENVRLTFSEE ALRAVSRKAI
ERKTGARGLR SILESILLDT MFELPTLEGV EEVVISAEVV EGKARPLYIY AERQGDVGTG
A