Gene Ava_3603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3603 
SymbolclpX 
ID3679295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4493491 
End bp4494831 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content47% 
IMG OID637718954 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_324104 
Protein GI75909808 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.348345 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAAGT ACGACTCCCA TTTAAAATGT TCTTTCTGTG GCAAGTCTCA AGAGCAGGTG 
CGGAAATTGA TCGCAGGGCC GGGAGTCTAC ATCTGCGATG AATGTGTAGA CTTGTGTAAT
GAAATACTAG ATGAGGAGTT GCTAGATACT AGCGGTGCAG CGGCGCAACC AGCACCAAAA
TCAGAACCCC CTCAGAAACG CCGCGCCCGT TCTTCTAATC TCTCCCTGAG CCAAATACCC
AAACCTAGAG AGATTAAAAA GTACCTAGAC GAACACGTTA TCGGTCAAGA TGAAGCCAAG
AAAGTTTTAT CTGTAGCTGT TTACAACCAC TACAAACGCT TGGCCATATT GCAGTCTAAG
GGTAGTAGCA AAAATGGCGC TGATGATGCC GTAGAACTGC AAAAGTCCAA CATTCTCTTA
ATCGGCCCCA CCGGTTGCGG CAAAACTCTC CTAGCCCAAA CCCTAGCCAA AATTCTCGAT
GTCCCCTTTG CTGTGGCTGA TGCGACAACG CTCACAGAAG CAGGGTATGT AGGGGAAGAT
GTAGAAAATA TCTTACTGCG ATTATTACAA GTAGCTGATT TGGATGTAGA AGAAGCCCAG
CGCGGCATCA TCTACATCGA TGAAATTGAT AAAATTGCCC GCAAGAGTGA AAACCCCTCC
ATCACCAGGG ACGTTTCCGG CGAAGGTGTA CAACAAGCTT TGTTAAAAAT GCTAGAAGGT
ACAATCGCCA ATGTTCCCCC CCAAGGAGGA CGGAAACACC CTTACCAAGA CTGCATCCAA
ATTGATACCA GCAATATTTT ATTTATCTGT GGTGGAGCCT TCGTTGGTTT AGAGAAGGTT
GTAGACCAGA GAGGGGGCAA AAAGTCAATA GGCTTTGTGC AGCCTGGAGA AGGTCAATCT
AAAGAAAAAC GGGCAGCAGA TGTTCTACGC CACCTGGAAC CAGATGACCT GGTGAAATTT
GGCATGATTC CCGAATTTAT CGGCCGGGTA CCAATGGTAG CCGTAGTAGA TCCTCTAGAT
GAAGAAGCCT TGATGGCGAT TCTTACTCAA CCACGCAGCG CCCTAGTCAA GCAGTACCAA
AAACTGCTGA AGATGGACAA CGTCCAATTA GACTTTAAAC CAGATGCCCT CAAGGCGATC
GCGCAAGAAG CCTACCGCCG GAAAACTGGC GCGAGAGCAT TACGGGGTAT TGTGGAAGAA
CTAATGCTAG ATGTGATGTA CGAGTTACCA TCCCGTAAAG ATGTGACGCG ATGCACAGTC
ACCAGGGAAA TGGTAGAGAA GCGCTCAACA GCAGAATTGC TAGTACACCC GTCTTCATTG
CCTAAACCAG AATCAGCTTA G
 
Protein sequence
MSKYDSHLKC SFCGKSQEQV RKLIAGPGVY ICDECVDLCN EILDEELLDT SGAAAQPAPK 
SEPPQKRRAR SSNLSLSQIP KPREIKKYLD EHVIGQDEAK KVLSVAVYNH YKRLAILQSK
GSSKNGADDA VELQKSNILL IGPTGCGKTL LAQTLAKILD VPFAVADATT LTEAGYVGED
VENILLRLLQ VADLDVEEAQ RGIIYIDEID KIARKSENPS ITRDVSGEGV QQALLKMLEG
TIANVPPQGG RKHPYQDCIQ IDTSNILFIC GGAFVGLEKV VDQRGGKKSI GFVQPGEGQS
KEKRAADVLR HLEPDDLVKF GMIPEFIGRV PMVAVVDPLD EEALMAILTQ PRSALVKQYQ
KLLKMDNVQL DFKPDALKAI AQEAYRRKTG ARALRGIVEE LMLDVMYELP SRKDVTRCTV
TREMVEKRST AELLVHPSSL PKPESA