Gene Tery_0584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0584 
SymbolclpX 
ID4244610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp936923 
End bp938272 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content42% 
IMG OID638105888 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_720501 
Protein GI113474440 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAAAT ACGACTCCCA TCTAAAATGT TCATTCTGTG GCAAGTCTCA AGAGCAGGTT 
AGGAAATTGA TAGCTGGACC TGGAGTTTAT ATATGTGATG AATGTGTAGA GTTGTGCAAT
GAGATTTTGG ATGAGGAGCT TTTTGACTCC AATGCTACAG GAGCACAACC ACCAATACCA
CGTCCAGCAC CAGCACCCCA AAAACGAGGG ACTGGTACTA AGAGATTATC TATTAGTCAA
ATACCTAAGC CTAGGGAAAT AAAGAATTAT CTGGATGCTC ATGTTATTGG TCAGGAGGAA
GGTAAGAAGG TTTTATCAGT GGCAGTTTAT AACCACTATA AACGTCTGAG TTTTCTAGAG
GCCAAAAAAA GTGGTAAGTC CTCTCAAGAT GAGGTGGAAT TACAAAAGTC TAATATTTTG
TTGATTGGGC CCACAGGTTG TGGAAAAACG TTGTTGGCTC AAACTTTGGC GGATTTATTG
GATGTGCCTT TTGCCGTGGC AGATGCGACG ACTTTAACTG AAGCTGGATA TGTTGGGGAG
GACGTGGAGA ATATTTTGCT ACGACTTTTA CAAGTAGCAG ATTTAGAAGT GGATGAAGCA
CAACGGGGAA TTATATATAT TGATGAGATT GATAAAATAG CTCGTAAGAG TGAGAACCCT
TCTATAACAA GAGATGTTTC TGGGGAGGGT GTGCAGCAAG CCTTATTAAA GATGTTGGAG
GGAACTGTTG CTAATGTTCC TCCACAAGGT GGTCGGAAAC ATCCCTATCA AGATTGTATT
CAGATCGATA CGAGTAATAT TTTATTTATC TGTGGTGGTG CTTTTGTTGG TTTAGAAAAG
ATAGTAGATC AAAGAATTGG TAAAAAGTCA ATGGGCTTTA TTCACCAGAG TGGGGACAGT
TATCAGGTTA AGGAGAAAAA AGTTGTAGAT TTAATGAAGC AAATGGAACC AAATGATTTG
GTGAAGTTTG GTTTGATCCC AGAATTGATT GGGCGAATAC CTATGGTGGC TGTCGTTGAA
CCTCTCGATG AGGAGACTCT GATGGCAATT TTGACGAAAC CTCAGAATGC TCTGGTGAAG
CAGTATCAAA AGCTGTTACG GATGGATAAT GTGAAGTTGG AGTTTGAGGA GGATGCTGTA
CGGGCGATCG CGAAGGAAGC ATTTAGGAGA AAGACTGGGG CGCGAGCTTT GCGGGGTATT
GTTGAGGAGT TGATGTTGGA TGTGATGTAT GAGCTACCAT CACGGAAGGA TGTGAGTCGT
TGCACTATTA CTAAGGAAAT GGTGGAAAAG CGATCAACTG CAGAGTTGTT ATTGCATCCT
TCGTCTTTGC CTAAACCGGA GTCAGCTTAA
 
Protein sequence
MSKYDSHLKC SFCGKSQEQV RKLIAGPGVY ICDECVELCN EILDEELFDS NATGAQPPIP 
RPAPAPQKRG TGTKRLSISQ IPKPREIKNY LDAHVIGQEE GKKVLSVAVY NHYKRLSFLE
AKKSGKSSQD EVELQKSNIL LIGPTGCGKT LLAQTLADLL DVPFAVADAT TLTEAGYVGE
DVENILLRLL QVADLEVDEA QRGIIYIDEI DKIARKSENP SITRDVSGEG VQQALLKMLE
GTVANVPPQG GRKHPYQDCI QIDTSNILFI CGGAFVGLEK IVDQRIGKKS MGFIHQSGDS
YQVKEKKVVD LMKQMEPNDL VKFGLIPELI GRIPMVAVVE PLDEETLMAI LTKPQNALVK
QYQKLLRMDN VKLEFEEDAV RAIAKEAFRR KTGARALRGI VEELMLDVMY ELPSRKDVSR
CTITKEMVEK RSTAELLLHP SSLPKPESA