Gene Aasi_0084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0084 
Symbol 
ID6376277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp99025 
End bp100392 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content35% 
IMG OID642681277 
Productpeptidase S14 ClpP 
Protein accessionYP_001957262 
Protein GI189501545 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0740] Protease subunit of ATP-dependent Clp proteases 
TIGRFAM ID[TIGR00493] ATP-dependent Clp protease, proteolytic subunit ClpP 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATCA AAAATCCCTT ATTTAGTATC ATAGTAGCAG CGCTAGTATT TACAAACAAT 
CATGCACAAA CGGCTCAAGT ACAATTAACG GCTTTTGAAG CAAATAAGCG TACCAATAAG
AATACTTCAA AGTCAACTCA ACAAACTAAT CAAAAGCAAA GTGTAAATAA TCAAGAACTT
TCATCATCTC AAGAACAGGA TGAAGATAAC GATAAAGAGT TTAAGCAGAA GATGAAAGAA
CAAAGAAAGC TGGAGATAGA AACTGCTCTT ATAAGAATTC GGCTAGAACG CGAATTAGCT
GAGATACGTG CAGAAATTGA AAGAATTAGG GTACAAAGAG AAGCAGAATC ATTAAAATGG
GAATTAGAAC ACGAAAAAAA TACTAAAGAG TACGAAAAGC AAATACTTGA GCTCAATCGA
CAGCGTGATA AAATCATGGC TGAGGTAAGT CTCTCACAGG CTAAATTGAC CCAGGCTATG
GATAAATTTA ATGCTACTTA TGTAGAAATA CAAAATCAAG TATTATTGCT GAGAACAAAC
ACAGAACAAG TACGTACAGA AATTGAAGAA AAAAAGGCAA AAAAGGAACG ATCAGAATAT
GCAGATGGTA ATCCTGAATA TTTACAAGAC CCACTACTTC CAGATGGCAC TTTAGTTCTC
TCAGATCGTG CTGTATCATT AGATGGTGTG GTTACTTCCT GGAAGGCTAA TTATATTACA
GATCGCATTA AGTATTTTAA TAATAAAGAT AAAACCAAAC CTATTTTTAT TGTTATTGAA
AACTCTCCTG GTGGTAGCGC ACTAGCTGGC CTCCATATTG TCCAAGCTAT GCAAAACAGC
CAAGCGCCCG TCTATGTGGT GCTGAAAACA TTTGCCGCAT CTATGGCTGC ACTGATTACT
ACGCTAGCTA AAAAGTCTTA TGCTTACCCA AATGCTATCC TCTTACACCA CCAACCTTGG
AGTTTTACTG GTGGTAATTT AAGAGAATTA AAGGAAGAAA TAGAATTTAT GAAAGAACTA
TGGAAACGTT TAGGCGGCCA AGTTGCAAAA AAGATGGGTA TCTCGCTCGA TAAATTAGAC
AAGCAACTAT ATGAGAAGGC AAGTAGAGGA GACTGGACAG AATTTGCAGA TAATGCTAAA
AAAATTAAGT GGGTAGATCA TGTAATAACC AATATTAATG ATACTGCTAT ACAGGAAATG
CCTAACTCTA CGAACTATAC ATGGCATAAA TACATGAAAG ACTATTTTGA TATGGCAGAA
ACATCAGTAG ACAATAGTAA TAACATATAT TTACCAGTAT TAGGACCTAA AGATTTCTAC
TATCTATATA ATCCAGATAA CACATACCAA TTGCGTTCAA ATAAATAA
 
Protein sequence
MQIKNPLFSI IVAALVFTNN HAQTAQVQLT AFEANKRTNK NTSKSTQQTN QKQSVNNQEL 
SSSQEQDEDN DKEFKQKMKE QRKLEIETAL IRIRLERELA EIRAEIERIR VQREAESLKW
ELEHEKNTKE YEKQILELNR QRDKIMAEVS LSQAKLTQAM DKFNATYVEI QNQVLLLRTN
TEQVRTEIEE KKAKKERSEY ADGNPEYLQD PLLPDGTLVL SDRAVSLDGV VTSWKANYIT
DRIKYFNNKD KTKPIFIVIE NSPGGSALAG LHIVQAMQNS QAPVYVVLKT FAASMAALIT
TLAKKSYAYP NAILLHHQPW SFTGGNLREL KEEIEFMKEL WKRLGGQVAK KMGISLDKLD
KQLYEKASRG DWTEFADNAK KIKWVDHVIT NINDTAIQEM PNSTNYTWHK YMKDYFDMAE
TSVDNSNNIY LPVLGPKDFY YLYNPDNTYQ LRSNK