Gene Amuc_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1050 
Symbol 
ID6274060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1252607 
End bp1253977 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content57% 
IMG OID642613101 
ProductATP-dependent Clp protease, ATP-binding subunit ClpX 
Protein accessionYP_001877657 
Protein GI187735545 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0052122 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGTT CCACCCCTCA TCTTCCTGCC TGTTCCTGCT GCGGCAAGCC CGGGAACAAG 
GTGGACAAAC TCATCCAGAT TGCGGAAGAC TTTTACATCT GCAACAATTG CGTTGAAATT
TGCGTCAACA TGATTGTGAA GGACACAGGC CTTCCGATGG CGACCCGCTT CATCCGCGGC
ATCCTGAACA TGGAACCCTC CGCTTACGCC ATGTGCCAGG CGGAGGCCCG CAAAGCGGCA
GCCGCGGACA TGCTCAGGGA AACGGCGGCC GGCCCGGCCT CCTATGAAGG CCCGCTGCCC
ACACCGGAAG AAATGTGCGC TACGCTCAAC CAGTATGTCA TCGGTCAGGA CTACGCCAAA
AAAGTGCTTT CCGTAGCCGT GTACAACCAC TACATGCGCC TGCGCCAAAG TGCTGTCATG
CTGGACGACA AGTCCCTGGA CGATGTGGAA ATTGAAAAAT CCAACATCCT GCTGGCCGGC
CCCACCGGCT CAGGAAAAAC CCTGCTGGCG AAAACGCTGG CGAAAATGCT CAACGTCCCA
TTCTGCATTG TGGACGCCAC CACGCTGACG GAAGCCGGTT ACGTAGGGGA AGATGTGGAA
AACATCATCC TGCGCCTGCT CCAGGCTGCC AACTTTGACG TAGCGAAAGC GGAACAAGGC
ATCATCTACG TGGATGAAAT CGACAAAATC GGACGCAAAA CACAGAATGT CTCCGTCACG
AGAGACGTCT CCGGGGAAGG CGTGCAGCAG GCTCTGCTGA AAATCATTGA AGGCACCATC
TGCAATGTTC CTCCCACCGG AGGCCGCAAG CACCCGCAAC AGGAATACAT CCGCGTCAAT
ACGGAAAAAA TCCTCTTCAT TGTGGGCGGC GCTTTCGTCG GGCTGGAAGA CATCATCCGC
AAACGCCTCG GCGCCACCCA GATGGGATTC GGAGCCATCA CGGAACAACG CGACCGCAAG
GAATACTCGG AAGAGGAAAT ACTGGCACAG GCCATGCCGG AAGACCTCTT CTCCTTCGGC
ATGATTCCGG AATTCGTGGG ACGCCTGCCC ATCTTCTGTC CGCTCTCCAA GCTGGATGAA
AGCCAGCTCG TCCGCCTTCT TACGGAACCC AAAAACGCCC TGGTCAAGCA ATATTCCAAA
CTGCTCGCCA TGTACGGCGC CAAACTGGAC GTGCTGCCGG ACGCCCTGAA AGCCATGGCC
GCCGAAGCCA TGAAACGCGG CACGGGAGCC CGCGCTCTGC GTTCCATCTT TGAAACCCTC
ATGCTGGACG TCATGTACAA AGTGCCCAGC ATGAAAAATG CGGACACCGT TACCATTACC
AGGGAAACGG TTACCGGCAA CAAGCCGGCC CAAATCCACC AGTCCTCCTA A
 
Protein sequence
MSGSTPHLPA CSCCGKPGNK VDKLIQIAED FYICNNCVEI CVNMIVKDTG LPMATRFIRG 
ILNMEPSAYA MCQAEARKAA AADMLRETAA GPASYEGPLP TPEEMCATLN QYVIGQDYAK
KVLSVAVYNH YMRLRQSAVM LDDKSLDDVE IEKSNILLAG PTGSGKTLLA KTLAKMLNVP
FCIVDATTLT EAGYVGEDVE NIILRLLQAA NFDVAKAEQG IIYVDEIDKI GRKTQNVSVT
RDVSGEGVQQ ALLKIIEGTI CNVPPTGGRK HPQQEYIRVN TEKILFIVGG AFVGLEDIIR
KRLGATQMGF GAITEQRDRK EYSEEEILAQ AMPEDLFSFG MIPEFVGRLP IFCPLSKLDE
SQLVRLLTEP KNALVKQYSK LLAMYGAKLD VLPDALKAMA AEAMKRGTGA RALRSIFETL
MLDVMYKVPS MKNADTVTIT RETVTGNKPA QIHQSS