Gene Amuc_1408 details

Gene Information

Locus tag: Amuc_1408
Symbol:
ID: 6275718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1683318
End bp: 1684970
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 59%
IMG OID: 642613464
Product: chaperonin GroEL
Protein accession: YP_001878012
Protein GI: 187735900
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
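
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it reproduces the listed 1653 bp) and that the final codon is a stop codon, as the TAA at the end of the gene sequence suggests. The function name gene_length_bp is illustrative, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the record.
start_bp, end_bp = 1683318, 1684970

length_bp = gene_length_bp(start_bp, end_bp)
codons = length_bp // 3      # number of complete codons in the CDS
protein_aa = codons - 1      # assumption: the last codon is a stop codon

print(length_bp, protein_aa)  # 1653 550
```

Both results agree with the Gene Length and Protein Length fields.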


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value: 0.870477
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAAC AAATCCAATT TGACGAAACC GCCCGCCAGG CTCTGCTCCG CGGCGTGGAA 
CAGATTGCCA AGGCTGTCAA GAGCACGCTG GGCCCTGCCG GCCGCAACGT AGTGATTGAC
AAAAAATTCG GTTCCCCCCT CATCACCAAG GACGGCGTAA CCGTGGCCAA GGAAATTGAA
CTGGAAGACC CGTTTGAAAA CATGGGCGCC CAGCTTGTCC GGGAAGTATC TTCCAAAACC
AATGACGTGG CCGGCGACGG CACCACTACC GCTACCGTGC TGGCTGAAAG CATTTACCGC
GAAGGCCTGC GCAACGTTAC TGCCGGGGCC AACCCCATCT CCCTCCAGAG GGGCATCATG
AAGGCTGCGG ATTCCGTTGT GGAAGAACTC AAGAAGATCA GCAAGCCTGT CGACTCCAGC
AAGGAAGTGG CCCAGGTCGC TACCGTCTCC GCCAACTGGG ACGCTGAAAT CGGCAACATC
ATCGCGGAAG CCATGGACAA GGTGGGCAAG GACGGCACCA TCACCGTGGA AGAAGCCAAG
GGCATTGAAA CTACGCTGGA CGTGGTGGAA GGCATGCAGT TTGACAAGGG ATACCTGTCC
CCCTACTTCG TGACGAACGC GGAAACGATG GAAGCGGTGC TGGAAAACCC CTACATCCTC
ATCCACGAAA AGAAAATCAA CAACCTGAAG GACTTTCTTC CGCTGCTTGA AAAAGTGGCC
AAGAGCGGCC GTCCCTTCCT GGTAATCGCG GAAGACATTG AAGGCGAAGC CCTCGCCACC
CTGGTAGTCA ACCGTCTGCG CGGCGTGCTG AACATCTGCG CGGTCAAGGC TCCCGGCTTC
GGCGACCGCC GCAAGGCCAT GATGGAAGAC ATCGCCATCC TTACCGGCGG CAAGTGCATC
ACGGAAGACC TGGGCATCAA GCTGGAAAAC GTGGGCATCG AAGACCTCGG CCAGGCCAAG
CGCGTGGTTG TTTCCAAGGA TGAAACCGTT ATCGTGGAAG GTTCCGCCAA ATCTTCCGAT
ATTGAAGCCC GCATTTCCCA GATTCGCCGC CAGATCAAGG ACACCACGTC CGACTACGAC
CGCGAAAAAC TCCAGGAACG CCTGGCCAAG CTGGCCGGCG GTGTGGCCGT CATCCATGTG
GGTGCCGCTA CGGAAACGGA AATGAAGGAA AAGAAGGCCC GTGTGGACGA CGCCCTGCAC
GCTACCCGCG CTGCGGTGGA AGAAGGCATC GTTCCCGGCG GCGGCGTGGC GCTGATTCGC
GCCCAGAAAG CCATTGACAC CCTCAAACTG GAAGGTGATG AAGCAACCGG CGCCCAGATC
GTTTATCGCG CTGTGGAAGC CCCGCTCCGC CAGCTGGCCT GCAATGCCGG CCGCGAAGGA
GCCCTCATCG TCGCCAACGT GAAAGGCATG AAGAATACTG CCGAAGGTTA CAACGTGGCC
ACGGACAAGT ATGAAGACCT GCTTTCCGCC GGCGTGGTGG ATCCGACCAA GGTGACCCGT
TCCGCTCTGC AGAATGCGGC CTCCATCGCC GGCCTGCTGC TTACCACGGA ATGCGTCATT
GCCGACAAGC CCGAGAAGAA GAGCTGCAGC TGCGGCTCCG GAGCTTCCGA CATGGGCGGC
ATGGGAGGAA TGGGCGGCAT GGGCATGATG TAA
 
Protein sequence
MAKQIQFDET ARQALLRGVE QIAKAVKSTL GPAGRNVVID KKFGSPLITK DGVTVAKEIE 
LEDPFENMGA QLVREVSSKT NDVAGDGTTT ATVLAESIYR EGLRNVTAGA NPISLQRGIM
KAADSVVEEL KKISKPVDSS KEVAQVATVS ANWDAEIGNI IAEAMDKVGK DGTITVEEAK
GIETTLDVVE GMQFDKGYLS PYFVTNAETM EAVLENPYIL IHEKKINNLK DFLPLLEKVA
KSGRPFLVIA EDIEGEALAT LVVNRLRGVL NICAVKAPGF GDRRKAMMED IAILTGGKCI
TEDLGIKLEN VGIEDLGQAK RVVVSKDETV IVEGSAKSSD IEARISQIRR QIKDTTSDYD
REKLQERLAK LAGGVAVIHV GAATETEMKE KKARVDDALH ATRAAVEEGI VPGGGVALIR
AQKAIDTLKL EGDEATGAQI VYRAVEAPLR QLACNAGREG ALIVANVKGM KNTAEGYNVA
TDKYEDLLSA GVVDPTKVTR SALQNAASIA GLLLTTECVI ADKPEKKSCS CGSGASDMGG
MGGMGGMGMM
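
The protein sequence can be rederived from the gene sequence by codon-wise translation. The following is a minimal sketch rather than the database's own pipeline: it builds the codon table by hand, using the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits. The names CODON_TABLE and translate are illustrative. The example translates only the first and last sequence blocks shown in this record.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these codon assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence (30 nt, 10 codons).
print(translate("ATGGCTAAAC AAATCCAATT TGACGAAACC"))      # MAKQIQFDET
# Last 33 nt of the gene sequence, ending in the TAA stop codon.
print(translate("ATGGGAGGAA TGGGCGGCAT GGGCATGATG TAA"))  # MGGMGGMGMM
```

Passing the full gene sequence to translate in the same way should yield the 550-residue protein listed in this record.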