Gene Amuc_2002 details

Gene Information

Locus tag: Amuc_2002
Symbol:
ID: 6275653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2433034
End bp: 2434875
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 59%
IMG OID: 642614062
Product: heat shock protein 90
Protein accession: YP_001878594
Protein GI: 187736482
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0326] Molecular chaperone, HSP90 family
TIGRFAM ID:
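
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length, not something the record states) and that the final codon of this unspliced CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2433034, 2434875
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 1842 613
```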


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.240397
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACAG AAACACATCA ATTTCAGGCG GAAGTCCGGC AGCTGCTGGA CATTGTCATC 
AACGCCCTTT ACAGCGACCG TGAAATCTTT GTCCGCGAAC TTGTCTCCAA CGCTTCCGAC
GCCCTGGAAA AACTGCGCCT GAAGCAGCTG ACGGACTCCA ATATTTACCA GCCGGACAAG
CCCCTGGAAA TCACTGTAGC CACGGATAAG GAAAACAAAA CCATCACCAT CGCGGATACC
GGCATCGGGA TGACGGAAGC GGACCTGGTG GAAAACCTGG GAACCATCGC CCACTCCGGC
ACCAAAAAAT TCATGGAAGC CCTTAAGCAG AAGCAGGAAG GCGGAGCGGA CCTGATCGGT
CAGTTCGGCG TGGGCTTCTA CAGCTCTTTC ATGGTGGCGG ACCGCGTGGA AGTGTTCACC
CGCTCCTATG AACCGGAAGC CGCCTCCCTG CGCTGGTCTT CCGACGGACG GGAAGGCTAC
AGCATCGAAA CGCTGGCGGA ACCGCTGGAC CGGGGCACCC GCATCGTCAT CCGCCTGAAA
GACGAATATG AAGAATTTTC CCAGGAATAC CGCGTCAAGG AACTCCTGCG CCGCTACTCC
AACTTCGTGG GGTTCCCCCT CAACTTCAAC GGAGAACACA TCAACACCGT CCAGGCCATC
TGGTCCAAGT CCAAATCCGA CGTGAAGCCG GAGGAATATG ACGAGTTTTA CCAGTTCATC
TCCCATACGG ATGAAAAGCC CCTGTCCTAC ATGCACTTCA GCGCGGACGC CCCTATTGCC
CTGAATGCCC TGCTTTTCAT CCCCAGGCGC AATCCGGAAA TGTTCGGATT CGGCCGCGTG
GACGCCAACG TGGCCCTGTA CTGCAAGCGC GTGCTGATTG ACGCCAAGCC GGAAGGCCTG
CTGCCGGAAT GGCTCCGCTT CCTGAACGGC GTGGTGGACA GCGAAGACCT GCCCCTGAAC
ATTTCCCGCG AAATGCTTCA GGACAATTCC CTGGTACGCA AAATCAGCGA CATCATCACC
AGGCGCTTCA TCAAGCATCT GGAAAAACTG GCGAAGGACG ACAAGGAAAC CTACAGGGAA
TTCTACGCGC AATTCTCCCG CTACCTGAAA GAAGGCGTCG TCACCTCCTG GCCGAACAAG
GAATCCCTGG GCAAGCTGCT CCGTTTTGAA TCCACGTCCA CGGAACCGGG GGAAACGACC
TCATTTGAGG AATACCTTAC CCGCATGAAG GAAGGGCAGA CGGCCATTTA CGCGCTTACC
GGCCCTTCCC GCTCCCATCT GGAAAACAGC CCGTACCTGG AAGCCTTCAA GGCCCGCGGC
TATGAAGTGG CCTTCTTCAC GGACCACGGG GACGAATTCG TGCTGGACTC CCTGTCCAGC
GTGGACGGCA AGCCCGTCAC GATGATCGAC CGCGCCGACG TGGAACTCCC CGCCCTGGAA
GAGGAACAGA AGGACGCCCT GCCCCAGGAG GAAGCCGCGG CCCTGGAAGA ATGGCTGAAA
GGACTGTACC CGGACAAATT CTCCAAAGTC ACCCTGGGCA AGCGCCTGGT CAGCGGAGCC
GCCGTAGCCC TGCAAAGCGG CAATGACATG GGGCCGGAAA TGAGGGCGTA CATGAAAGCC
ATGGGGCAGG AAGTGCCGGA AAGCCACCCG CAGCTGGAAC TGAACCCCTC CAACCCTCTG
GTGAAAAAGC TTTCCGCCCT GCGGACGGAA AACCCGGAAC TGGCGCAAAT GGTGGCGGAC
CAGATCGCGA ATACCGCCCT GCTCCGCGCC GGCATGCTGG ACGATCCCGC CGTGCTGGCC
CAGTCCTCCC AGGCCCTGAT GGAACAGCTC CTGCTGAAGT AG
 
Protein sequence
MNTETHQFQA EVRQLLDIVI NALYSDREIF VRELVSNASD ALEKLRLKQL TDSNIYQPDK 
PLEITVATDK ENKTITIADT GIGMTEADLV ENLGTIAHSG TKKFMEALKQ KQEGGADLIG
QFGVGFYSSF MVADRVEVFT RSYEPEAASL RWSSDGREGY SIETLAEPLD RGTRIVIRLK
DEYEEFSQEY RVKELLRRYS NFVGFPLNFN GEHINTVQAI WSKSKSDVKP EEYDEFYQFI
SHTDEKPLSY MHFSADAPIA LNALLFIPRR NPEMFGFGRV DANVALYCKR VLIDAKPEGL
LPEWLRFLNG VVDSEDLPLN ISREMLQDNS LVRKISDIIT RRFIKHLEKL AKDDKETYRE
FYAQFSRYLK EGVVTSWPNK ESLGKLLRFE STSTEPGETT SFEEYLTRMK EGQTAIYALT
GPSRSHLENS PYLEAFKARG YEVAFFTDHG DEFVLDSLSS VDGKPVTMID RADVELPALE
EEQKDALPQE EAAALEEWLK GLYPDKFSKV TLGKRLVSGA AVALQSGNDM GPEMRAYMKA
MGQEVPESHP QLELNPSNPL VKKLSALRTE NPELAQMVAD QIANTALLRA GMLDDPAVLA
QSSQALMEQL LLK
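
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a minimal sketch of that relationship, the snippet below translates the first ten codons of the gene sequence and also shows how a GC fraction could be computed. It assumes the sequence is given 5' to 3' on the coding strand and that the spaces are only display grouping. The helper names `translate` and `gc_fraction` are hypothetical. The codon-to-residue assignments of table 11 are the same as those of the standard code (the two tables differ only in permitted start codons), so a single lookup table is built here.

```python
from itertools import product

# Codon table in NCBI base order T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Drop display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of G and C bases among all bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGAATACAG AAACACATCA ATTTCAGGCG"))  # MNTETHQFQA
```

Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field in the record.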