Gene Amuc_1091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1091 
Symbol 
ID6274011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1302165 
End bp1303544 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content56% 
IMG OID642613142 
ProductCytochrome-c peroxidase 
Protein accessionYP_001877698 
Protein GI187735586 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000159552 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0000000000000391154 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAAAA CATGTTCCTT CATCAAGGCA GGAGCCATTC TGGGCGGCAC GGTCATCGTG 
ACGGCGGCTG TCGCCCCCCT GTTCCTGCCG AACCAGAATG TCAAGCCACT GACGTCCGCC
CAGGCGGCGG AAATTACCGC ACAGACCATG AATTCCAAAT GTGCGGACTG CCACAAGCCC
GGCACTCACA TTTCCGAACT GGTCAATACT CTTTCCGGAG GCCTGCTGGC GCGCCATATC
AGGGACGGGC AGCGCAGCTA CAATATGGAA GAACCTCCCA CTGCCGTCAC CCTTTCCAAG
CTGGAACATG TGCTTCAAAT CAATTCCATG CCCCCCACTT CCTACACCAT GGTGCACTGG
GGCAGCACGC TCACTCTCCG GGAGAAAAAC GCCATGCTCC AGTGGATCAA GGATGAGCGC
CTGAAAATTT TCGGCGATAT GGTAGGAGAG GAATACGCCC TTTCCCCTCT TGCCCCCATT
CCGGACGCCC TCCCCACGGA TCCGGCCAAA GTGGCCCTGG GCTACAAGCT TTTTCATGAC
GTGCGCCTTT CCACGGACAA TACCGTTTCC TGCGCTTCCT GCCATTCCCT GGAAAAAGCC
GGGACGGACA ACCTGCCCAC TTCCACCGGA GTCCGCGGCC AGAAAGGCGG CATCAATGCC
CCCACCGTTT TCAATGCCGC TTTCCATGCC AAGCAATTTT GGGACGGACG CGCAGCCAAC
CTCCAGGAAC AGGCCGGCGG ACCGCCCCTG AATCCGGTGG AAATGGGGTA CGAACATCCG
GATGACTGGA AGAAGATCGC TGCCAAACTG GACCAGGACA CCGCTTTTGC CGCAGAATTC
AAAAAGGTTT ACCCCCAGGG ATTCACCGGA GAGACCATCA CGAATGCCAT CGCGGAATAT
GAAAAAACTC TTATCACGCC GAACAGCCCG TTTGACCGCT ACCTGAAAGG GGATGAAAAC
GCCATCAGCG AGAACGCCAA AAAAGGTTAC AAGCTTTTCC TGAAGCTTGG TTGCCAGACC
TGCCACACCG GTCCCGCCAT GGGAGGCCAG TCCTTTGAAT ACGCCGACCT CAAAGGCGAT
TTCTTTGCCG GACGCGCCAA GACCAACGAC GATAACGGCC TGATGAATTT CTCCAAAAAG
GAATCGGACA GGCACCGCTT CCGTGTTCCG ACCCTCCGCA ATGTGGAACT CACCTGGCCG
TACATGCATG ACGCCTCCGC ACAGACTCTG GAGGAAGCCA TTACGAAAAT GTACCATTAC
CAGCTCGGTT ACGATAAACT GGACAAGAAG GAAGTGAGAC TTCTGGTGGC TTTCCTGAAG
ACGCTTACCG GAGAATACAA CGGCAAACCC GTCCAGGGCG AAGTTTGCCC TGCCTCCTGA
 
Protein sequence
MSKTCSFIKA GAILGGTVIV TAAVAPLFLP NQNVKPLTSA QAAEITAQTM NSKCADCHKP 
GTHISELVNT LSGGLLARHI RDGQRSYNME EPPTAVTLSK LEHVLQINSM PPTSYTMVHW
GSTLTLREKN AMLQWIKDER LKIFGDMVGE EYALSPLAPI PDALPTDPAK VALGYKLFHD
VRLSTDNTVS CASCHSLEKA GTDNLPTSTG VRGQKGGINA PTVFNAAFHA KQFWDGRAAN
LQEQAGGPPL NPVEMGYEHP DDWKKIAAKL DQDTAFAAEF KKVYPQGFTG ETITNAIAEY
EKTLITPNSP FDRYLKGDEN AISENAKKGY KLFLKLGCQT CHTGPAMGGQ SFEYADLKGD
FFAGRAKTND DNGLMNFSKK ESDRHRFRVP TLRNVELTWP YMHDASAQTL EEAITKMYHY
QLGYDKLDKK EVRLLVAFLK TLTGEYNGKP VQGEVCPAS