Gene Amuc_1288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1288 
Symbol 
ID6273845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1560701 
End bp1561972 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content61% 
IMG OID642613345 
Productpeptidase U32 
Protein accessionYP_001877894 
Protein GI187735782 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTTC AAAACCGGAT AGAAATCATG GCCCCGGCGG GGTCGTTTGA ATCTCTGGCG 
GCCGCCTTGC AGGGCGGAGC GGACTCCGTG TACTTCGGAG TAGGAAAGCT GAACATGCGT
TCCCGCGCCA CGGTCAATTT TTCGGAAGAA GATTTGCCGG AAATCGTCGA GCGTTGCCAT
GAGGCGGGCG CCAAAGCTTA TCTGACACTG AATATCATTG TGTACGATGA AGAGCTGGAG
GCGGTGCATG CCCTGTGCGA CGCCGCCCGG AAAGCCGGTG TGGATGCCGT CATCGCTTCC
GACCTGGCGG TGATTTCTTA TGCGCGTTCC ATTGGGCTGG AAGTCCATAT GTCCGTGCAG
GCCAACGTCT GCAACATGGC CTCCGTCAAA TTTTACGCGC AGTACGCGGA TGTGGTGGTG
CTGGCCCGGG AACTCACCCT GGCGCAAATC AGGCATATCA TTGAATCCAT CCGGAAGGAG
GGCGTGAAAG GCCCCTCCGG GGAGCTGCTG CGGGTGGAAA TTTTCGCCCA TGGGGCGTTA
TGTGTGGCCG TGTCCGGCAA ATGCCACATG AGCCTGGCGG CCTACAACTC CTCCGCCAAC
CGGGGGGCCT GCTTCCAGAA CTGCCGCCGC GCCTACCGCG TGACGGATGA GGAAACGGGC
AATGAACTGG TGATAGACAA CAAATACGTG ATGTCTCCCA AGGACCTGTG TACCATTCCG
GTGCTGGACC AGCTTCTGGA CGCGGGCGTT TCCGTGCTGA AGCTGGAAGG GCGCGGCCGT
TCCTCGGATT ACGTCAGGAC GGTTACCTCC GTGTACCGGG AAGCCGCGCG GGCATGCCAG
GACGGAACCT TTTCCGCGGA CAGGGCGGAA GCGTGGATGA AACGGCTGGA ATCCGTTTTC
AACCGGGGAT TCTGGCAAGG CGGCTATTAC CTGGGCGTGA AGTGGGGGGA ATGGAGCGGT
TCCGCCAACA GCCGCGCTGC CCTGTTGAAG ATCCACATTG CCAGGGTAGA GAACTTTTAT
AAGAAGAACG GGGTGGCGGC CCTGTTCCTG GAAGCCGGCG GCCTGTCCGC GGGGCAGACC
ATCCTCATAA CAGGCCCCAC TACGGGAGCC GTCCGCATGG AAGTGGCAGC CATGCGGAGG
GAGACGGCAG AGGGCATGGA GCCCGTAGAA GCCGCTCAAA AGGGAGAAAC CGTCTATCTG
GCGGTTCCCG AACAGGTGCG CCGCCGGGAC AAGGTGTACC TGCTGCGCCC CCGGATGCTG
GAGGATGCCT GA
 
Protein sequence
MPFQNRIEIM APAGSFESLA AALQGGADSV YFGVGKLNMR SRATVNFSEE DLPEIVERCH 
EAGAKAYLTL NIIVYDEELE AVHALCDAAR KAGVDAVIAS DLAVISYARS IGLEVHMSVQ
ANVCNMASVK FYAQYADVVV LARELTLAQI RHIIESIRKE GVKGPSGELL RVEIFAHGAL
CVAVSGKCHM SLAAYNSSAN RGACFQNCRR AYRVTDEETG NELVIDNKYV MSPKDLCTIP
VLDQLLDAGV SVLKLEGRGR SSDYVRTVTS VYREAARACQ DGTFSADRAE AWMKRLESVF
NRGFWQGGYY LGVKWGEWSG SANSRAALLK IHIARVENFY KKNGVAALFL EAGGLSAGQT
ILITGPTTGA VRMEVAAMRR ETAEGMEPVE AAQKGETVYL AVPEQVRRRD KVYLLRPRML
EDA