Gene Amuc_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2040 
Symbol 
ID6273697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2477612 
End bp2479792 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content59% 
IMG OID642614101 
ProductOligopeptidase A 
Protein accessionYP_001878631 
Protein GI187736519 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0328478 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.00152686 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGCCA TGAGTTTACT CAGGCAGTCT CTTCATCTCT TTTTCCTCTT CCTCTCCGTG 
GCCCTTCCCT CCCCGGCGGC CACTTCCGCC CACCCTTTCC TGGACAGGGA GCGTCCCATC
CGCTGGAGCC GGCTGACCCA GGACAAGCTG GAACCGGATA TTCAGGAAGC CATGCGCCTC
ACCCGGACAG CCATAGAGGA AATCAGCCGC CTCCGTCCGG AAGAAATGAC GTATGAAAAT
ACGTTCGGCG CGCTGGAGAA AAGCAATGAC CTCCTTACGG AAGGCATGTG CAAGGCTTAT
GTCCTGAAAA GCCTGTGCGA CAGCGGGGAA CTTCGCAAGG CCATGGATTC GGTCGCTCCC
CGCGTGTCCG CCTTCCTTTC CTCCGTCACG AAAGACCAGG CCCTGTGGAA AGTCCTGAAA
ACCGCGGAGG AACGCCTGCG GCAAACTCAC CTGAACCCCG AACAGGAACG GTACATGGAG
CTAAGCATGC AGAGCTTCCG GGATAACGGC GCTGATCTGC CGCCGGACAA GCGCGCACGG
CTTGAGTCCA TTGACAGGGA ACTGACGCTC GCCTCCCAGC GTTTCAACAA TTTGTATATG
GATGCCAGGA AATCCTGGAC CTGGACGGTG CGGAACGCCG CCCTTCTGGA AGGGATAGAC
GGAAGCGCCC TGCAGCAGGC GCGTGAAGAA TTTCTGAAGC GCCAGCCCGG CCAGTCCGGT
CCGGGCTGGA CGTTCACGCT TGATTCCGCG GCTTCCGCCC GGGTCATGGA AAAAGCCCGG
AGGGAAGAAT GCCGGAAAGA TTTATGGGAA CACCGCCAGT CTCTGGCTAC GGGAACATGC
GACACGGAAC CCGTCATCAG GGAAATCCTC TCCCTGCGCC GTGAAAAGGC GCATTTGTGC
GGATATAAGG AATACCCGGA TTACGCCCTG CGCGAGAGCA TGGCGGAAAA CGGGGAAAAC
GCCATAAAGT TCGTCAATGA GCTGCTGGAC AAGATCAAGG CCCCCTTTTT CCGTGAAATG
GAAACGCTCC GCAGCCTGAA GGCCCGCCTT ACGGGGCAGG AAAACGCCCG CCTGAATCCC
TGGGACGTGG CATATTACGC CAATCTCCGG GCAGAAGAAC ATTTCCGGCT GGACCAGGAG
GAGCTGCGCC GCCACTTTCC CCTTCCCCGC GTGCTGGACG GCCTGTTTTC CCTGGCGGAG
CGGCTGTACG GCATCCGCGT GAAGGAAGTT CCGGCGCGGC AGTCCCTCTC CGGCATCCCG
GCAGGCGAAT CCGCCGGAAC CGTGGAAGTA TGGCATCCGG ACGTACGCTT TTTCACGATT
GACGACGGCA ACGGCAACCA ACTGGGTTCC TTTTACCTGG ATCTTTTCTC CCGTAGCAAT
AAACGGGCCG GAGCGTGGAT GAACACCCTG GATACGGGCA GCCCGTCCAC GCCGGAGACC
CCGGGCAAGC CGCGCCTGGG CATGGTCTGC CTCAATATCC ATCCCCCCGC GGCAGGAGAC
ACGGTGATAC TGTCCCACCG GGAAGTCAGG ACTTTATTCC ATGAGTTCGG CCACTTGCTG
CACCTGATGT TTACCAGGGT TTCCATTCCT TCCCTGGCGG GGACCAGCGT GCCGCGGGAT
TTTGTGGAAG TCCCCTCCCA ATTCATGGAA AACTGGTGCT GGCGGCCGGA CGTGCTGAAA
AGCTTTGCGC GCCATGAACG AACGGGACTC CCCATTCCGG AGGAAATGCT GAACTCTCTG
GACGCCTCCC GCGGCAATAC GCCTGCCCTC GCGCTGGCCG GGCAGCTCCT GTACGCAAAG
ATGGACCTGG CCGTGCATTC GGAACCGGAA CGCTTCTCCG CCGGCTCTCT TGATGATGTG
GATTCCGCCG TAGCGGGAGA TATGGATTAT TTCAAAGATT TCAAAAGAGC CGGCAAGCTG
CGCACGGCGC GTCACTTGTT TTCCTCTCCT GCGGGCTATG CCTCCTTTTA TTTCTCCTAC
CAATGGGCGG AAGTTTTGGA CAAAGATATT TTTGAAGCCT TTGAACGGGC CGGAGGCCAG
GACAGGGAAA CGGCAGGAAA ATTCCGGAAA ACCATTCTGG AAAAAGGCTA TGCTGTCCCG
CCCATGCGGC AGTTCATGGA TTTCATGGGA AGAAAGCCGC GCATGGACGC CATGCTCCGC
AAGAGGCGGC TGGCATCCTG A
 
Protein sequence
MAAMSLLRQS LHLFFLFLSV ALPSPAATSA HPFLDRERPI RWSRLTQDKL EPDIQEAMRL 
TRTAIEEISR LRPEEMTYEN TFGALEKSND LLTEGMCKAY VLKSLCDSGE LRKAMDSVAP
RVSAFLSSVT KDQALWKVLK TAEERLRQTH LNPEQERYME LSMQSFRDNG ADLPPDKRAR
LESIDRELTL ASQRFNNLYM DARKSWTWTV RNAALLEGID GSALQQAREE FLKRQPGQSG
PGWTFTLDSA ASARVMEKAR REECRKDLWE HRQSLATGTC DTEPVIREIL SLRREKAHLC
GYKEYPDYAL RESMAENGEN AIKFVNELLD KIKAPFFREM ETLRSLKARL TGQENARLNP
WDVAYYANLR AEEHFRLDQE ELRRHFPLPR VLDGLFSLAE RLYGIRVKEV PARQSLSGIP
AGESAGTVEV WHPDVRFFTI DDGNGNQLGS FYLDLFSRSN KRAGAWMNTL DTGSPSTPET
PGKPRLGMVC LNIHPPAAGD TVILSHREVR TLFHEFGHLL HLMFTRVSIP SLAGTSVPRD
FVEVPSQFME NWCWRPDVLK SFARHERTGL PIPEEMLNSL DASRGNTPAL ALAGQLLYAK
MDLAVHSEPE RFSAGSLDDV DSAVAGDMDY FKDFKRAGKL RTARHLFSSP AGYASFYFSY
QWAEVLDKDI FEAFERAGGQ DRETAGKFRK TILEKGYAVP PMRQFMDFMG RKPRMDAMLR
KRRLAS