Gene Amuc_0348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0348 
Symbol 
ID6274960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp410440 
End bp412878 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content60% 
IMG OID642612399 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_001876968 
Protein GI187734856 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.845111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.923404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACCCA GTCCTCCCCG TCCTCCCAAA TTTCCCGGCT CCGGCAGGCC CGAATCCCCC 
AACTGGGGCG TGTGGGTCAT GGTACTTCTC ATTGTAGGCG TCCTTGCGTT TGGCTTTTTC
ACCCCTGAAT CTTTCGGGTT GGGCCCCCGC AAGGAAAATC TGGAATCCTT TGAAGCCCAG
TATAAGGCGG GGCGCGTGGT GCTGAACGAC CCCAAGGCCC CCGTGGAAGT GGTGCTGAGC
GAGAACGGTT CCGAAGGCGT TATTCACGCG CTGGTGTACA GGAAGGAAAT TCAGCCGAAG
GTGGAAATGA CCCCCTTTGC GCTGACTTAT TCCATGTCCC TTCCGGACCG CGACAAGCCT
TTGCTGAATG AACTTTCCGG CTACAGGGTG GTGGAGAGCC CGTACCGGAC GGAAGAGGGA
AAGAATGTTT CCCTCATTCC TGAGGGAGCT CAGAAGCTTT CCGTGCCAGA ATTCAACCGC
CTGGCCCTGG AAGGCCGCAT CGCCGGAGGA AAGGACGGCA TCATCCTGGC GGAAGACGGC
AACCAGAACG TGCTGGTCGG ACAGATTGTC ACCCGCATCT GGCCCGCGGC TACGGGAGAC
GCCTCCGTGG ACAAACAGCG TTTTGAACGT GTGGAAGTGC CTTTTACCCT GGAGTTCCAG
GGAGACCGCG TCAAGCAGCT GCTGGGGCCG GATACGAAGT TCAAGCGTGA ATCCGGTTCC
TGGGGCGGCA TTCTGCTGAA TCTGCTGCCC ATCGTGCTCA TTCTGGTGAT TTTGTTCTTC
ATGTTCCGCG CGCAGAGCGG CGGGGCCCGG GGGGCCATGA GCTTCGGCAA AAGCCGGGCG
CGCCTCATCT CCCCGGACAA GAACAAGGTG ACGTTCAAGG ATGTGGCCGG TATCAGCGAA
GCCAAGGAGG AAGTGTGGGA ACTGGTGGAG TTTCTGCGCA ATCCGGAAAA ATTCCGCGAT
CTGGGCGCCA CCATTCCCCG CGGCGTGCTG ATGGTCGGGG CGCCCGGTAC GGGCAAGACT
CTGCTGGCGC GTGCCATTGC CGGGGAGTCC AACGCTTCCT TCTATTCCAT CAGCGGCTCG
GATTTTGTGG AAATGTTCGT GGGGGTGGGG GCAAGCCGTG TCCGCGATAT GTTTGAACAG
GCCAAAAGGA CGGCGCCCAG CCTGATTTTC ATTGATGAAA TTGACGCCGT GGGCCGCCAG
CGCGGTTACG GCATGGGCGG CGGCAATGAC GAGCGGGAAC AAACGCTGAA CGCCCTGCTG
GTGGAGATGG ACGGTTTTGA AAACAACTCC AATGTAATCG TGATTGCCGC CACCAACCGT
GCGGACATTC TGGACCCGGC CCTGCTGCGT CCCGGCCGCT TCGACCGGCA GGTGGTGGTG
AACCTGCCGG ACGTCCGGGG GCGCGAACAG ATCCTGCAGG TGCATGCCAG AAAAGTGAAG
ATGGCGCCCG GAGTCAGCTT TGAGCGGATT GCCCGCGGCA CGTCCGGCTT TTCCGGTGCC
CAGCTGGCCA ACCTGGTCAA TGAAGCCGCC CTGCTGGCCG CCCGCAAGGG GCTGAAGGAG
ATTACGGAGG CCGAATTGGA AGAAGCCCGC GACAAGGTCA GCTGGGGGCG TGAACGCCGC
AGCCTGGCGA TTAACGAACG GGGGCGCCGC ATTACTGCCG TGCATGAGGC GGGGCATGCC
ATCTGCCTGT TGAAAACACC GCACAGCGAG CCGCTGCACC GGGTGACCAT TGTTCCCCGC
GGCGGGGCCC TCGGCATGAC CATGTGGCTT CCTTCCGACG ACAAGATGCA CCAGCTCCGG
TCCGAAATGC TGGACCAGCT CGTCGTGGCG ATGGGCGGGC GCTGCGCCGA ACAAATCGTT
TTTGGTGATG TGACCAGCGG TGCTACCGGA GACATCAAGA GCGCCACCAA CCTGGCGCGG
CGCATGGTGT GCGAATTCGG CATGAGCGAA AAACTGGGGC TGATTGAGTA CGGAGAACAC
CAGGGAGAAG TTTATATTGC CCGCGACCTG GGGACGCGTT CCCGCAATTA TTCCGAATCC
ACGGCGGAGC TGATTGACTC GGAAGTCCGC TTCCTGGTGG ACAGCGCTTA TGAACGCGCC
ATGGCCATCC TGACGGAAAA CCGGGACAAG CTGGACATTC TGACGGAGGC CCTGATGGAG
TTTGAAACGC TGGAAGGTTC CCAGGTCATG GATATTCTGG AATACGGGGA AATGAAAAAC
CCTCCCGCCA GGGTGACTCC GCCCCCCATG CCTTCCGAAG TGGAGGAACA GCCCGGGAAG
GATGATTCCG GCCATAACGA GAAGAAAGAA GCTGAAGAGA CCCGCGCGGA CGGCGCTGAA
GAACGGAAGA TGGAAGAGGA ACTGGAACAG GCGGAACGGG CCCCCTTCTC CTACAATCCC
GTTGATGAGT TCGGCAAGGA CGGCGGAGAA AAGAAATAA
 
Protein sequence
MPPSPPRPPK FPGSGRPESP NWGVWVMVLL IVGVLAFGFF TPESFGLGPR KENLESFEAQ 
YKAGRVVLND PKAPVEVVLS ENGSEGVIHA LVYRKEIQPK VEMTPFALTY SMSLPDRDKP
LLNELSGYRV VESPYRTEEG KNVSLIPEGA QKLSVPEFNR LALEGRIAGG KDGIILAEDG
NQNVLVGQIV TRIWPAATGD ASVDKQRFER VEVPFTLEFQ GDRVKQLLGP DTKFKRESGS
WGGILLNLLP IVLILVILFF MFRAQSGGAR GAMSFGKSRA RLISPDKNKV TFKDVAGISE
AKEEVWELVE FLRNPEKFRD LGATIPRGVL MVGAPGTGKT LLARAIAGES NASFYSISGS
DFVEMFVGVG ASRVRDMFEQ AKRTAPSLIF IDEIDAVGRQ RGYGMGGGND EREQTLNALL
VEMDGFENNS NVIVIAATNR ADILDPALLR PGRFDRQVVV NLPDVRGREQ ILQVHARKVK
MAPGVSFERI ARGTSGFSGA QLANLVNEAA LLAARKGLKE ITEAELEEAR DKVSWGRERR
SLAINERGRR ITAVHEAGHA ICLLKTPHSE PLHRVTIVPR GGALGMTMWL PSDDKMHQLR
SEMLDQLVVA MGGRCAEQIV FGDVTSGATG DIKSATNLAR RMVCEFGMSE KLGLIEYGEH
QGEVYIARDL GTRSRNYSES TAELIDSEVR FLVDSAYERA MAILTENRDK LDILTEALME
FETLEGSQVM DILEYGEMKN PPARVTPPPM PSEVEEQPGK DDSGHNEKKE AEETRADGAE
ERKMEEELEQ AERAPFSYNP VDEFGKDGGE KK