Gene Amuc_0670 details

Gene Information

Locus tag: Amuc_0670
Symbol:
ID: 6273965
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 788262
End bp: 789842
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 60%
IMG OID: 642612722
Product: Trypsin-like protein serine protease typically periplasmic contain C-terminal PDZ domain-like protein
Protein accession: YP_001877288
Protein GI: 187735176
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
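
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 788262
end_bp = 789842

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1581 bp) and Protein Length (526 aa).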


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.475775
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTC TTTCCACTTT TTTCCTCGGT TCCCTGATGT TGCTCGGCGG CTGCCAGCCG 
CGCGAAGCCG GACGAACCGC TCCCGAACCC CAGCCTGAAC AACAGACGGA AACGCCTGCG
GAACAGATGG AGGAAACGCC CGCGCCGGCC CCGCTTCCCT CCCCCACCGG CTCCATGGTC
GGCATTAACG CCACCAATCA GGGCTATGCC ATGATTCAGC CGTGGAGCAA GGAAAACCCG
GCGTACAGCC AGGGCTTCGG CATTTATCTG GGAGACGGCA ATATCCTGAC GGCAGCCAAC
ATCGTTTATT CCGCCAGCTT CGTGGAAGTG ACCTCCGCAG ACGGCTCCCA GACGGTTCCC
GTGACCGTAA CCGCCTTTGA CCCGGAAGCC AATCTTGCCC TTCTGCGCCT GAAAAACGGA
AAAGATGCCG CTTTTCTGGA CAAACTGGTC CCCGTTGCGC TGGGGAAGGC TCCCCGCCTG
GGCGACAAGG TAACCTTCTG GCAATTCAAT GGCGACGGCC TTCCCATCAC TACCTCCGGA
ACCCTTCTGG CGACGGAAAG CGCCTGCCCG TTCACGAACG GGGAACCGTT CGTCCTGTAT
AACGTCAAAT CCTCCGTCAC TCCCCTGAAA GGCGGCGCAG GCAACCCCGT CATGAGGGGC
AATGAACTTG TGGGCCTCAG CGCCAGCTGC GATCCCTCCG CACAGAAAGT GCTGGCCGTA
ACCCATACCA TGATTTCCCG GTTCCTGGAA CAGGCCCGGG CCGGCAATTA CACCGGTTTC
CCGGCGGACG GCACCCAGGT CACGGAACTG ACCGACCCCG TCTTCCGCAA ATTCCTGGGC
CTGCCTGAAA CTGGCGGCGG CTTTTACGTG GTGAAACTGC CTGTTTACGG CTCCTTCTAC
AAAGCCGGAG TACGTCCCGG AGACGTGGTG GAAAGCGTCA ACGGCATCCC TCTGGACAGC
AAAGGTTTAA TTAAGGATCC CGCCCTGGGC CCCGTTTCCG CCAACTTTCT GTTCCGAGAC
TCCGCCAAAC CGGGGGATAC CATTACGCTG GGCATCCGCC GCAAGGGAAA GGACGGCTCC
AGCCAGCCCA TGACGCTGGA CGTCAAACTG GACAGGAGCG CCCTTGAAGG GGACCTGGTC
AATCCGGCCC CCTTCATCTC CAATCCGCCC TACCGCATTT ACGGAGGTCT GGTATTTGTC
CCGCTGACGG GAGCCCTGAT GGGAGAAATC AACAAGCTCA GCAAGAACCA TCCCCCCCTC
AACCTGGTGG AAGCCACTCA AAAGAAAGAG GACATACGGA AAAAAGGCGT GGATGAAATC
GTGGTCTTCC TGATGGCCCT GCCCACCCAG GCTACACTGG GATACGCCCA GATGAGCCCC
TCCATTGTGG AAAAAGTCAA CGGTGTGCAG GTGAAAAGCT TCAAGCACCT CAACCAGCTT
CTGGACCTTC CCGCTCCCGG CGGCACGCAC CGCATCGAAG TGACCCAGCA GCCGTACACC
ATGTACATGT CCCAGAAGGA AGCTGCCAAA GCGGACCGCT TCATCCAGAT GAGGGCCGTT
CCCGTGCTCC GCAGGGACTA G
 
Protein sequence
MKILSTFFLG SLMLLGGCQP REAGRTAPEP QPEQQTETPA EQMEETPAPA PLPSPTGSMV 
GINATNQGYA MIQPWSKENP AYSQGFGIYL GDGNILTAAN IVYSASFVEV TSADGSQTVP
VTVTAFDPEA NLALLRLKNG KDAAFLDKLV PVALGKAPRL GDKVTFWQFN GDGLPITTSG
TLLATESACP FTNGEPFVLY NVKSSVTPLK GGAGNPVMRG NELVGLSASC DPSAQKVLAV
THTMISRFLE QARAGNYTGF PADGTQVTEL TDPVFRKFLG LPETGGGFYV VKLPVYGSFY
KAGVRPGDVV ESVNGIPLDS KGLIKDPALG PVSANFLFRD SAKPGDTITL GIRRKGKDGS
SQPMTLDVKL DRSALEGDLV NPAPFISNPP YRIYGGLVFV PLTGALMGEI NKLSKNHPPL
NLVEATQKKE DIRKKGVDEI VVFLMALPTQ ATLGYAQMSP SIVEKVNGVQ VKSFKHLNQL
LDLPAPGGTH RIEVTQQPYT MYMSQKEAAK ADRFIQMRAV PVLRRD
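
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon assignments of translation table 11, which match the standard genetic code for elongation codons, and it strips the spaces used to display the sequence in blocks of ten. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tooling; this is not necessarily how the record's protein sequence or GC content were derived.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence shown above.
first_row = "ATGAAAATTC TTTCCACTTT TTTCCTCGGT TCCCTGATGT TGCTCGGCGG CTGCCAGCCG"
print(translate(first_row))  # MKILSTFFLGSLMLLGGCQP
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be compared against the nucleotide data.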