Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0670 |
Symbol | |
ID | 6273965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 788262 |
End bp | 789842 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642612722 |
Product | Trypsin-like protein serine protease typically periplasmic contain C-terminal PDZ domain-like protein |
Protein accession | YP_001877288 |
Protein GI | 187735176 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.475775 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTC TTTCCACTTT TTTCCTCGGT TCCCTGATGT TGCTCGGCGG CTGCCAGCCG CGCGAAGCCG GACGAACCGC TCCCGAACCC CAGCCTGAAC AACAGACGGA AACGCCTGCG GAACAGATGG AGGAAACGCC CGCGCCGGCC CCGCTTCCCT CCCCCACCGG CTCCATGGTC GGCATTAACG CCACCAATCA GGGCTATGCC ATGATTCAGC CGTGGAGCAA GGAAAACCCG GCGTACAGCC AGGGCTTCGG CATTTATCTG GGAGACGGCA ATATCCTGAC GGCAGCCAAC ATCGTTTATT CCGCCAGCTT CGTGGAAGTG ACCTCCGCAG ACGGCTCCCA GACGGTTCCC GTGACCGTAA CCGCCTTTGA CCCGGAAGCC AATCTTGCCC TTCTGCGCCT GAAAAACGGA AAAGATGCCG CTTTTCTGGA CAAACTGGTC CCCGTTGCGC TGGGGAAGGC TCCCCGCCTG GGCGACAAGG TAACCTTCTG GCAATTCAAT GGCGACGGCC TTCCCATCAC TACCTCCGGA ACCCTTCTGG CGACGGAAAG CGCCTGCCCG TTCACGAACG GGGAACCGTT CGTCCTGTAT AACGTCAAAT CCTCCGTCAC TCCCCTGAAA GGCGGCGCAG GCAACCCCGT CATGAGGGGC AATGAACTTG TGGGCCTCAG CGCCAGCTGC GATCCCTCCG CACAGAAAGT GCTGGCCGTA ACCCATACCA TGATTTCCCG GTTCCTGGAA CAGGCCCGGG CCGGCAATTA CACCGGTTTC CCGGCGGACG GCACCCAGGT CACGGAACTG ACCGACCCCG TCTTCCGCAA ATTCCTGGGC CTGCCTGAAA CTGGCGGCGG CTTTTACGTG GTGAAACTGC CTGTTTACGG CTCCTTCTAC AAAGCCGGAG TACGTCCCGG AGACGTGGTG GAAAGCGTCA ACGGCATCCC TCTGGACAGC AAAGGTTTAA TTAAGGATCC CGCCCTGGGC CCCGTTTCCG CCAACTTTCT GTTCCGAGAC TCCGCCAAAC CGGGGGATAC CATTACGCTG GGCATCCGCC GCAAGGGAAA GGACGGCTCC AGCCAGCCCA TGACGCTGGA CGTCAAACTG GACAGGAGCG CCCTTGAAGG GGACCTGGTC AATCCGGCCC CCTTCATCTC CAATCCGCCC TACCGCATTT ACGGAGGTCT GGTATTTGTC CCGCTGACGG GAGCCCTGAT GGGAGAAATC AACAAGCTCA GCAAGAACCA TCCCCCCCTC AACCTGGTGG AAGCCACTCA AAAGAAAGAG GACATACGGA AAAAAGGCGT GGATGAAATC GTGGTCTTCC TGATGGCCCT GCCCACCCAG GCTACACTGG GATACGCCCA GATGAGCCCC TCCATTGTGG AAAAAGTCAA CGGTGTGCAG GTGAAAAGCT TCAAGCACCT CAACCAGCTT CTGGACCTTC CCGCTCCCGG CGGCACGCAC CGCATCGAAG TGACCCAGCA GCCGTACACC ATGTACATGT CCCAGAAGGA AGCTGCCAAA GCGGACCGCT TCATCCAGAT GAGGGCCGTT CCCGTGCTCC GCAGGGACTA G
|
Protein sequence | MKILSTFFLG SLMLLGGCQP REAGRTAPEP QPEQQTETPA EQMEETPAPA PLPSPTGSMV GINATNQGYA MIQPWSKENP AYSQGFGIYL GDGNILTAAN IVYSASFVEV TSADGSQTVP VTVTAFDPEA NLALLRLKNG KDAAFLDKLV PVALGKAPRL GDKVTFWQFN GDGLPITTSG TLLATESACP FTNGEPFVLY NVKSSVTPLK GGAGNPVMRG NELVGLSASC DPSAQKVLAV THTMISRFLE QARAGNYTGF PADGTQVTEL TDPVFRKFLG LPETGGGFYV VKLPVYGSFY KAGVRPGDVV ESVNGIPLDS KGLIKDPALG PVSANFLFRD SAKPGDTITL GIRRKGKDGS SQPMTLDVKL DRSALEGDLV NPAPFISNPP YRIYGGLVFV PLTGALMGEI NKLSKNHPPL NLVEATQKKE DIRKKGVDEI VVFLMALPTQ ATLGYAQMSP SIVEKVNGVQ VKSFKHLNQL LDLPAPGGTH RIEVTQQPYT MYMSQKEAAK ADRFIQMRAV PVLRRD
|
| |