Gene Information |
Locus tag | Amuc_0343 |
Symbol | |
ID | 6274978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 403467 |
End bp | 405797 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642612394 |
Product | Thiol:disulfide interchange protein-like protein |
Protein accession | YP_001876963 |
Protein GI | 187734851 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
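
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive coordinates on the replicon and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 403467
end_bp = 405797

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (2331 bp) and Protein Length (776 aa) fields.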
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0708911 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.0124228 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTAACCG CCTTTGCTAT GAGAAACTTT TTGAAAGCAG CCGTCGGCTG CCTGGGCATG ATCGCCGGGT GTTTCAGTAC GGGGGAGGCC CAGGACTTCG GAGGCATGAG CTTCGGGGAA GCGGGAAACT TTGGTCCGCC CCAGACATCC GGTTCCTCCA AGGCCACTGT CACCTCCTAT GGAGAAGCCT CCTTCGTCAT CGTGACGGAG CTGGCGCTGC CGGACCACTG GCACGTTTAC TATAAAAATC CGGGAACCGT GGGCCTTCCC ATGGAAGCCG CTCTCCAGCC TGTGCCCGGC TTCCGGGTGG AAGGCCCCTT CTGGCAAATC CCGGAACTGG AGAAAGGGCT TGTGGATTTT TACGGATACA GCGGAAAAGC CAAAATGGCG TTCCGGGTGA CTCCGGAAAA GGATGCCCCC CCGGAAGCGA CGTTCATCAC CACCATGACA TGGCAAATGT GCGCGGAACA GTGCGCCGCC CCGGAAACCA AAAACTTCAG CGTCACACTG AAGCGCGGAG ACGGGCAGAC AGCCCCTGAC GCGGCGGAAT TGACCGGAAA CATGGCAGGG CTTGCCGCCC CCGACTGGGC GGAAGGGCTG AAAGCCCGGA TTTCCCAGGA AGGGAAAACC GTCACCCTGC ACCTCAGGAC AAACGGACGC CCCGTTCCGC AGGATTCCGT CTATTTTTTC TGCAATCAGG GGGAAATCAA CCCTACCACT CCCCAAATCT TCAAAAAACT GGATGACTCC AATTATGAGC TGTCCATGCA ATTCAATGAC ACCACGGACG GCCTTTACCC CAACAACCTG CCGGATGCAG ACAGGGGAAA ACCGCTGACC AGCCTTTCCG GCATCCTGCG CGCCGGCAGG GAAGGCATCA TCATCACCGC GGACGACCGC CCCTTCTCCG GGGAATCTCA GTCAACAGCC GCCGGAACGG AATCCGCCCC GGAACCTTCC GCTTCCATCC CCGCTCCTCC GCTGATGGGC CTGGGGGAAA TCATGTTCTT CATGTTCATC GGCGGCATCA TCCTCAATGT CATGCCGTGC GTCTTCCCGG TAATCGGCCT CAAGATTATG GGATTCGTCC AGCTGGGAGG GGGGGAACGG AAAAAAGTGC TGGCCCACTC CCTTACCTTC GTACTGGGCA TCCTGATTTC TTTCTGGCTC ATTACTGCCA TTCTGATTGC GCTGAAAGCG AACATGTTTG ACTGGAGCGC CCCAGCCGGC CCCGGCGTGT TCAGCGGAGA CTTCTGGCTG GGCCGCGGCG CGGAGGGCGT TGTCAACTGG GCGTTCTGGT TTGAAAACCC CTGGGTGAAT TTCTGCCTGC TGGGCCTGAT GCTCGCCATG GGGCTGAGCA TGTTCGGCGT CTTTGAAATA GGCGTCAAAG CCACGACCAT GGGCAACGAC CTCCAGCACC GGAAGGGTTA TGCCGGCTCC TTCTGGTCCG GCGCCCTGGC TACGGTCATC TCCACCCCGT GCAGCGCTCC GTTCCTGGGC CAGGCTATCG GCGCGGCCAT GCTCCAGCCG CCGCTGGGCA TCGTGCTCTG CCTGACCATG ATGGGGCTTG GAATGTCCCT TCCCTACATC ATTCTGGGAG CCTTTCCCGT CCTGACCAGA TACCTGCCCA AACCCGGCGC ATGGATGGAA TCCTTCAAGC AGTCCATGTC CTTCCTGATG TTCGGCACCG CAGCCTACTT CCTCTGGATT TACATGGCCT TCTTTGATGC AGAAAACCAT CCTCAGGACA TCCTGTTCCT CTTCTTCGGG CTTGTGCTGT TTTCCATGGC CTTTTGGGTA 
TACGGCAGAT GGTGCCCCAT GTACCGCAGC AGAAAGTCCC GCATCACGGG AGGCATCTTC TCCGTAATCT TCCTGCTGGC CGGCCTGTAC TACATGCTCC CGCCGGAAGG AGCAGCCTGG TTCGGCCGCG GTTCCGCCCC CGGGGCGGCG GAGTCCGCCG CCGCAGCACC CTCCCTTCAG GAGGAAGGAA ACATCTGGAT ACCCTGGAGC CCGGAAGCCA TGCAGGCCGC CCTGGACGGA GGGAAACCCG TTTATGTGGA CTTCACGGCC CGCTGGTGCT CCACCTGCCA GGTCAACAAG GCCTCCTATA CGGATGAAGT GCTGGCCGCT TTCAAAAAAT ATGGCATCGT CATGATGAAA GCGGACAAGA CCCGCACGAA CCCCGCCATT GACCAGGAAC TTAAAAACCT GGGGCGCACG GCCGTTCCCG TCAATGCCCT GTATCTTCCC GGCAGAAAAC CAGCCGTCAC CAGGGAACTC CTGTCCCCGG CCTACCTGCT GGAATTCCTG GAAAAGGAAA TGGAACGCTA G
|
Protein sequence | MVTAFAMRNF LKAAVGCLGM IAGCFSTGEA QDFGGMSFGE AGNFGPPQTS GSSKATVTSY GEASFVIVTE LALPDHWHVY YKNPGTVGLP MEAALQPVPG FRVEGPFWQI PELEKGLVDF YGYSGKAKMA FRVTPEKDAP PEATFITTMT WQMCAEQCAA PETKNFSVTL KRGDGQTAPD AAELTGNMAG LAAPDWAEGL KARISQEGKT VTLHLRTNGR PVPQDSVYFF CNQGEINPTT PQIFKKLDDS NYELSMQFND TTDGLYPNNL PDADRGKPLT SLSGILRAGR EGIIITADDR PFSGESQSTA AGTESAPEPS ASIPAPPLMG LGEIMFFMFI GGIILNVMPC VFPVIGLKIM GFVQLGGGER KKVLAHSLTF VLGILISFWL ITAILIALKA NMFDWSAPAG PGVFSGDFWL GRGAEGVVNW AFWFENPWVN FCLLGLMLAM GLSMFGVFEI GVKATTMGND LQHRKGYAGS FWSGALATVI STPCSAPFLG QAIGAAMLQP PLGIVLCLTM MGLGMSLPYI ILGAFPVLTR YLPKPGAWME SFKQSMSFLM FGTAAYFLWI YMAFFDAENH PQDILFLFFG LVLFSMAFWV YGRWCPMYRS RKSRITGGIF SVIFLLAGLY YMLPPEGAAW FGRGSAPGAA ESAAAAPSLQ EEGNIWIPWS PEAMQAALDG GKPVYVDFTA RWCSTCQVNK ASYTDEVLAA FKKYGIVMMK ADKTRTNPAI DQELKNLGRT AVPVNALYLP GRKPAVTREL LSPAYLLEFL EKEMER
|
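
The relationship between the Gene sequence and Protein sequence fields can be illustrated with a short script. This is a sketch only, not the database's own pipeline: it assumes the sequence is given as space-separated blocks of bases as shown above, and it uses the codon assignments of translation table 11, which for elongation are the same as the standard genetic code. The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers introduced for this example. Alternative start codons that table 11 permits are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
head = "ATGGTAACCG CCTTTGCTAT GAGAAACTTT"
print(translate(head))  # MVTAFAMRNF
```

The printed peptide matches the first block of the Protein sequence field. Passing the full Gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.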