Gene Information |
Locus tag | Amuc_1390 |
Symbol | |
ID | 6274602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1660025 |
End bp | 1661212 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642613447 |
Product | hypothetical protein |
Protein accession | YP_001877995 |
Protein GI | 187735883 |
COG category | [S] Function unknown |
COG ID | [COG3274] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
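The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the terminal stop codon. The record does not state either convention, but both are consistent with the values it lists. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1660025, 1661212)
print(length, protein_length_aa(length))  # 1188 395
```

Both results match the Gene Length and Protein Length fields of this record.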
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.67535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.125851 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTGC CGCTGGCTGA TGATGAAGAG AAAGGAAAGC GTTTTTCCTG TGGTTGTCCG GAATATATTT CCCCGTCCCG GAATTATGGA ATAGATGCCT TGCGCATTGT CGCCATGATG ATGGTTTTAA TTCTTCATCT TTTGGCTTCC ATTGATGTTT TGTCTTTGGA GAACCATGGG TCCGCTTCCT ATAATGTCGG GTGGCTGCTG GAGATTGCCG CTTATTGCGG TGTGAATTGT TATGCGTTAA TTACAGGATA TGTGTGCTGC GACGGAACGT TCAGGTATGA ACGTGTGGTT TCCTTATGGT TTCAGGTTGT TTTTTATACA TGGGGCAGTT TGCTGCTGGC TCTGTTGTTT TTCCCCCAGG AGGTTCAGTT GAGTAATATT CTGCATTCCC TTTTTCCGGT TTTATCCGGC CAATATTGGT ATGTGACGGC GTATGTGGGG CTGTTTTTCT TCATTCCGTT TCTTAACGCC CTGGGGAACA GGCTGACCAA ATTACAGTTC CAGTATCTTC TGGTTACCGT TTTCATGCTG TTTTCCATTA TTCCCACCCT GCTTCACACG GATGTGTTCC CGGTGGAAGA AGGGTATAGC ATCTGGTGGC TGGGCATCCT TTACATGCTG GGGATGTATG TTAAAAAGCA TGGCTTGCTG ACGGGAATGA AAACGCGTCC GTTATGGATG TTTTATGCCG GATGCGTATG TTTTGCCTGG GTTTTCAAGA TGGTTCTGAA TGTGGTGTCT CCATACCTGA TTGGGCAGAT CAAGGGCGGT GGAATGTTTA TTCGCTATAA TTCTCCTTTT ATTGTAGGGC CGGCGGTTGC CCTGTTGCTG ATTTTTTCCC GGATGCATTT TTCATCCCGA AGGGCTGTCT CCTGTATTTC ATGGCTGGCG GCAGCGTCGT TCAGCGTTTA TGTGCTGCAT TGTAATGCTT TGATAGGAAA ATGGTTTTTG TGGGATGTCT TTGAGTGGAC AGCGTCTTCT TCCTCAGCCC TGATGGTTGT GAACGTGTTG GCAATAGCCG CCGTGGTTTA TGCCGGATGC GCCTTGGTGG ACTCCGTGCG GCGTTATTTA TTTAAGCTCA TGAACGTGGA AAGGGGCGCC CGGGCGGTGA CGGGCTTTTG CGGAAAGCTG GGGCATGCGT TCCGGAAGAT GTGCCGCCGG ATAGATTCGC ATCCCTAA
|
Protein sequence | MSLPLADDEE KGKRFSCGCP EYISPSRNYG IDALRIVAMM MVLILHLLAS IDVLSLENHG SASYNVGWLL EIAAYCGVNC YALITGYVCC DGTFRYERVV SLWFQVVFYT WGSLLLALLF FPQEVQLSNI LHSLFPVLSG QYWYVTAYVG LFFFIPFLNA LGNRLTKLQF QYLLVTVFML FSIIPTLLHT DVFPVEEGYS IWWLGILYML GMYVKKHGLL TGMKTRPLWM FYAGCVCFAW VFKMVLNVVS PYLIGQIKGG GMFIRYNSPF IVGPAVALLL IFSRMHFSSR RAVSCISWLA AASFSVYVLH CNALIGKWFL WDVFEWTASS SSALMVVNVL AIAAVVYAGC ALVDSVRRYL FKLMNVERGA RAVTGFCGKL GHAFRKMCRR IDSHP
|
| |
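The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a short script. This is a minimal sketch: the codon table is built from the standard NCBI compact layout (bases ordered T, C, A, G), whose amino acid assignments translation table 11 shares. Table 11 differs in its permitted start codons, which the sketch does not handle. The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical. Whitespace is stripped first because the sequence above is printed in blocks of ten bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate frame 1 of a DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAGTTTGC CGCTGGCTGA TGATGAAGAG"))  # MSLPLADDEE
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_fraction` on the same input can be compared against the GC content field.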