Gene Information |
Locus tag | Amuc_1596 |
Symbol | |
ID | 6275611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1919317 |
End bp | 1920537 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642613656 |
Product | protein of unknown function DUF111 |
Protein accession | YP_001878197 |
Protein GI | 187736085 |
COG category | [S] Function unknown |
COG ID | [COG1641] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00299] conserved hypothetical protein TIGR00299 |
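The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Amuc_1596.
length = gene_length(1919317, 1920537)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Both results match the Gene Length (1221 bp) and Protein Length (406 aa) fields in the record.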
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCCT TGGTCTATGA TTGCAGCGCG GGCATCAGCG GAGATATGAA TCTGGCCGCC CTGATTGATT TGGGAGCGGA CCGGGAAAAC CTGGAACGGG AACTGTCCAA ATTGCATGTG CACGGGGAGT GGAAGCTGGA ATGCCGCCCG GCGCAGCAGC ACGGCATCCG GGGAACACGG ATTGACGTGC TGACGGAGGA AGAAGGGCAT TCCGGCAGCA GCCATGCGCA CCATCACCGA ACTATGGCGG ATATACGAAA GCTTATTGAG ACAAGCGCCC TTTCCAATAC CGTAAAACGG ACGGCCCTTT CTATTTTCAC CCTGCTGGCG GAGGCGGAAG CCCGCGTACA CGGCACTACT TCTGAAGAAG TACATTTCCA TGAAGTGGGC GCGGTGGATT CCATCATCGA TATCGCGGGA GCTGCCATCT GCCTGGAAAT GCTCCGGGTG GACACCGTCT TCACCGGACC GGTGGAGCTG GGGTCTGGCA CCGTAACCTG CCAGCACGGA ACAATGCCCG TTCCTGCGCC GGCAACGGCT TTGCTGGCGA AACATTTCCG GGCCACCTTG AATGGAACCA CCCATGAGGC CACTACACCC ACCGGAGCCG CCTACATTGC CGCCCTGGCG CAGCCCGTCC CCTCTCCGCT GGCAGGCCGT ATTACCGCAA CCGGCTATGG GATCGGCCAC CGCCAGGGAC TTCCCGTACC GAATATTCTA CGGGTGATGC TGGTAGAAAC GGAAGAAGAG GAAACTTCCC CGGAACTGCT GACGGAGCTG TGCGCCAATA TAGACGATAT GACGCCGGAA CAGACAGCCT ACCTGGCGGA AAAGCTGATG GAGGCAGGTG CTCTGGATAC CTGGCAGGAG TCCGTCTGCA TGAAGAAAGG CCGTCTGGCA GTCAAGGTTT GCGCCCTGTG CCAGCCGGAA CAGACGGACC GCGTGCGGGA AGCCTTCTTC CGGCACAGCA GCACGCCGGG CATCAGGCAG CATGGCATGC TCCGCCATAT TCTGCGCCGG GAAAGCGCCC CTGTCCACAC TCCCCATGGA ACGGTTCATG TAAAAACTTC TTTCATGCAC GGCCGCCCCC ATTACCGGAA AGCGGAATTT GAAGATTGCA GAATCCTGGC GGAAAAAACC GGATTGCCGC TGGGACAATG CCAGCTGATG GGGCTTTTTC CCACATCTTC CCATGACAAC GACGCCACAG ACTCCGTTTG A
|
Protein sequence | MRALVYDCSA GISGDMNLAA LIDLGADREN LERELSKLHV HGEWKLECRP AQQHGIRGTR IDVLTEEEGH SGSSHAHHHR TMADIRKLIE TSALSNTVKR TALSIFTLLA EAEARVHGTT SEEVHFHEVG AVDSIIDIAG AAICLEMLRV DTVFTGPVEL GSGTVTCQHG TMPVPAPATA LLAKHFRATL NGTTHEATTP TGAAYIAALA QPVPSPLAGR ITATGYGIGH RQGLPVPNIL RVMLVETEEE ETSPELLTEL CANIDDMTPE QTAYLAEKLM EAGALDTWQE SVCMKKGRLA VKVCALCQPE QTDRVREAFF RHSSTPGIRQ HGMLRHILRR ESAPVHTPHG TVHVKTSFMH GRPHYRKAEF EDCRILAEKT GLPLGQCQLM GLFPTSSHDN DATDSV