Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1001 |
Symbol | |
ID | 6274121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1192801 |
End bp | 1193706 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642613051 |
Product | short chain dehydrogenase |
Protein accession | YP_001877609 |
Protein GI | 187735497 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.903414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAAC GTCTATCCAA AAAAGTCATG ATATTGACGG GAGCCGGGCA AATCGGCATG GCCATTGCCC GCAGAATAGG AAGCGGCATG AAAATCGTCA TTGGCGACAA AAATATCGGG AACGCGCATG CCATTGCCCG GACGATGAAT CAGGCCGGAT TTGACACCAT CCCCTTAGAG ATGGATCTCT CTTCCCGGTA CTCCATCCTG CATTTGATTG CAGAAGCACA GAGGTACGGC AATATTTCCA TGCTCGTCAA TGCAGCGGGG GTTTCCCCCA GCCAGGCATC CGTTGAAACC ATTCTTAAAG TGGATCTCTA CGGAACCGCC GTATTGCTGG AAGAGGTGGG GAAAGTTATT TGCCCCGGCG GCTCGGGAGT AACTATCTCC AGCCAATCCG ATCACCGCAT GCCCGCTCTG ACTGCCGAAC AGGACGAACA ACTGGCCATG ACTCCAACAG AGGAATTGCT GAACCTCGAA CTCCTCCAAC CCGGAAATAT CAAGGACACT CTGCATGCCT ACCAGATGGC GAAACGCTGC AACGTCAAGC GTGTCATGGC GGAAGCCGTC AAATGGGGTG CCAAAGGCGC ACGCATCAAC TCCATTTCCC CGGGCATTAT AGTCACCCCT CTGGCAATCG ATGAATTCAA CGGTCCCAGA GGTGATTTTT ACAGAAACAT GTTTGCGAAG TGCCCTGCAG GAAGACCCGG TACGGCGGAT GAAATAGCCC ATGTAGCGGA ATTGCTGATG GGCGGCAAGG GGGCTTTCAT CACCGGCGCG GACTTCCTGA TTGACGGGGG AGCCACCGCC TCCTATTTCT ACGGTCCGTT GAAACCGCAG ATTCAAAAGA GAAAACCTCT CAGACAATCG AAAACGGCAG GCAAAGCACA AAATGAAGCT TACTGA
|
Protein sequence | MEQRLSKKVM ILTGAGQIGM AIARRIGSGM KIVIGDKNIG NAHAIARTMN QAGFDTIPLE MDLSSRYSIL HLIAEAQRYG NISMLVNAAG VSPSQASVET ILKVDLYGTA VLLEEVGKVI CPGGSGVTIS SQSDHRMPAL TAEQDEQLAM TPTEELLNLE LLQPGNIKDT LHAYQMAKRC NVKRVMAEAV KWGAKGARIN SISPGIIVTP LAIDEFNGPR GDFYRNMFAK CPAGRPGTAD EIAHVAELLM GGKGAFITGA DFLIDGGATA SYFYGPLKPQ IQKRKPLRQS KTAGKAQNEA Y
|
| |