Gene Information |
Locus tag | Amuc_0666 |
Symbol | |
ID | 6273972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 783434 |
End bp | 784846 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642612718 |
Product | 3-isopropylmalate dehydratase, large subunit |
Protein accession | YP_001877284 |
Protein GI | 187735172 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR00170] 3-isopropylmalate dehydratase, large subunit |
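The coordinate and length fields in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The helper names `span_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information rows of this record.
START_BP = 783434
END_BP = 784846
GENE_LENGTH_BP = 1413
PROTEIN_LENGTH_AA = 470


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


length = span_length(START_BP, END_BP)
print(length, protein_length(length))  # 1413 470
```

Under these assumptions, both derived values match the Gene Length and Protein Length rows.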
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAAAA CGCTTTTCCA AAAAATCTGG GACGCTCATA CCGTCGGCAT CCTGCCGGAT GGGAGAACGC AAATGTTCAT CGCTACGCAC CTGCTGCATG AAGTCACCTC TCCGCAGGCT TTCGGAATGG TCCGGGACCT GGGCCTGACT GTGCGCCACC CGGAACGCAC CTTTGCCACT GTGGACCACA TCATTCCCAC AGACAACCAG GCGGAACCGT TCGCGGACGC CACGGCTGAC GCCATGATCA GGGAACTGCG CCGGAACTGC GCTGAAAACG GCATCCGCTT TTTCGACCTC CCTACCGGGC TCCAGGGCAT CGTGCATATG GTAGGGCCGG AACTCGGCAT CACTCAGCCG GGCATGACTA TCGTATGCGG AGACTCCCAT ACGGCCACCC ACGGAGCCTT CGGTGCCATT GCCATGGGCA TCGGCACCAC GCAGGTGCGC GATGTGCTGG CTACGCAGAC CCTGGCCCTC AGCCCGCTCA AGGTGCGCCG CATCAATGTG AACGGAAAGC TGGCCCCCGG CGTGCGCGCC AAGGATGTAG CCCTGCACAT CATCGGCCTT CTGGGAGCCA AGGGCGGCCT GGGCTTCGCC TACGAATACG GAGGCGAGGT CATTGACGCC ATGAGCATGG ACGAACGCAT GACCCTCTGC AACATGTCCA TTGAAGGCGC GGCGCGCTGC GGTTACGTGA ACCCTGACCG GACCACGGTG GAATACATCA AAGGACGCCT GTTCGCCCCC ACCGGCGCGG ACTGGGACAA GGCCGTGGAA CGCTGGCTGG GCTTTGCTTC CGACGCAGAT GCGGAATATG ATGAAATCGT GGAAATTGAC GGAGCTTCCA TTGAGCCTAC ATTGACATGG GGCATTTCTC CGGACCAGAA TACGGGCATC AGCGGCAGCA CTCCCAACCC ATCCGACGCA GCGGACGACG ATGAACGGAA GATGATCAAT GAAGCGCTGG AATACATGAA ATTCCCCGCG GACATGCCTC TTAAGGGGCT GCCGGTTCAA GTGTGCTTCG TAGGTTCCTG CACCAATGGG CGCATTTCAG ACTTCCGGGA AGTGGCCGCC CTCATCAAGG GTCGCCATGT GGCCCCCGGC ATCAGGGCGC TGGCCGTTCC CGGCTCCCAG ATGACTGCCC GGCAGTGTGA AGAGGAAGGC ATCGCGGACA TTTTCCGTGA AGCCGGCTTT GAATGGCGTC TGGCGGGTTG CTCCATGTGC CTGGCCATGA ATCCGGACAA GCTCCAGGGT GACCAGCTCT GCGCCAGTTC CTCCAACCGG AACTTCAAGG GCCGGCAGGG AAGCCCCACC GGACGCACCC TGCTGATGAG CCCGGCCATG GTGGCCGCCG CTGCTCTGAC CGGGAAAGTC TCCGATGCCC GCGAAGTGTT CTCCCTGAAT TAA
|
Protein sequence | MGKTLFQKIW DAHTVGILPD GRTQMFIATH LLHEVTSPQA FGMVRDLGLT VRHPERTFAT VDHIIPTDNQ AEPFADATAD AMIRELRRNC AENGIRFFDL PTGLQGIVHM VGPELGITQP GMTIVCGDSH TATHGAFGAI AMGIGTTQVR DVLATQTLAL SPLKVRRINV NGKLAPGVRA KDVALHIIGL LGAKGGLGFA YEYGGEVIDA MSMDERMTLC NMSIEGAARC GYVNPDRTTV EYIKGRLFAP TGADWDKAVE RWLGFASDAD AEYDEIVEID GASIEPTLTW GISPDQNTGI SGSTPNPSDA ADDDERKMIN EALEYMKFPA DMPLKGLPVQ VCFVGSCTNG RISDFREVAA LIKGRHVAPG IRALAVPGSQ MTARQCEEEG IADIFREAGF EWRLAGCSMC LAMNPDKLQG DQLCASSSNR NFKGRQGSPT GRTLLMSPAM VAAAALTGKV SDAREVFSLN
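The gene and protein sequences can be related by codon-by-codon translation. Below is a minimal, dependency-free sketch rather than the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The function names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
# Codon table built in the conventional T, C, A, G base order.
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError for codons containing anything other than A, C, G, T.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
head = "ATGGGGAAAA CGCTTTTCCA AAAAATCTGG"
print(translate(head))  # MGKTLFQKIW
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content row.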