Gene Information |
Locus tag | Amuc_1117 |
Symbol | |
ID | 6273954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1335469 |
End bp | 1336605 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642613168 |
Product | biotin and thiamin synthesis associated |
Protein accession | YP_001877724 |
Protein GI | 187735612 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
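
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected protein length for an unspliced CDS whose length
    includes one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Amuc_1117.
length_bp = gene_length(1335469, 1336605)
print(length_bp)                  # 1137
print(protein_length(length_bp))  # 378
```

Both results agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields.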
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 78 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTTT CTGAAGAATT AAACGGTTTG ATGGATTGCC CCACTCCCCT GGTACGCCGT TTCATGGCGT TGCTGGAACC GGTTGACGAC GCCCGCCTGG AGGAGATGGC GCAGGAGAGC CGGCGTCTGA CGCGGCTGCA TTTCGGCCGG ACCATCCGCC TGTTCGCCCC CATTTACCTG TCCAACGAAT GCATCAATAA TTGCAAGTAC TGCGGTTTTT CCCGGGATAA TCCCATTATC CGCACCACGC TTACGGTGGA TGAAGTGGTG CAGGAGGCCC GCTACCTGCA CGGCCTGGGG CTGCGCAGCA TCCTGCTGGT GGCCGGGGAG CATCCCAAGT TCGTTTCCGA CGGGTATATG CAGGAATGCC TGGACGCCCT GCATTCCTTT ATTCCATCCC TGGGGCTGGA GATAGGACCG TTGCCGGACG ACCGTTATGC GGAGATCGTC CGCCACGGGG CGGAACAATT GGCCGTGTAT CAGGAAACCT ATAACCGGGA AGTGTATGAA ACCCTGCATA CGGCAGGGAT GAAGAAAAAT TTCAACTGGA GGCTGGACTG CCCGGAACGC GCCTACCAGG GCGGTTTCCG CCGCATTCAG ATAGGAGCCT TGTTCGGGCT TTCTCCGTGG CGGCGGGAGG CCATGGCACT TGCCGTCCAT CTGGATTACC TGCAGAAGCA TTGCTGGAAA TCCGCGCTTT CCGTGGCGTT TCCGCGCATG CGTCCCTACG CCGGGAATTA CGAGTATGAA CCTGATCCGG ACTTGATGCT GGATGACCGC CATTTTGTCC AGCTTATGGC CGCCCTGCGC ATCTGTTTCC CCAAGATAGG CATGTCCATC AGCACCCGTG AGCCCGCGCC GATGAGGAAT GCCCTGATGC ATTTGGGCAT GACCCACATG TCCGCCATTG CGCGCACGGA ACCGGGGGGC TACACGGGCG TGGGAACGGC TGCCGCCCAT TTGACGGTGC GGGGCAACCG GGTGGATCTT CCCGATGGCC GGAAAGGGAA TTGCAAGGCG ACGGAGCAGT TTGAGATTTC CGACCAGCGC ACACCGGAGC AGGTGGTCGG AGCCATACGG AATGCCGGGC TGGAGCCTGT CTGGAAAGAC TGGGATGCCG CTCTGGATGT GGTGTAG
Protein sequence | MSFSEELNGL MDCPTPLVRR FMALLEPVDD ARLEEMAQES RRLTRLHFGR TIRLFAPIYL SNECINNCKY CGFSRDNPII RTTLTVDEVV QEARYLHGLG LRSILLVAGE HPKFVSDGYM QECLDALHSF IPSLGLEIGP LPDDRYAEIV RHGAEQLAVY QETYNREVYE TLHTAGMKKN FNWRLDCPER AYQGGFRRIQ IGALFGLSPW RREAMALAVH LDYLQKHCWK SALSVAFPRM RPYAGNYEYE PDPDLMLDDR HFVQLMAALR ICFPKIGMSI STREPAPMRN ALMHLGMTHM SAIARTEPGG YTGVGTAAAH LTVRGNRVDL PDGRKGNCKA TEQFEISDQR TPEQVVGAIR NAGLEPVWKD WDAALDVV
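
As a rough illustration of how the two sequence fields relate, the sketch below translates the start and end of the gene sequence. It assumes the 10-base grouping is purely cosmetic and uses the amino-acid assignments of translation table 11, which match the standard code for sense and stop codons. The `translate` helper is hypothetical and is not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are those shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGTCTTTTT CTGAAGAATT AAACGGTTTG"))  # MSFSEELNGL
# Last three codons of the gene sequence, ending in the TAG stop.
print(translate("GTGGTGTAG"))  # VV
```

The first output matches the first ten residues of the protein sequence, and the second matches its final two residues.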