Gene Amuc_1117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1117 
Symbol 
ID6273954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1335469 
End bp1336605 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content59% 
IMG OID642613168 
Productbiotin and thiamin synthesis associated 
Protein accessionYP_001877724 
Protein GI187735612 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTTT CTGAAGAATT AAACGGTTTG ATGGATTGCC CCACTCCCCT GGTACGCCGT 
TTCATGGCGT TGCTGGAACC GGTTGACGAC GCCCGCCTGG AGGAGATGGC GCAGGAGAGC
CGGCGTCTGA CGCGGCTGCA TTTCGGCCGG ACCATCCGCC TGTTCGCCCC CATTTACCTG
TCCAACGAAT GCATCAATAA TTGCAAGTAC TGCGGTTTTT CCCGGGATAA TCCCATTATC
CGCACCACGC TTACGGTGGA TGAAGTGGTG CAGGAGGCCC GCTACCTGCA CGGCCTGGGG
CTGCGCAGCA TCCTGCTGGT GGCCGGGGAG CATCCCAAGT TCGTTTCCGA CGGGTATATG
CAGGAATGCC TGGACGCCCT GCATTCCTTT ATTCCATCCC TGGGGCTGGA GATAGGACCG
TTGCCGGACG ACCGTTATGC GGAGATCGTC CGCCACGGGG CGGAACAATT GGCCGTGTAT
CAGGAAACCT ATAACCGGGA AGTGTATGAA ACCCTGCATA CGGCAGGGAT GAAGAAAAAT
TTCAACTGGA GGCTGGACTG CCCGGAACGC GCCTACCAGG GCGGTTTCCG CCGCATTCAG
ATAGGAGCCT TGTTCGGGCT TTCTCCGTGG CGGCGGGAGG CCATGGCACT TGCCGTCCAT
CTGGATTACC TGCAGAAGCA TTGCTGGAAA TCCGCGCTTT CCGTGGCGTT TCCGCGCATG
CGTCCCTACG CCGGGAATTA CGAGTATGAA CCTGATCCGG ACTTGATGCT GGATGACCGC
CATTTTGTCC AGCTTATGGC CGCCCTGCGC ATCTGTTTCC CCAAGATAGG CATGTCCATC
AGCACCCGTG AGCCCGCGCC GATGAGGAAT GCCCTGATGC ATTTGGGCAT GACCCACATG
TCCGCCATTG CGCGCACGGA ACCGGGGGGC TACACGGGCG TGGGAACGGC TGCCGCCCAT
TTGACGGTGC GGGGCAACCG GGTGGATCTT CCCGATGGCC GGAAAGGGAA TTGCAAGGCG
ACGGAGCAGT TTGAGATTTC CGACCAGCGC ACACCGGAGC AGGTGGTCGG AGCCATACGG
AATGCCGGGC TGGAGCCTGT CTGGAAAGAC TGGGATGCCG CTCTGGATGT GGTGTAG
 
Protein sequence
MSFSEELNGL MDCPTPLVRR FMALLEPVDD ARLEEMAQES RRLTRLHFGR TIRLFAPIYL 
SNECINNCKY CGFSRDNPII RTTLTVDEVV QEARYLHGLG LRSILLVAGE HPKFVSDGYM
QECLDALHSF IPSLGLEIGP LPDDRYAEIV RHGAEQLAVY QETYNREVYE TLHTAGMKKN
FNWRLDCPER AYQGGFRRIQ IGALFGLSPW RREAMALAVH LDYLQKHCWK SALSVAFPRM
RPYAGNYEYE PDPDLMLDDR HFVQLMAALR ICFPKIGMSI STREPAPMRN ALMHLGMTHM
SAIARTEPGG YTGVGTAAAH LTVRGNRVDL PDGRKGNCKA TEQFEISDQR TPEQVVGAIR
NAGLEPVWKD WDAALDVV