Gene Amuc_1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1001 
Symbol 
ID6274121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1192801 
End bp1193706 
Gene Length906 bp 
Protein Length301 aa 
Translation table11 
GC content53% 
IMG OID642613051 
Productshort chain dehydrogenase 
Protein accessionYP_001877609 
Protein GI187735497 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.903414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAAC GTCTATCCAA AAAAGTCATG ATATTGACGG GAGCCGGGCA AATCGGCATG 
GCCATTGCCC GCAGAATAGG AAGCGGCATG AAAATCGTCA TTGGCGACAA AAATATCGGG
AACGCGCATG CCATTGCCCG GACGATGAAT CAGGCCGGAT TTGACACCAT CCCCTTAGAG
ATGGATCTCT CTTCCCGGTA CTCCATCCTG CATTTGATTG CAGAAGCACA GAGGTACGGC
AATATTTCCA TGCTCGTCAA TGCAGCGGGG GTTTCCCCCA GCCAGGCATC CGTTGAAACC
ATTCTTAAAG TGGATCTCTA CGGAACCGCC GTATTGCTGG AAGAGGTGGG GAAAGTTATT
TGCCCCGGCG GCTCGGGAGT AACTATCTCC AGCCAATCCG ATCACCGCAT GCCCGCTCTG
ACTGCCGAAC AGGACGAACA ACTGGCCATG ACTCCAACAG AGGAATTGCT GAACCTCGAA
CTCCTCCAAC CCGGAAATAT CAAGGACACT CTGCATGCCT ACCAGATGGC GAAACGCTGC
AACGTCAAGC GTGTCATGGC GGAAGCCGTC AAATGGGGTG CCAAAGGCGC ACGCATCAAC
TCCATTTCCC CGGGCATTAT AGTCACCCCT CTGGCAATCG ATGAATTCAA CGGTCCCAGA
GGTGATTTTT ACAGAAACAT GTTTGCGAAG TGCCCTGCAG GAAGACCCGG TACGGCGGAT
GAAATAGCCC ATGTAGCGGA ATTGCTGATG GGCGGCAAGG GGGCTTTCAT CACCGGCGCG
GACTTCCTGA TTGACGGGGG AGCCACCGCC TCCTATTTCT ACGGTCCGTT GAAACCGCAG
ATTCAAAAGA GAAAACCTCT CAGACAATCG AAAACGGCAG GCAAAGCACA AAATGAAGCT
TACTGA
 
Protein sequence
MEQRLSKKVM ILTGAGQIGM AIARRIGSGM KIVIGDKNIG NAHAIARTMN QAGFDTIPLE 
MDLSSRYSIL HLIAEAQRYG NISMLVNAAG VSPSQASVET ILKVDLYGTA VLLEEVGKVI
CPGGSGVTIS SQSDHRMPAL TAEQDEQLAM TPTEELLNLE LLQPGNIKDT LHAYQMAKRC
NVKRVMAEAV KWGAKGARIN SISPGIIVTP LAIDEFNGPR GDFYRNMFAK CPAGRPGTAD
EIAHVAELLM GGKGAFITGA DFLIDGGATA SYFYGPLKPQ IQKRKPLRQS KTAGKAQNEA
Y