Gene Amuc_2116 details


Gene Information

Locus tag: Amuc_2116
Symbol:
ID: 6275476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2578670
End bp: 2579791
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 58%
IMG OID: 642614178
Product: Alcohol dehydrogenase zinc-binding domain protein
Protein accession: YP_001878706
Protein GI: 187736594
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
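
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption about the database convention, not something the record states) and that the final codon of the CDS is a stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2578670
END_BP = 2579791


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1122 373
```

Under these assumptions the result agrees with the listed Gene Length (1122 bp) and Protein Length (373 aa).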


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.00796302
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAACAAA CCGCACGCGC CGCTGTCCTG ACAGCACCAA AGACATTTGA AATCCGTGAA 
TATCCCATTC CCGCCATCGG AGACGATGAA ATGCTGATCA AGGTGGAAGC CTGCGGCGTT
TGCGGAACGG ACGGCCACGA ATACAACCGG GACCCCTTCG GCCTCTGCCC CGTGGTCCTG
GGCCATGAAG GCACCGGGGA AATCGTCGCC ATGGGCAGGA ATATCACTAA AGACACCGCC
GGAAATCCCG TAGCGCTGGG GGACAAGATC GTCACCTGCA TCATTCCCTG CGGCACCTGT
GACGCCTGCC TGAATACTCC GGCCCGCACC AACCTGTGCG AAAATGTGGG CGTGTATGGC
CTGATGCCTG ACGACGACGT GCATCTGAAC GGCTACTTCG GGGAGTACCT CGTCATCCGC
AAGGGCTCCA CATTTTTCAA TGTTTCCGGC ATGACGCTGG ACCAGCGCAT TCTGGTGGAA
CCCGCCGCAG TGGTGGTCCA TTCCCTGGAA CGCGCCAAGT CCACCGGGCT CCTCAAGTTC
AATTCTGTGG TTCTCGTGCA GGGCTGCGGC CCCATCGGCC TTCTTCAAAT CGCCACGCTG
CGCACGCTGG GCATTGAAAC CATCATCGCT GTGGACGGCA ATGACTCCCG CCTGGAACTG
GCCAGGGAAA TGGGAGCCTC CCGCACGTAT AACTTCACCC GGTACGCGGA TCTGAACGAA
CTGCTGGATG CCGTGAAAAA GGACAACGGC GGCCGCCTGG CGGACTTCGT CTTTCAGTGC
ACGGGCGTAG GCAAGGCCGG GGCCAACGCC TGGAAGTTCG TGAAGCGCGG CGGCGGCCTG
TGCGAAGTGG GCTTTTTCAT GGATGGAGGG GAAAGCGTTA TCAACCACCA TTACGACCTC
TGCAACAAGG AGGTAACCGC CGTAGGCTCC TGGGTGTACT CCCCGCAGGA CTACCCGACC
ACATTCGACT TTCTGAAGCG AGCCTACGGC ATCGGCCTGC CGCTGACCAA GCTGATCTCC
CACCGCTTCA AGCTGGATGA AATCGCGGAA GCCCTGGAAA CCAACGTCCA GATGAAAGGC
ATCAAGATTG CCGTCATTTG TAATTGCAGT AAAAATATAT AA
 
Protein sequence
MQQTARAAVL TAPKTFEIRE YPIPAIGDDE MLIKVEACGV CGTDGHEYNR DPFGLCPVVL 
GHEGTGEIVA MGRNITKDTA GNPVALGDKI VTCIIPCGTC DACLNTPART NLCENVGVYG
LMPDDDVHLN GYFGEYLVIR KGSTFFNVSG MTLDQRILVE PAAVVVHSLE RAKSTGLLKF
NSVVLVQGCG PIGLLQIATL RTLGIETIIA VDGNDSRLEL AREMGASRTY NFTRYADLNE
LLDAVKKDNG GRLADFVFQC TGVGKAGANA WKFVKRGGGL CEVGFFMDGG ESVINHHYDL
CNKEVTAVGS WVYSPQDYPT TFDFLKRAYG IGLPLTKLIS HRFKLDEIAE ALETNVQMKG
IKIAVICNCS KNI
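
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch of how that might be done, together with a plain GC-fraction helper. The names clean_sequence and gc_fraction are hypothetical, and only the first and last rows of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, newlines) and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGCAACAAA CCGCACGCGC CGCTGTCCTG ACAGCACCAA AGACATTTGA AATCCGTGAA"
last_row = "ATCAAGATTG CCGTCATTTG TAATTGCAGT AAAAATATAT AA"

first = clean_sequence(first_row)
last = clean_sequence(last_row)

print(len(first), first[:3])  # 60 ATG
print(len(last), last[-3:])   # 42 TAA
```

Passing the full gene sequence through clean_sequence and then gc_fraction would give a value to compare against the GC content field. The result is deliberately not stated here, since the database may round or compute that figure differently.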