Gene Amuc_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1111 
Symbol 
ID6273966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1326464 
End bp1328140 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content58% 
IMG OID642613162 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001877718 
Protein GI187735606 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAGCG ATAGAGTTAA AGCAGGGTTC GAGCGTGCGC CGCACCGCAG TTTGATGCGT 
GCCACCGGAA TGACGGATGA GGATTTAAGC CGTCCTTTCA TTGCCATTTG CAATTCTTTT
AATGAAGTGA TTCCGGGCCA TGTCCACCTG AACAGGGTGG CCGCCCTCAT CAAGGAGGAG
GTACGCAAGG CCGGAGGAAC TCCCGTGGAA TTCAACCTTC CCGGCGTTTG TGACGGCATT
GCCATGGGCC ACGGCGGCAT GAAGTTTTCC CTGGCCAGCC GTGAGCTGAT CGCGGACAGC
GTAGAGACGA TGCTGAGCGC CCATGCGTTT GATGCTATGA TCTGCATCCC TAATTGCGAC
AAGATTGTTC CCGGCATGAT TATGGGCGCC CTGCGCTGCA ATATTCCCAC TATTTTCTGC
AGCGGCGGTC CGATGGCCGC CGGCATGGCG GAGGACGGCA CGGTGCTGGA CCTGAACAGC
GTGTTTGAGG CTGTCGCCCG CTTTAAGGCA GGCAAGATTA ATGAGGAGGA ACTTCATTCC
CTGGAATGCC GCGCCTGCCC CGGCGCCGGT TCCTGCTCCG GCATGTTTAC AGCCAATTCC
ATGAATTGCC TGAGCGAGGT GATCGGCCTG GCCCTTCCCG GCAACGGTTC CCTGCTGGCT
ACTTCAGAGG AACGAAAGGA GTTCTGGAAG CAGACTGCCC GCCGCGCCGT GGAGATGGCG
AAGGCGGACG GCCCCCTGCC GCGAGACATC GTAACCCGTG ACGCTATCGA CAATGCTTTC
ACAATTGATA TGGCCATGGG CGGCAGTTCC AATACCGTGC TCCATACGCT GGCTATCGCC
AGGGAAGCCG GCGTGGAGTA TGACCTCCAG CGCATCAATG ATATTTCTAG GCGAACCCCG
AATATTTGCA AGGTGGCTCC TTCCTCCCGC TTCCACATGC AGGATGTTCT GCGTGCAGGC
GGGGTGAGTG CCATCATTCA TGAAATTGCC AGAATTCCCG GAGCCCTTCA TCTGGACGCC
ATGACCGTCA GCGGGAAAAC GCTGGGTGAA ACAGTGGAGG GATGCGGCAT TGCGGATGAA
ACCGTGATTC ATCCGTTGGA AAATGCCTAT TCCCGTGATG GCGGCCTGGC GATTCTGTTC
GGCAATCTGG CTGAGGAGGG CGCTGTGGTG AAAAAGGCGG GTGTGCATCC GAATATGATG
AGTTTCCGCG GGCCCGCCGT GATTTTCGAG TCTCAGGAAG AGGCCTGCGA AGGCATCCTT
GCCGGGAAGG TGAAATCAGG CGATGTGGTT GTCATACGCA ATGAAGGCCC CAAGGGCGGC
CCCGGCATGC AGGAAATGCT GGCTCCCACT TCCTATATTA TGGGGCAGGG CCTTGGCGCG
GAAGTGGCGC TTATTACGGA CGGCCGTTTT TCCGGAGCCA CGCACGGAGC CTGTATTGGC
CATATTTCCC CGGAAGCGGC GGAAGGCGGC CTGATCGGCC TGCTGAGGAA CGGGGATATT
ATTGAGTATT CCATTCCGGA CCGCACGCTG AACGTCTGTT TGAGCGAGGA GGAGATCGCA
CGCCGCCGTG CGGATTGGAA ACCTACCTAT AACAGGGTTT CCTCCTCCTG GCTGAGCCGT
TACCGCCAGC TTGCCACGAA TGCCAGCAAG GGGGCAGTCC TCCGGCGCGG GGAATAA
 
Protein sequence
MRSDRVKAGF ERAPHRSLMR ATGMTDEDLS RPFIAICNSF NEVIPGHVHL NRVAALIKEE 
VRKAGGTPVE FNLPGVCDGI AMGHGGMKFS LASRELIADS VETMLSAHAF DAMICIPNCD
KIVPGMIMGA LRCNIPTIFC SGGPMAAGMA EDGTVLDLNS VFEAVARFKA GKINEEELHS
LECRACPGAG SCSGMFTANS MNCLSEVIGL ALPGNGSLLA TSEERKEFWK QTARRAVEMA
KADGPLPRDI VTRDAIDNAF TIDMAMGGSS NTVLHTLAIA REAGVEYDLQ RINDISRRTP
NICKVAPSSR FHMQDVLRAG GVSAIIHEIA RIPGALHLDA MTVSGKTLGE TVEGCGIADE
TVIHPLENAY SRDGGLAILF GNLAEEGAVV KKAGVHPNMM SFRGPAVIFE SQEEACEGIL
AGKVKSGDVV VIRNEGPKGG PGMQEMLAPT SYIMGQGLGA EVALITDGRF SGATHGACIG
HISPEAAEGG LIGLLRNGDI IEYSIPDRTL NVCLSEEEIA RRRADWKPTY NRVSSSWLSR
YRQLATNASK GAVLRRGE