Gene Amuc_0233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0233 
Symboltdh 
ID6275290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp291556 
End bp292593 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content54% 
IMG OID642612281 
ProductL-threonine 3-dehydrogenase 
Protein accessionYP_001876857 
Protein GI187734745 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR00692] L-threonine 3-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.43371 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGGCA TGAAAGCTCT TGTAAAAACG CAGGCTGGCC CCGGTTTGGA ATTGATGGAT 
GTTCCTATGC CGGAAGTCGG CCCGAATGAC GTCCTGATCA AAATTCATAA AACAGCCATT
TGCGGCACGG ATCTTCATAT TTGGAATTGG GATAAATGGG CCCAGCAGAC CATTCCGGTA
GGGATGCATG TGGGCCATGA GTTCTGCGGC GTGATTGAGT CCGTAGGTTC TTCCGTGACG
GAATACAAGC CCGGGGAGAT TGTTTCCGGT GAGGGGCATA TTGTCTGCGG CCATTGCCGC
AGCTGCCGTT CAGGGCAGAA GCACTTGTGC CCCAACACAA AGGGTGTGGG AGTCAACAGG
CCCGGCTGCT TTGCGGAGTA CCTTTCCATT CCGCAGGATA ACGTGGTGCG CATCCACAAG
AGCATTCCGA TGGAAATCGC CTCCATTTTT GACCCGCTGG GCAACGCCGT CCATACGGCT
TTGTCCTGGG ATCTGGTGGG CGAGGACGTA CTGATTACGG GAGCCGGGGT TATCGGCTGC
ATGGCTGCCG CCGTCTGCAA GAAGGCCGGA GCCAAGACGG TGGTTATTAC GGACATCAAT
GATTTCCGCC TGGGTCTTGC CAAAACGCTG GGGGCGGACC GAACCGTGAA CGTGACCCGT
GAAAAGCTGG AAGACGTGAT GAAGGAACTG GAAATGACGG AGGGATTTGA CGTGTGCCTG
GAAATGAGCG GCGCTCCGTC CTGCCTGAAG GACATCATCG ACAATTCCCG CAACGGAGCC
AACATTTCCC TGCTGGGGAT TCAGCCCGAT GGTTCCAGCA TCGAGTGGAA TAAGTTCATT
TGGAAAGGGT TGAAGATGAA AGGCATTTAT GGCCGTGAAA TTTTTGAAAC TTGGCATAAA
ATGGATTCCA TGATCCGCAG TGGCTTGAAT GTGGCGCCCA TCATCACGCA CCGTCTGCCC
TACACGGAAT TCCGGGAAGG GTTTGAAGCC ATGAATTCGG GAAAATCCGG CAAGGTTGTT
CTGGACTGGA TTGTTTGA
 
Protein sequence
MGGMKALVKT QAGPGLELMD VPMPEVGPND VLIKIHKTAI CGTDLHIWNW DKWAQQTIPV 
GMHVGHEFCG VIESVGSSVT EYKPGEIVSG EGHIVCGHCR SCRSGQKHLC PNTKGVGVNR
PGCFAEYLSI PQDNVVRIHK SIPMEIASIF DPLGNAVHTA LSWDLVGEDV LITGAGVIGC
MAAAVCKKAG AKTVVITDIN DFRLGLAKTL GADRTVNVTR EKLEDVMKEL EMTEGFDVCL
EMSGAPSCLK DIIDNSRNGA NISLLGIQPD GSSIEWNKFI WKGLKMKGIY GREIFETWHK
MDSMIRSGLN VAPIITHRLP YTEFREGFEA MNSGKSGKVV LDWIV