Gene Amuc_1964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1964 
Symbol 
ID6274981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2383131 
End bp2384459 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content57% 
IMG OID642614026 
ProductHomoserine dehydrogenase 
Protein accessionYP_001878558 
Protein GI187736446 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.052998 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.000903796 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGGAAA AACCTATACA ACTTGGGCTT GCCGGGCTGG GAACGGTCGG TTCCGGCGTT 
TATGAAACGC TGTGCCGCAA TCACGCCCTT CTGGAAGCCC GGAGCAAGAT TCCTTTCAGG
CTCAAGCGCA TCGCCGTGCG CAACCTGGAA AAGCCCAGGG AAACCGTCGT TCCCCGGGAA
CTGCTGACGG ACGATTGGCA GGATTTGGTA AACGACCCGG AGATAGACAT CATTATTGAG
CTGATTGGCG GAACGCGCCA GGCTTACGAT CTGGTGACGC TGGCCCTCCG TGCCGGCAAG
CCCGTCGTTA CGGGGAACAA GGCTCTTCTG GCCGAATATG GAGCGGAGAT TTTCAAGCTT
TCCGCGGAAA TGGGCACCCC CATTTATTTC GAAGCGTCCG CCGGCGGCGG CATCCCCATT
ATCCAGAGCT TGCAGAATTC CCTGATTTGC AACCACATCA ATTCCATTGT GGGCATTATC
AACGGAACGT CCAACTATAT TCTCTCCGCC ATGGGGGAAC ACGGGGCCGA TTACGCGGAC
GCCCTGGCCC AGGCCCAGAA GCTGGGCTTC GCGGAAGAAG ACCCCTCCCT GGACGTCAAC
GGCTGGGACG CCGCCCATAA GGCGCTCATC CTGGCGATGC TGGCCTACGG AACAACCATT
TCCCCGGATA AAATTTACGT CAGAGGCATT GAGAATATCA CCAGCCGGGA TTTCGAATTT
GCCAAAAAAC TGGGCTATAC CATCAAGCTC CTTGTCGTCA TCCGCTACCA TGAGGGGCAG
GAAGACGCTC TGGAACTGCG TGTCCAGCCC TGTTTTGTCC ATGACTGGCA TATCCTGGCT
TCCGTGAACG GCGTGTTCAA CGCTATTTCC GTCAACGGGG ACATTGTGGG GGAAACGCTG
TTTTACGGCC GGGGAGCGGG CAAGAACCCC ACAGCTTCCG CCGTCATCAG CGACATCATC
ACCGCCATGC GCGAAAGCCG CTATCCGGAA TACCATACGG GCTTCAATCC CTATGCCAAG
GCCTGCGGGA TCATGCATAT CAATGATACG GTCACTCCGT ATTACGTCCG CTTCCAGGTA
GCGGACCAGC CGGGAGTCAT TGCGGAAATA GCCCGCATTC TGGCCACCTT CGGCATCGGC
ATTTCCGCCA CCTCTTCCGC TCCCAGCCAT ATTGATGAAG GCGGAGCCCC CTGGAACGAC
CTCGTTTTCA TCCTCCATTC CTGCCCGTGG GGCCAGCTCC AGAAGGCTCT GGAGGAAATA
ACCCGCATTT CCTGCGTGGC GGCGGAGCCC CGTGTCCTGC GCATAGAACA TCTTCTTCCT
CAATCCTAA
 
Protein sequence
MTEKPIQLGL AGLGTVGSGV YETLCRNHAL LEARSKIPFR LKRIAVRNLE KPRETVVPRE 
LLTDDWQDLV NDPEIDIIIE LIGGTRQAYD LVTLALRAGK PVVTGNKALL AEYGAEIFKL
SAEMGTPIYF EASAGGGIPI IQSLQNSLIC NHINSIVGII NGTSNYILSA MGEHGADYAD
ALAQAQKLGF AEEDPSLDVN GWDAAHKALI LAMLAYGTTI SPDKIYVRGI ENITSRDFEF
AKKLGYTIKL LVVIRYHEGQ EDALELRVQP CFVHDWHILA SVNGVFNAIS VNGDIVGETL
FYGRGAGKNP TASAVISDII TAMRESRYPE YHTGFNPYAK ACGIMHINDT VTPYYVRFQV
ADQPGVIAEI ARILATFGIG ISATSSAPSH IDEGGAPWND LVFILHSCPW GQLQKALEEI
TRISCVAAEP RVLRIEHLLP QS