Gene Amuc_1965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1965 
Symbol 
ID6274953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2384480 
End bp2385697 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content59% 
IMG OID642614027 
Productaspartate kinase 
Protein accessionYP_001878559 
Protein GI187736447 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0866474 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.000866025 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTCTTA TCGTTCAAAA ATTCGGGGGC AGCTCCGTCG GCACCATTGA CCGCATCCGC 
AATGTAGCGC GCCGCATCCA TGAAACCGCC AGGGAAGGCA ACCAGGTGGT CGCCGTCGTT
TCCGCCATGA GCGGCGTGAC GGACAAGCTG ATCGGGCTTG CCAGGGAATT GTCTGAAACG
CCTTGCGAAC GTGAACTTGA CGTGCTGATG GCCACCGGCG AACAGCAGTC CATCGCCCTG
CTCTGCATGG CCCTGCATGA ACTGGGTGAA AAAGCCATGT CCTTTACGGG GGCGCAGGCC
GGAATCACCA CCTTCGGCAG CCACACGCGG GGGCGCATCC ACAGCATTGA CCCGACGCTG
ATGAACAAGT ACCTGCAGGA AGGCAACATC CTTATCTGCG CCGGCTTTCA GGGGGTTACG
GAAGAAGGAA TGGTCCAGAC GCTGGGCCGC GGAGGTTCCG ACCTCTCCGC CATCGCCATC
GCGGCCGCTC TGAAAGCGGA CGTGTGCCAG ATTTTTACAG ATGTGGACGG CGTCTATACC
TGTGACCCCC GCGTGGTCAA AGACGCCAAG AAGATACAAA CCCTTTCATA TGACGAGATG
CTGGAAATGG CTTCCAACGG GTCCAAGGTG ATGCAGTCGC GTTCCGTGGA ATTCGCCAAA
AAATTCGGTG TCGTCTTTGA AGTTCGCAAC TCCATGAACA ACAACCCCGG TACAATCGTG
CAAGAAGAAA CTCCCTCCAT GGAAGCCGTC GTCATCCGCG GCATTTCCAT TGACCGCAAC
CAGGCCCGCG TCACCATTAC CGGCATTCCG GACCAAATCG GCTACACGGC CCAGATACTG
GGCGCCCTGG CAGAAGCGGA AATCAACCTG GATATGATTC TGGCCAATAC TGCCCACGAC
GGCTATGTCC GCCAGTCCTT TACGATGCCC TCCAACGAAC TGGGCCGCGC CCAAGCCGCC
CTTAAACCGG TCATGGCCGC CCTCGGCTCC ACCGTCAAGG TGGAAACGGA AGCGGGGCTG
GCCAAGCTTT CCCTGGTCGG CATCGGCATG CGTTCCCACT CAGGCGTGGG AGCCACCGCT
TTCAAGGCCC TGGCGGACGC CAACATCAAG ACCGGCATGA TTTCCACCTC GGAAATCAAG
ATTGCCGTGA TGGTGGACGA ATCCGATATT GAGGAAGCGG CCCGGGTCGT ACATAAGGCG
TTCAACCTGG GAGCCTGA
 
Protein sequence
MALIVQKFGG SSVGTIDRIR NVARRIHETA REGNQVVAVV SAMSGVTDKL IGLARELSET 
PCERELDVLM ATGEQQSIAL LCMALHELGE KAMSFTGAQA GITTFGSHTR GRIHSIDPTL
MNKYLQEGNI LICAGFQGVT EEGMVQTLGR GGSDLSAIAI AAALKADVCQ IFTDVDGVYT
CDPRVVKDAK KIQTLSYDEM LEMASNGSKV MQSRSVEFAK KFGVVFEVRN SMNNNPGTIV
QEETPSMEAV VIRGISIDRN QARVTITGIP DQIGYTAQIL GALAEAEINL DMILANTAHD
GYVRQSFTMP SNELGRAQAA LKPVMAALGS TVKVETEAGL AKLSLVGIGM RSHSGVGATA
FKALADANIK TGMISTSEIK IAVMVDESDI EEAARVVHKA FNLGA