Gene Amuc_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1994 
Symbol 
ID6274124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2421413 
End bp2422462 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content58% 
IMG OID642614054 
Productserine/threonine protein kinase 
Protein accessionYP_001878586 
Protein GI187736474 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0495985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.225513 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAAG AAACACTGGT TTACCCGCTT GCGGACAATA CTGTTCTGCA AGACAAATAT 
ACGATATTGA GCGTGCTGAA CGCCGGCGGC TTCGGCATCA CGTACCTGGC TCTGGACAAC
CCCGGTTCCC GGTATGTCGT CATCAAGGAA TGCATGCCGG ACGCCTACGC CTGCCGGGAT
ATGGAAACCG GCGTTGTCCA TCCCCGGAAC GAACAGACCG CCGTCAATTT TTCCCAGAGC
GTTTCCAATT CCCGGCAGGA GGCTTCCGTC CTGTCCCAGC TCAACCATCC CGGCATTGTC
CAGGTGTTTG ACATGTTTGA CGCCAACGGC ACCTGCTATT ACGTCATGGA GAATATCCAG
GGACAGACCC TGTTCGACCT GATGACCACC ATGCACGCCA CCGGGCAAAC CATGGAACCG
GCCCAGGCCA CGGATCTTCT GTTCCGCCTG CTGGACATCC TGCACTACCT TCACTCCATG
GGTGTGTACC ATTGCGACAT CAAGCCGAGC AACATCTTCA TCCAGCCGGA CGGAACGCCC
AAGCTCATCG ACTTCGGCGC CGTGCGCACC AAGACCCTCC AGCATCAGGG GCTCGTCCAG
ATCACGCCCG GCTATACCCC GCCGGAATTC TACCCCGGAC GCCGGAGCGA AATAGGTCCC
TGGTGCGATA TGTACGAACT GGGCGCCACG TTCTACGAAT TGCTCACGGG CCAGGTTCCC
CATCCTGCGG ACCAGCGTTC CGTGGTGGAC CGCAACCCGA AAGTGACTAG TTACGCGGCC
CTGCGGAAAA CTTATCCCAT GAACTTCCTT TCCGGAATTG ACAAAGCCCT GTCGCCGGAC
GAACGCAACC GCTTCCATTC CGCCAAGGCA TGGAATGACT ATATCAACGC CATGGCGGCG
GCAGGCACCC TGCAGGCCGG CGGAGTATCG AGGAAGGCTC TTCCCCAGGC CAGGAAAAAA
TCTTCCGCAG GCACGGCTTT CCTCATCATC CTGCTCATCG CAACGGCAAT CGGCTGGGTA
TGCTGGAAAC AGGGTCTGCT TAACTTCTGA
 
Protein sequence
MEQETLVYPL ADNTVLQDKY TILSVLNAGG FGITYLALDN PGSRYVVIKE CMPDAYACRD 
METGVVHPRN EQTAVNFSQS VSNSRQEASV LSQLNHPGIV QVFDMFDANG TCYYVMENIQ
GQTLFDLMTT MHATGQTMEP AQATDLLFRL LDILHYLHSM GVYHCDIKPS NIFIQPDGTP
KLIDFGAVRT KTLQHQGLVQ ITPGYTPPEF YPGRRSEIGP WCDMYELGAT FYELLTGQVP
HPADQRSVVD RNPKVTSYAA LRKTYPMNFL SGIDKALSPD ERNRFHSAKA WNDYINAMAA
AGTLQAGGVS RKALPQARKK SSAGTAFLII LLIATAIGWV CWKQGLLNF