Gene Amuc_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1109 
Symbol 
ID6273970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1324301 
End bp1325530 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content57% 
IMG OID642613160 
Producthistidine kinase 
Protein accessionYP_001877716 
Protein GI187735604 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCCA TCGACTACAT CCTAACCATC CTTATTCTGG CCTCCCTGTA CCTGAACTGG 
CATCTGGTCC AGGTATGTCG GTCCGCCATG AAAGCCCGGA AAAAAGCCTT GCGGGACGCT
CAGCGCCTGC TGAAGCGCGG GGAAGAAGCG CAGGAACAGG CCATCGCGGA CAAACGGCGC
TTCCTGGAAG CTCTGGGAGA GGCCTTCCTG CTCATCGGTC CATCCGGACA CATCGTGCTG
GCTAATACGC TGGCCAAAGA ACTCTTTCAG GAAGAAAAGC TGGAAGGGCG CAAAGTGGGG
GCCCTGGTCT GCAACCAGGA ATTGCTGGGG CATGTTCAGG AAGCATTCGA TACGGACGGC
CCCGTCACCA AGGAATTCAC GCTGAGCGCC GCCAATTCCC CCGGCGGCGT GCAAAACGGC
ATCACGGCGT GGCATCTGGA CAGCGCCATC ACGGACGCCC CAATCAGAGA AAAGCGCATC
CTGCTGCGCA ACATCACGCA GAACTACCTC ACCAACCAGA TGCGCCGGGA CTTCGTGGCA
AACGCCTCCC ACGAGCTGCG TACGCCCCTC ACCATCATCG TGGGATATCT GGAAAACCTG
ATGGAGGACG ATCTGGTGGA GGAAAGTCCC GGACTGGCCC GCAAATTCAT CGGAGTCATG
CACCAGAACA GCCAGAGGCT GATGAACATT ATTGAAGACA TGCTCATGAT CTCCAAACTC
GAATCAGGCC ACAAGGCGAT TCTGAAGGAG CAGTGGTTCC GCCTCACCTC CTGCGCGGAC
GACGTCTTCT CCCGTCTGGA TTCCATCCGG GAGAAAAAAC AGGCCGTCCT GCACATGGAC
ATTCCCACGG ATTGGGAACT TTATGGAGAT CCCTTTTACT GGACGCAAAT TCTGTTCAAT
TTGGTGGAAA ACGCCCTCAA GCAAAACACG GAGCCGGGAC TTTCCATTAC TGTGGCCGCC
GCCAAAACAC AGGACGCCTG CGTCATCACC GTCACGGATA CGGGCGTGGG CATTCCTGTG
GAAAGCATCC CCTTCCTCTT CAACCGCTTT TACCGGGTGG AAACCCACCA CTCCTCGGAA
ATCAAGGGAA CGGGCCTAGG CCTCTCCATT GTGAAACGCG CCGTGGAAGC CCACGACGGA
GCCATCACCG TCTCCAGCAT CCCCCACCGG GAAACTGTTT TTACCATCAC CATTCCCCTG
AAAAGGTTCC GGGAAGAAAA GGCGGCGTAA
 
Protein sequence
MSAIDYILTI LILASLYLNW HLVQVCRSAM KARKKALRDA QRLLKRGEEA QEQAIADKRR 
FLEALGEAFL LIGPSGHIVL ANTLAKELFQ EEKLEGRKVG ALVCNQELLG HVQEAFDTDG
PVTKEFTLSA ANSPGGVQNG ITAWHLDSAI TDAPIREKRI LLRNITQNYL TNQMRRDFVA
NASHELRTPL TIIVGYLENL MEDDLVEESP GLARKFIGVM HQNSQRLMNI IEDMLMISKL
ESGHKAILKE QWFRLTSCAD DVFSRLDSIR EKKQAVLHMD IPTDWELYGD PFYWTQILFN
LVENALKQNT EPGLSITVAA AKTQDACVIT VTDTGVGIPV ESIPFLFNRF YRVETHHSSE
IKGTGLGLSI VKRAVEAHDG AITVSSIPHR ETVFTITIPL KRFREEKAA