Gene Amuc_0311 details


Gene Information

Locus tag: Amuc_0311
Symbol:
ID: 6275068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 366881
End bp: 368032
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 55%
IMG OID: 642612365
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_001876934
Protein GI: 187734822
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID: [TIGR00229] PAS domain S-box
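
The coordinates and lengths listed above appear mutually consistent. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 366881
end_bp = 368032

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # matches the listed Gene Length
print(protein_length, "aa")  # matches the listed Protein Length
```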


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.138677
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAAAAAG ACACCATTGA CAAGCTGATC GGCCGCATTG ACCACATCAA GGAGGAGGAT 
CTCCAGCGCT TTTTTGTCAA GCTGGCGGAA CAGCAGGGTT TTTTCCAGCA GGTATTTGAA
GCCATTCAGG AAGGCCTGAT CCTGCTGGAT AACAAGGGAA AAATTCTGTT CGTCAACAAG
GCTGCCCTTA AACTCTTTGA CAAGGAACGC GGGCAAATAA CCCCGGATGA CTTCTGTATT
TTCCTGGGCA GGGACTGCTC ATGGGACACG ATCCAGCAGA GCCAGACCGC CGTCTCCCGG
GACACGGAAA TCTTTTACCC GGAACACAGA TTCCTGAACA TTTTCATTTC CCCCATCGGC
AGCAAAAACC AGGGCCACCT GGTCCTGATA CGGGATGAAA CGCCCCGGCA TAAAAAAAAC
GCGGAAAACC TTGAGGCAGA ACGCCTGAAT GCCTTGACGC TGCTGGCCGC AGGAGTGGCT
CATGAAATAG GCAACCCCCT CAATTCCATA GGCCTCCATC TCCAGCTCCT TGCCAGGAAG
GCAAAGCAGC TGCCTCCAAA GTACCGGACG GACATGGAGG AATTGCTGAA AACAGCGGAG
AGCGAAACCA CGCGGCTGGA CGTTATCCTG AAGCAATTCC TCCAGGCCAT CCGCCCCACC
AGGCCCATCC GGGAACCCTA CAACATTGAA ACCATCCTCA TGGAAGTGCT GAAACTGCTG
GAACCGGAAA TCCAGCAGCG CGGCATCCAA ATCAACACGG ACCTCCAGCC CAGTCTTCCC
ATCCTCAGCC TGGACCCCGT CCAAATCAAA CAGGTCTTTT ACAACCTGAT CAAGAATGCC
TACCAGTCCA TCCCTCCGGA GGGAGGCACC ATTCTGCTCA AAAGCGGTTA TACGGATGAC
AGCGTATTCG TCACTGTGGC GGATACCGGC TGCGGTATTT CACCGGAGGT CATGGGCAGC
ATTTATGAAC CTTTCCTGAC CACCAAGTCC ACCGGCTCCG GACTGGGGCT GCTTATCGTT
CGCCGCATCG TGAAGGAGCA CGGCGGGTCC ATCACGCTGG CCAGCCAGCC GGGTCAGGGA
ACCACCATCA CGGTCTTCCT GCCGCGCGTG GAACGCACCA TCAGGCTCCT GCCATCCTCC
ATTCCCTCAT GA
 
Protein sequence
MKKDTIDKLI GRIDHIKEED LQRFFVKLAE QQGFFQQVFE AIQEGLILLD NKGKILFVNK 
AALKLFDKER GQITPDDFCI FLGRDCSWDT IQQSQTAVSR DTEIFYPEHR FLNIFISPIG
SKNQGHLVLI RDETPRHKKN AENLEAERLN ALTLLAAGVA HEIGNPLNSI GLHLQLLARK
AKQLPPKYRT DMEELLKTAE SETTRLDVIL KQFLQAIRPT RPIREPYNIE TILMEVLKLL
EPEIQQRGIQ INTDLQPSLP ILSLDPVQIK QVFYNLIKNA YQSIPPEGGT ILLKSGYTDD
SVFVTVADTG CGISPEVMGS IYEPFLTTKS TGSGLGLLIV RRIVKEHGGS ITLASQPGQG
TTITVFLPRV ERTIRLLPSS IPS
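
The protein sequence can be derived from the gene sequence by codon-by-codon translation. The sketch below is one way to do that and to compute GC content, not the method used to produce this record. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (the tables differ in permitted start codons, which this sketch ignores). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order; '*' marks a stop codon.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAAAAAAG ACACCATTGA CAAGCTGATC GGCCGCATTG ACCACATCAA GGAGGAGGAT"
last_row = "ATTCCCTCAT GA"

print(translate(first_row))  # first 20 residues of the protein
print(translate(last_row))   # final residues before the stop codon
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison against the listed protein sequence and GC content.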