Gene Amuc_1516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1516 
Symbol 
ID6274664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1810066 
End bp1811703 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content56% 
IMG OID642613575 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001878118 
Protein GI187736006 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00955123 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00000000236227 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCGCCCT CACGCAACAG ACATTCCGCC CAGAGAGTTT TTACCTGGGA CAAAATCAAA 
ATGGCCCTGG CGGCGGCGGA AGAAGGATTT TATATCTGGA ATATTAAAAC GGGCGTCATT
CATTACACGG ACCGCTGCCT GACCATGATG GGGGCCAGCC GCAAGGAGAA AGCCCCCAAT
ATCTTCACCC AGCCGGAGCT TACCATACAC GAGGAAGACC AGGCTTTTTT TTCCCAGGAA
GTGCGCCGTT ATCTGGACGG CCACTCCCAT GTGCCCATGC GCATTGAAAT CCGGATGAAG
AAGCTGAACT CCAAAAGCTG GAGCTGGGTC CGCGTCAACG GACTCGCCCG CAGGGACAAG
CAGCGCCGGC CCGTTATGCT GGTGGGCGTA TGGGTCAATA TTACCCGGCG GAAAACAGCG
GAACTGCGTG CTGCGGAGGA CAGGGACCTG TTTCATACCC TGATTGAACA CATTCCCGAC
AGTATCTATT TCAAGAACCG GGAATCACGT TTCGTCCTGG CGAATACCGC CACAGCCAAT
AAGCTGGGCG TACCGACGCC GGCAGACCTG ACCGGACGCA CGGACGATTA CTTTTTTGAC
CAAACGATGT CCGATATTTC CCGCAAGGAG GAAATGGACA TCATGGTTAC CGGACGCCCC
ATCCGCGCCC GCCTTCATCA TGAGACATGG CTCCACAAGG ATGATTCCTG GAGCCAGATC
AGCAAGTTCC CGTGGTACGG ACGCAACGGA GAGCTAAAGG GAATTGTAGG TATTTCCAGT
GATGTCACCA AACTGGTGAA GACTGAAATC AAGGCCACGG AAACGGCCCG CATCCTGGAG
GAACGCAACA GAACTCTGGA AAAGGAAATA GATTTGGCCA GGGAAATCCA GTTCGCCCTG
CTTCCCTACG AAATTCCCTC CCGCTCCCAT ACGGAACACG GCCTGACCCG CCATGCGGAT
TTTCACCATA TTTTCACTCC TTCGGAGGGG GTTGCAGGAG ACTGGTTCGA TGCTTTTCCC
GTAGGCAGTT CCGGCGTCGG CGCCATTGTC TGCGACGTGA TGGGCCACGG CATCCGCGCC
GCCCTGATCG CCTCCATGCT CCGCGGCCTG ATGGAACAGT TGTCCCACCT GGCGGATAAC
CCGGCGGCTT TCCTTACTTC CCTCAACCAC CAGCTCGCCA AAATCCTGCA ACGGGCGAAC
ACCACCATGT TTGCCTCTGC CGTTTATATT TACCTGGATC TGGAAACCGG GGTCATGACG
GCTTCTACAG CCGGGCATCC GCATCCCATC ATTATGGGAC CGGACGGAGT CGCCCGCAAA
ATGCCGCTGC CCAAAGGAAT CGCCCTGGGG CTGCTGGATG ACGCTACGTA CCATAATGCC
CAGTTTTCGC TCCTGGCGGG GTCCCGCATC CTGATGTACA CGGACGGCCT GACGGAAGCA
GCCAACCAGG ACGGGGAAGA AATGGGCGTG GAAAGGCTGA TTGACTATTT CAATAATTCC
TCCCCCCACA GCACCAAGGA TTTTGTCCAT CAGGCCCTTA CCTGCGTAGC CAAATTTACC
GGCTGCACCA ATCAGGCCGA CGATATCTGC ATGCTGGGCA TCAGCTATTC CGAACACGAG
GCAGAAACGG ACGGCTAA
 
Protein sequence
MPPSRNRHSA QRVFTWDKIK MALAAAEEGF YIWNIKTGVI HYTDRCLTMM GASRKEKAPN 
IFTQPELTIH EEDQAFFSQE VRRYLDGHSH VPMRIEIRMK KLNSKSWSWV RVNGLARRDK
QRRPVMLVGV WVNITRRKTA ELRAAEDRDL FHTLIEHIPD SIYFKNRESR FVLANTATAN
KLGVPTPADL TGRTDDYFFD QTMSDISRKE EMDIMVTGRP IRARLHHETW LHKDDSWSQI
SKFPWYGRNG ELKGIVGISS DVTKLVKTEI KATETARILE ERNRTLEKEI DLAREIQFAL
LPYEIPSRSH TEHGLTRHAD FHHIFTPSEG VAGDWFDAFP VGSSGVGAIV CDVMGHGIRA
ALIASMLRGL MEQLSHLADN PAAFLTSLNH QLAKILQRAN TTMFASAVYI YLDLETGVMT
ASTAGHPHPI IMGPDGVARK MPLPKGIALG LLDDATYHNA QFSLLAGSRI LMYTDGLTEA
ANQDGEEMGV ERLIDYFNNS SPHSTKDFVH QALTCVAKFT GCTNQADDIC MLGISYSEHE
AETDG