Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1516 |
Symbol | |
ID | 6274664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1810066 |
End bp | 1811703 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642613575 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_001878118 |
Protein GI | 187736006 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00955123 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.00000000236227 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCGCCCT CACGCAACAG ACATTCCGCC CAGAGAGTTT TTACCTGGGA CAAAATCAAA ATGGCCCTGG CGGCGGCGGA AGAAGGATTT TATATCTGGA ATATTAAAAC GGGCGTCATT CATTACACGG ACCGCTGCCT GACCATGATG GGGGCCAGCC GCAAGGAGAA AGCCCCCAAT ATCTTCACCC AGCCGGAGCT TACCATACAC GAGGAAGACC AGGCTTTTTT TTCCCAGGAA GTGCGCCGTT ATCTGGACGG CCACTCCCAT GTGCCCATGC GCATTGAAAT CCGGATGAAG AAGCTGAACT CCAAAAGCTG GAGCTGGGTC CGCGTCAACG GACTCGCCCG CAGGGACAAG CAGCGCCGGC CCGTTATGCT GGTGGGCGTA TGGGTCAATA TTACCCGGCG GAAAACAGCG GAACTGCGTG CTGCGGAGGA CAGGGACCTG TTTCATACCC TGATTGAACA CATTCCCGAC AGTATCTATT TCAAGAACCG GGAATCACGT TTCGTCCTGG CGAATACCGC CACAGCCAAT AAGCTGGGCG TACCGACGCC GGCAGACCTG ACCGGACGCA CGGACGATTA CTTTTTTGAC CAAACGATGT CCGATATTTC CCGCAAGGAG GAAATGGACA TCATGGTTAC CGGACGCCCC ATCCGCGCCC GCCTTCATCA TGAGACATGG CTCCACAAGG ATGATTCCTG GAGCCAGATC AGCAAGTTCC CGTGGTACGG ACGCAACGGA GAGCTAAAGG GAATTGTAGG TATTTCCAGT GATGTCACCA AACTGGTGAA GACTGAAATC AAGGCCACGG AAACGGCCCG CATCCTGGAG GAACGCAACA GAACTCTGGA AAAGGAAATA GATTTGGCCA GGGAAATCCA GTTCGCCCTG CTTCCCTACG AAATTCCCTC CCGCTCCCAT ACGGAACACG GCCTGACCCG CCATGCGGAT TTTCACCATA TTTTCACTCC TTCGGAGGGG GTTGCAGGAG ACTGGTTCGA TGCTTTTCCC GTAGGCAGTT CCGGCGTCGG CGCCATTGTC TGCGACGTGA TGGGCCACGG CATCCGCGCC GCCCTGATCG CCTCCATGCT CCGCGGCCTG ATGGAACAGT TGTCCCACCT GGCGGATAAC CCGGCGGCTT TCCTTACTTC CCTCAACCAC CAGCTCGCCA AAATCCTGCA ACGGGCGAAC ACCACCATGT TTGCCTCTGC CGTTTATATT TACCTGGATC TGGAAACCGG GGTCATGACG GCTTCTACAG CCGGGCATCC GCATCCCATC ATTATGGGAC CGGACGGAGT CGCCCGCAAA ATGCCGCTGC CCAAAGGAAT CGCCCTGGGG CTGCTGGATG ACGCTACGTA CCATAATGCC CAGTTTTCGC TCCTGGCGGG GTCCCGCATC CTGATGTACA CGGACGGCCT GACGGAAGCA GCCAACCAGG ACGGGGAAGA AATGGGCGTG GAAAGGCTGA TTGACTATTT CAATAATTCC TCCCCCCACA GCACCAAGGA TTTTGTCCAT CAGGCCCTTA CCTGCGTAGC CAAATTTACC GGCTGCACCA ATCAGGCCGA CGATATCTGC ATGCTGGGCA TCAGCTATTC CGAACACGAG GCAGAAACGG ACGGCTAA
|
Protein sequence | MPPSRNRHSA QRVFTWDKIK MALAAAEEGF YIWNIKTGVI HYTDRCLTMM GASRKEKAPN IFTQPELTIH EEDQAFFSQE VRRYLDGHSH VPMRIEIRMK KLNSKSWSWV RVNGLARRDK QRRPVMLVGV WVNITRRKTA ELRAAEDRDL FHTLIEHIPD SIYFKNRESR FVLANTATAN KLGVPTPADL TGRTDDYFFD QTMSDISRKE EMDIMVTGRP IRARLHHETW LHKDDSWSQI SKFPWYGRNG ELKGIVGISS DVTKLVKTEI KATETARILE ERNRTLEKEI DLAREIQFAL LPYEIPSRSH TEHGLTRHAD FHHIFTPSEG VAGDWFDAFP VGSSGVGAIV CDVMGHGIRA ALIASMLRGL MEQLSHLADN PAAFLTSLNH QLAKILQRAN TTMFASAVYI YLDLETGVMT ASTAGHPHPI IMGPDGVARK MPLPKGIALG LLDDATYHNA QFSLLAGSRI LMYTDGLTEA ANQDGEEMGV ERLIDYFNNS SPHSTKDFVH QALTCVAKFT GCTNQADDIC MLGISYSEHE AETDG
|
| |