Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0193 |
Symbol | |
ID | 6275350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 242365 |
End bp | 243591 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642612239 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_001876818 |
Protein GI | 187734706 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.70976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.651856 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGACA CAGCAACCAT TCGCCCGCAG TTCCCCATTC TGGAAACCAG CGTGCACGGC AAGCCTCTCA TTTACCTGGA TAATGCGGCC ACTACGCAAA AACCCCTGGC CGTGCTGGAC GCCATCCGTC ACTACTACGA TACGGAGAAC GCCAATGTGC ACCGCGGCTC CCACTACCTG AGCCAGCTCG CGACGGAAGC GCATGAAGAA TCGCGGGAAA CGGTGGCGCG GTTCATCAAC GCGCCGGAAA CGGCGGAAGT CCTGTTCACC TCCGGCTGTA CGATGGGCAT CAACCTGGCG GCGGATACCA TCGCCGGGTC CGGCATGGTC AAACCGGGAG ACGAAGTCAT CGTAACCGCT TCCGAACACC ATTCCAATAT CGTTCCCTGG CAAATGCTGT GCGAACGCAC GGGCGCCGTC CTGAAAGCGG TTCCCCTGAC GCCGGGCCAG ACCCTGGACA TGGAAGCTTA CCGGAACATG CTTTCCCCAC GCACCCGCAT CGTAGCTGTG GGACACGTTT CCAATACGCT GGGAACGGTT AATCCCGTGA GGGAAATGGC CGCGCTCGCC AAGGAGAACA GGCAGGAAAC CATCGTGCTG ATTGACGGAG CCCAGGCTGT TTCCCATATG AATGTGGACG TTCAGGAACT GGGCTGCGAC CTGTATGCCT TTTCCGGCCA CAAGCTGTAC GCACCCACCG GCATCGGCGC GCTGTGGGGA AAAAGGGAGC TGCTGGAAAA ACTGCCGCCG TGGATGGGCG GCGGGGAAAT GATCAAGGAA GTCACCTTTG AAAAAACCGT TTACAACGAC ATCCCGTTCA AATATGAAGC GGGAACGCCC AACATTGGCG GGGCAGTGGG TCTGGCGGCC GCCATCCGCT ACGTCTCCGG GCTGGGTCTG GACAACATTG CCGCCCATGA ACAGAAACTG ACGGATATGG CGGTGGAAGG CCTGAAGGCC ATGCCGCGCC TGACCGTACT GGCCCCGGAC GTGCCGCACA GCGCCGTGGT CTCCGTCCTG GCGGAGGGCG TCCACCACTA TGACCTGGGT ACGCTGCTGG ACCAGATGGG AATTGCCGTA AGAACCGGGC ACCATTGCTG CCAGCCGCTC ATGTGCGCCC TGGGCACCAC CGGGACTACC CGCGCCTCCT TTGCCCTGTA CAATACGGAA GAGGAAGTGC AGACCTTCCT CAAATCCATG AACCGGGCGC TGGACATGCT CTCCTGA
|
Protein sequence | MLDTATIRPQ FPILETSVHG KPLIYLDNAA TTQKPLAVLD AIRHYYDTEN ANVHRGSHYL SQLATEAHEE SRETVARFIN APETAEVLFT SGCTMGINLA ADTIAGSGMV KPGDEVIVTA SEHHSNIVPW QMLCERTGAV LKAVPLTPGQ TLDMEAYRNM LSPRTRIVAV GHVSNTLGTV NPVREMAALA KENRQETIVL IDGAQAVSHM NVDVQELGCD LYAFSGHKLY APTGIGALWG KRELLEKLPP WMGGGEMIKE VTFEKTVYND IPFKYEAGTP NIGGAVGLAA AIRYVSGLGL DNIAAHEQKL TDMAVEGLKA MPRLTVLAPD VPHSAVVSVL AEGVHHYDLG TLLDQMGIAV RTGHHCCQPL MCALGTTGTT RASFALYNTE EEVQTFLKSM NRALDMLS
|
| |