Gene Amuc_0193 details


Gene Information

Locus tag: Amuc_0193
Symbol:
ID: 6275350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 242365
End bp: 243591
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 60%
IMG OID: 642612239
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_001876818
Protein GI: 187734706
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
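
The coordinate and length fields above are related by simple arithmetic. The sketch below is one way to check them, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; both assumptions are consistent with the values in this record. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(242365, 243591)
print(length)                     # 1227
print(protein_length_aa(length))  # 408
```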


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.70976
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.651856
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGACA CAGCAACCAT TCGCCCGCAG TTCCCCATTC TGGAAACCAG CGTGCACGGC 
AAGCCTCTCA TTTACCTGGA TAATGCGGCC ACTACGCAAA AACCCCTGGC CGTGCTGGAC
GCCATCCGTC ACTACTACGA TACGGAGAAC GCCAATGTGC ACCGCGGCTC CCACTACCTG
AGCCAGCTCG CGACGGAAGC GCATGAAGAA TCGCGGGAAA CGGTGGCGCG GTTCATCAAC
GCGCCGGAAA CGGCGGAAGT CCTGTTCACC TCCGGCTGTA CGATGGGCAT CAACCTGGCG
GCGGATACCA TCGCCGGGTC CGGCATGGTC AAACCGGGAG ACGAAGTCAT CGTAACCGCT
TCCGAACACC ATTCCAATAT CGTTCCCTGG CAAATGCTGT GCGAACGCAC GGGCGCCGTC
CTGAAAGCGG TTCCCCTGAC GCCGGGCCAG ACCCTGGACA TGGAAGCTTA CCGGAACATG
CTTTCCCCAC GCACCCGCAT CGTAGCTGTG GGACACGTTT CCAATACGCT GGGAACGGTT
AATCCCGTGA GGGAAATGGC CGCGCTCGCC AAGGAGAACA GGCAGGAAAC CATCGTGCTG
ATTGACGGAG CCCAGGCTGT TTCCCATATG AATGTGGACG TTCAGGAACT GGGCTGCGAC
CTGTATGCCT TTTCCGGCCA CAAGCTGTAC GCACCCACCG GCATCGGCGC GCTGTGGGGA
AAAAGGGAGC TGCTGGAAAA ACTGCCGCCG TGGATGGGCG GCGGGGAAAT GATCAAGGAA
GTCACCTTTG AAAAAACCGT TTACAACGAC ATCCCGTTCA AATATGAAGC GGGAACGCCC
AACATTGGCG GGGCAGTGGG TCTGGCGGCC GCCATCCGCT ACGTCTCCGG GCTGGGTCTG
GACAACATTG CCGCCCATGA ACAGAAACTG ACGGATATGG CGGTGGAAGG CCTGAAGGCC
ATGCCGCGCC TGACCGTACT GGCCCCGGAC GTGCCGCACA GCGCCGTGGT CTCCGTCCTG
GCGGAGGGCG TCCACCACTA TGACCTGGGT ACGCTGCTGG ACCAGATGGG AATTGCCGTA
AGAACCGGGC ACCATTGCTG CCAGCCGCTC ATGTGCGCCC TGGGCACCAC CGGGACTACC
CGCGCCTCCT TTGCCCTGTA CAATACGGAA GAGGAAGTGC AGACCTTCCT CAAATCCATG
AACCGGGCGC TGGACATGCT CTCCTGA
 
Protein sequence
MLDTATIRPQ FPILETSVHG KPLIYLDNAA TTQKPLAVLD AIRHYYDTEN ANVHRGSHYL 
SQLATEAHEE SRETVARFIN APETAEVLFT SGCTMGINLA ADTIAGSGMV KPGDEVIVTA
SEHHSNIVPW QMLCERTGAV LKAVPLTPGQ TLDMEAYRNM LSPRTRIVAV GHVSNTLGTV
NPVREMAALA KENRQETIVL IDGAQAVSHM NVDVQELGCD LYAFSGHKLY APTGIGALWG
KRELLEKLPP WMGGGEMIKE VTFEKTVYND IPFKYEAGTP NIGGAVGLAA AIRYVSGLGL
DNIAAHEQKL TDMAVEGLKA MPRLTVLAPD VPHSAVVSVL AEGVHHYDLG TLLDQMGIAV
RTGHHCCQPL MCALGTTGTT RASFALYNTE EEVQTFLKSM NRALDMLS
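
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below strips that layout, computes a GC fraction, and translates DNA with the codon assignments shared by the standard code and translation table 11. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the example is pure Python rather than the API of any particular library. It treats every codon by its elongation meaning, so alternative start codons allowed by table 11 are not converted to methionine; this gene starts with ATG, so that does not matter here.

```python
BASES = "TCAG"
# Amino acids in the conventional NCBI codon order (first base T, C, A, G;
# then second base; then third base). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGCTTGACA CAGCAACCAT TCGCCCGCAG TTCCCCATTC TGGAAACCAG CGTGCACGGC"
last_line = "AACCGGGCGC TGGACATGCT CTCCTGA"

print(translate(first_line))  # MLDTATIRPQFPILETSVHG
print(translate(last_line))   # NRALDMLS
```

The two printed fragments match the beginning and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the whole protein sequence and the GC content field.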