Gene Amuc_2173 details


Gene Information

Locus tag: Amuc_2173
Symbol:
ID: 6274639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2650897
End bp: 2651907
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 54%
IMG OID: 642614233
Product: Redoxin domain protein
Protein accession: YP_001878761
Protein GI: 187736649
COG category: [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0526] Thiol-disulfide isomerase and thioredoxins
TIGRFAM ID:
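
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes one stop codon; the function names are illustrative and do not come from any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 2650897, End bp 2651907.
length = gene_length_bp(2650897, 2651907)
print(length, protein_length_aa(length))  # 1011 336
```

Under these assumptions the computed values agree with the reported 1011 bp and 336 aa.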


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCCT TTCCTGCCTT TATTTTATCC GGTATCATCG CCGCCATGGC GGTTCCCGCC 
TCCGCTCAGA CGGCGGAAAG CGCCAATGGC CCCAAAGTCA CCTATCCAGC CTTCAACGAC
GGCAGCCATA TCCACGGGCC GAAACTCAAA ACTTCCGACC TGAAAGGGAA AGTGGTGTTT
TTTGAATACT GGGGCATCAA CTGCCCGCCC TGCATCGCCA GCATGCCGCA TCTGCAGGAA
TTGCAGGAAA AATTCCAGTC CAAGGGCTTT ACTGTCATAG GCAGCCACAG CCAGCTCCCG
TCCCCCAGGG TCAAACAGTT TCTGGAAGAA AAGAAGATCA CTTTCCCCAT CTATCAGAGT
CTGAGCATTC CGGAGGCTCC CTGCCCGGGC GGATTGCCTC ATGCCGTTCT GATTGGAGCC
AACGGAAAAG TCGTGGCCAA GGGCTATCCT CCCCAGCTCT ATGACCTGGT AAAAAAGGAA
GTGATGAAGA TGGAACGCGG CCTTCCTATT CTGGAAGGAG TGGAACTGAA CAAATACAAA
TCGCTGGCCA AAACGGTCGT TTCCACCGGC AGCAACATCG AATCCAAAAT CACACCTCTG
AGGAAAAAAA CGAATGACGA GGAAGCGCAG GCCGTATGTG AAGCTTTTGA CGCATGGTTG
GAAAATACCA AGGAAATCGT GCAGGCCCGG ATCCAGTCCG ACCCTCTGGA AGCGGTAACG
GCCATCATGC GCCTCAAAAC GGCGGTTCCC TCCGTCAAGG AATTTGACGA ACCTCTGGCG
GCCCTGAAAG CGAACAGGGA TTTATCAAAA CTGGCCGACC TCAATAAAAA AATCTCCGCT
CTGGAACAGC GCAAGGCAAA AGGGCGCAAA ATATCGGAAT CCGACCTTAA ATCCCTGACG
CAGGCCGTGG ACAAATTCAC GGAGTCCGAC AACGAAGCCA CGCAAACCGC CGCCGCCAAC
CTGAAGAAGA ACCTCTCCTC CCTGGCCGCT CCGGAAACTC CCGGAAAATA A
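
The reported GC content can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted exactly as displayed (10-base blocks separated by spaces and line breaks); `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example on the first displayed line only; paste the full gene sequence
# to compare against the GC content field of the record.
first_line = "ATGAACGCCT TTCCTGCCTT TATTTTATCC GGTATCATCG CCGCCATGGC GGTTCCCGCC"
print(round(gc_percent(first_line), 1))
```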
 
Protein sequence
MNAFPAFILS GIIAAMAVPA SAQTAESANG PKVTYPAFND GSHIHGPKLK TSDLKGKVVF 
FEYWGINCPP CIASMPHLQE LQEKFQSKGF TVIGSHSQLP SPRVKQFLEE KKITFPIYQS
LSIPEAPCPG GLPHAVLIGA NGKVVAKGYP PQLYDLVKKE VMKMERGLPI LEGVELNKYK
SLAKTVVSTG SNIESKITPL RKKTNDEEAQ AVCEAFDAWL ENTKEIVQAR IQSDPLEAVT
AIMRLKTAVP SVKEFDEPLA ALKANRDLSK LADLNKKISA LEQRKAKGRK ISESDLKSLT
QAVDKFTESD NEATQTAAAN LKKNLSSLAA PETPGK
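
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand from the NCBI base ordering (TCAG); translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, which this sketch does not model. It assumes the sequence is in frame from its first base and contains only A, C, G, and T; `translate` is an illustrative name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA, stopping at the first stop codon.

    Whitespace from the 10-base display blocks is ignored. A codon with a
    character other than A, C, G, or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAACGCCT TTCCTGCCTT TATTTTATCC"))  # MNAFPAFILS
```

The output matches the first ten residues of the protein sequence shown above.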