Gene Amuc_1401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1401 
Symbol 
ID6274642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1674401 
End bp1675468 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content61% 
IMG OID642613458 
Producthydrogenase expression/formation protein HypD 
Protein accessionYP_001878006 
Protein GI187735894 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0409] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00075] hydrogenase expression/formation protein HypD 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.416582 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAGG AAGTTCGGCA GCTGATTGAA GAATTGCGCC ACGCTGTGAC GCGTCCGTGG 
GCGGTAATGG AGGTATGCGG AGGCCAGACG CATGCGATTG CTTCTCTGGG GCTGGAGGAG
TTGCTGCCTC CCGGCCTGCG CCTGATTCAT GGCCCCGGCT GTCCGGTCTG CGTTACGGCG
GTGGAGTTGA TTGACCAGGC CGTGGAATTA AGCCTGAGGC CCGGCGTGGT TTTGTGCAGC
TACGGGGATA TGATGCGGGT GCCGGGCTCC CGCGGGGATT TGTTTTCCGC CAAGGCCGGG
GGAGGGAACG TGTTTTTGAT GTATTCTCCG CTGGAAGCGG TCGCTTATGC CGGAAGCCAT
CCGGATACGG AAGTGGTGTT TTTCGCCGTA GGTTTTGAGA CGACGGCCCC GGCTACGGCT
CTTGCCCTGC AGCAGGCCCG CATGCTGGGG TATGCCAATT TTTCCGTTCT CTGCGCCCAT
GTGCTGGTCC CCCCGGCTCT GGAATGGCTG ATGGACCAGG ATGAGGGAAG GCCGGACGCT
TTTTTGGCTC CCGGCCATGT CTGCGCCGTC ACGGGGGAAA TGGATTACGT GCGTTTGGCC
GCCCGTTACC GGACGCCCAT GGTGGTGACG GGGTTTGAGG CTCCGGACCT GCTCCGCGGC
ATTCTGATGT GCGTCCGCCA GCTGGAAGCC GGGGAATATG TAGTGCGGAA TGCGTACGGA
CGTTATGTGA AGCCCGGAGG GAACAGGGCG GCACAGGAAA GAATGAATGA AGTTTTTGAG
CCGGAAGACC GCCGTTGGCG CGGTCTGGGG CTGATTCCCG GCGGCGGCAT GAGGCTGCGC
CGTGAGTGGA ATGACATGGA TGCCGTTCTG CGTTTTGAAT GCGGAAAGGG GAGGCCGGGA
GCAGATGAAG CTTCCGGATG CCTGGCCGGG CAGGTGCTGA GGGGGCTGAT TCGTCCTGGA
GAATGCCCTT TTTTCGGATC GTCCTGCACG CCGCTGGCCC CGCTGGGGGC TCCCATGGTG
TCCGGGGAAG GGGCTTGCGC CGCTTATTAT CATTATAAAA GAAGTTGA
 
Protein sequence
MQEEVRQLIE ELRHAVTRPW AVMEVCGGQT HAIASLGLEE LLPPGLRLIH GPGCPVCVTA 
VELIDQAVEL SLRPGVVLCS YGDMMRVPGS RGDLFSAKAG GGNVFLMYSP LEAVAYAGSH
PDTEVVFFAV GFETTAPATA LALQQARMLG YANFSVLCAH VLVPPALEWL MDQDEGRPDA
FLAPGHVCAV TGEMDYVRLA ARYRTPMVVT GFEAPDLLRG ILMCVRQLEA GEYVVRNAYG
RYVKPGGNRA AQERMNEVFE PEDRRWRGLG LIPGGGMRLR REWNDMDAVL RFECGKGRPG
ADEASGCLAG QVLRGLIRPG ECPFFGSSCT PLAPLGAPMV SGEGACAAYY HYKRS