Gene Amuc_1402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1402 
Symbol 
ID6275608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1675473 
End bp1676507 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content62% 
IMG OID642613459 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_001878007 
Protein GI187735895 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.759509 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAGT GTCCTGTTCC GGAGCCTGTT TCCGACCGTA TCCAGATGGC TCATGGCGGC 
GGAGGCCGCT TGATGAATGA GCTGATCCGT TCCGTGTTTC TGAGCGCCTT CGGGTGCCCT
TCCGGAGGCG TGCAGAATGA CGCCGCCGTA TTGGAGATTC CTCCGGGGAG GCTGGCGATG
ACGACGGACA GCTTTGTGGT GCAGCCTTTG GAGTTTCCCG GCGGCTCCAT CGGTTCACTG
GCGGTGCACG GAACGGTGAA TGATCTCGCC ATGAGCGGCG CGGAGCCGTT GTATTTGACG
GCGGGCTTTA TTCTGGAAGA AGGGCTTCCG CTGGAAGTTC TGGCCCGCGT GGCACAAGAT
ATGGCTGCCG CGGCCCGTGC GGCGGGCGTC CGTATTGTGA CGGGGGATAC AAAAGTGGTG
GAGCGCGGAA AGGGGGACGG CATTTACATT AATACTGCCG GGGTTGGCAT CGTGCGCCAT
GGGTTGGAGA TCAGCCCTTC TTCCGTTCGT CCGGGGGATT CCGTGCTGCT CAGCGGGGAT
TTGGGGAGGC ACGGCATGAC GATTATGAGC CTGCGCGCCG GGCTGTCTTT CGGAGACGGC
CTGGAAAGTG ATTCCGCTCC GTTGCATGAA TCCGTGGCCG CCGTCATTCG TGCCGGCATT
CCCGTGCATT GCCTGCGTGA CGTGACCCGC GGCGGGTTGA CCGCCACTCT TTCGGAGATT
GCGGAATCTG CTGGCCTGAC AGTGAAGCTG AATGAAATGT CCATTCCCGT GCGTGAGGAT
GTCAGGGCGG CGTGCGGGCT GTTGGGGCTG GACCCTCTTC AAGTGGCCTG TGAGGGACGT
TATCTGGCTG TTCTTCCACG GGAGCATGAG GAAGAGGCCC TGAACCTGAT GCGCGGCTGC
GGCGTATCTG CCGGAGCCTG CGTCATAGGC CGGGTGGAGG AATTGGGGAC GGCGCCCCTG
CTGATGACGG GACTTCTTGG AGTGGAGCGG GTGTTGACGA TGCCTTCAGG AATGCAGCTT
CCCCGCATCT GCTGA
 
Protein sequence
MFECPVPEPV SDRIQMAHGG GGRLMNELIR SVFLSAFGCP SGGVQNDAAV LEIPPGRLAM 
TTDSFVVQPL EFPGGSIGSL AVHGTVNDLA MSGAEPLYLT AGFILEEGLP LEVLARVAQD
MAAAARAAGV RIVTGDTKVV ERGKGDGIYI NTAGVGIVRH GLEISPSSVR PGDSVLLSGD
LGRHGMTIMS LRAGLSFGDG LESDSAPLHE SVAAVIRAGI PVHCLRDVTR GGLTATLSEI
AESAGLTVKL NEMSIPVRED VRAACGLLGL DPLQVACEGR YLAVLPREHE EEALNLMRGC
GVSAGACVIG RVEELGTAPL LMTGLLGVER VLTMPSGMQL PRIC