Gene Amuc_1933 details

Gene Information

Locus tag: Amuc_1933
Symbol:
ID: 6275249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2345449
End bp: 2346867
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 56%
IMG OID: 642613993
Product: protein of unknown function DUF1111
Protein accession: YP_001878527
Protein GI: 187736415
COG category: [C] Energy production and conversion
COG ID: [COG3488] Predicted thiol oxidoreductase
TIGRFAM ID:
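
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that adds no residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(2345449, 2346867)
    print(length, "bp")                  # gene length from the coordinates
    print(protein_length(length), "aa")  # residues, excluding the stop codon
```

Under these assumptions the coordinates give 1419 bp and 472 aa, matching the Gene Length and Protein Length fields.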


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.242314
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.0976167
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCTT CATGCGTGAG CTGCCATGTA AACCGCGGCC GTGATTTCAC GCCCCCCGTC 
CTCAGGACGA AAGAAGGCGA GCCCGGGCCG GCTTACTATC TGGGCGGTGA AAAGGGAACG
GTGTTTAATG CCACCAGTAA AGCGTTTGAG CAGGAATCGC CGGCAATTAC TGAAGCCGGC
CTGACGAATC GTTTCAAGGC GGGCGAAATT ATTTTTGAAG GGAACTTCGT ACCCGTGAAG
CGCAATCGCT TCGGCGGTCT CGGCCCCACT TACATTAAGT CATCCTGTCT GGCCTGCCAC
CCCGGTTATG GCCGCGGACA GCGCACCGGC AATTTTGACA GGCAGTACGG CAACGGTTAT
CTGGCCTTTG TACATAATCC CGATGGCACC CCAGTTAAGG GGTACACGGG CATGCTGCAG
ACGAAAGCGG TTCCTCCTTT TGTGCCTTAT GCCAAGGGGG TGAAAATAGA ATGGCATGAT
TTTGTTGACC AGTACGGAAA CAAATACCCG GACGGAACGC CCTACAATGC GGGCAAGCCG
ACGGAAGGCA CGTTGACTTA TCCCACGGCA GATGTGATTG AGCCGTTGCT TCCGCTTCCG
GCCGATTACC GCGTGTCAAT CGAATCAACA ATCGGCATTT ACGGAACGGG GCTGCTTGAT
GCCATCCGTG ACGAGGATAT TATTGCCGAA TACAGGCGCC AGCAGAGCAT GACAGGCCCG
GTGAAGGGGA TTCCTGGCAA ATGGATTGAC GAGCCCGACG GTACCCGGCG CCTCGGAAAG
TTCACGTGGG ACTGCTCGCG CGCCACACTG GAAAACGGTC CTGGCGCCAA TGCGCTTTGG
AACGTGACGA ATGTAACGCG CAAGAACCGT CCGAACATCT ACATGACGCC CGAATGGCTC
GAAAAACAGA AGGAACTTGG CATTGATGTA AGCGGTCTTG AAGGCCCGCA GGAAGAGGAA
CTCTCAATGC AGCAGTATGA AGATTTCATG GTCTGGCATC GCGGGCTAGC CGTGCCTGCC
GCCCGCAACC TGGACAAGCC TGACGTGCGC CGCGGACAGG AACTCTTCAA TAAACTGGGG
TGTGCCGGTT GCCACAAGCC TGAATGGACA ACGGGGGAAT ATAAGCCGCT TCCCGGTTAT
GCAAACCAGA CCATCCGCCC CTATACGGAT ATGCTGCGTC ACGATATGGG GGAAATCAAC
CGCGGACGTT CTCGTTTCTG GCGTACGCCG CCCCTCTGGG GAAGGGGGCT GATGCACAAA
ACCGCCAATC ATACAGATAT GTTCCATGAC CTGCGCGCCC GCGACTTTGA AGAGGCTATC
CTGTGGCATT TCGGTGAAAG CGAATTCTCC CGTGAAATGT TCCGCCATCT CTCCGCCGAA
GAGCGCGGCC AACTGATTCA ATTCCTGAAA GCACTTTAA
 
Protein sequence
MTASCVSCHV NRGRDFTPPV LRTKEGEPGP AYYLGGEKGT VFNATSKAFE QESPAITEAG 
LTNRFKAGEI IFEGNFVPVK RNRFGGLGPT YIKSSCLACH PGYGRGQRTG NFDRQYGNGY
LAFVHNPDGT PVKGYTGMLQ TKAVPPFVPY AKGVKIEWHD FVDQYGNKYP DGTPYNAGKP
TEGTLTYPTA DVIEPLLPLP ADYRVSIEST IGIYGTGLLD AIRDEDIIAE YRRQQSMTGP
VKGIPGKWID EPDGTRRLGK FTWDCSRATL ENGPGANALW NVTNVTRKNR PNIYMTPEWL
EKQKELGIDV SGLEGPQEEE LSMQQYEDFM VWHRGLAVPA ARNLDKPDVR RGQELFNKLG
CAGCHKPEWT TGEYKPLPGY ANQTIRPYTD MLRHDMGEIN RGRSRFWRTP PLWGRGLMHK
TANHTDMFHD LRARDFEEAI LWHFGESEFS REMFRHLSAE ERGQLIQFLK AL
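
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The sketch below is an illustration only: it assumes the sequence is pasted in the 10-base-block layout used above, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs from the standard table in its permitted start codons). The names `clean_sequence` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-residue mapping for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str, stop_at_terminator: bool = True) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and stop_at_terminator:
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First and last blocks of the gene sequence shown above.
    print(translate("ATGACAGCTT CATGCGTGAG CTGCCATGTA"))
    print(translate("GCACTTTAA"))
```

Translating the first three blocks of the gene sequence yields the first ten residues of the protein sequence, and the final nine bases yield the last two residues followed by the stop codon.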