Gene Amuc_1644 details


Gene Information

Locus tag: Amuc_1644
Symbol:
ID: 6274431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1986772
End bp: 1987911
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 58%
IMG OID: 642613704
Product: transglutaminase domain protein
Protein accession: YP_001878245
Protein GI: 187736133
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
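
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon of the gene is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1986772
end_bp = 1987911

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon encodes no amino acid

print(gene_length, protein_length)    # prints: 1140 379
```

Under these assumptions the computed values agree with the Gene Length (1140 bp) and Protein Length (379 aa) fields.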


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.258709
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTGC TGCCTTTCGC CCTTCTCCCG GGCCTGACCG CCGCTATTTG CGCCGCCCAT 
GACGCCATTC AGGACGGGGT GGAGTTTCTG CGTGCCTACA TGCCCGCGCA GGACAGGGGA
ACGGTTACGC AAGAAAGGCT GGTCCGGGAA GTCCGCCTCG CCCTGGCTGC ACGGAGCCAA
TTCCCCTGGG CTGCCCAGGT TCCGTGGGAA CTTTATGAAA ATAACGTTCT CCCTTATGCC
GTGGTCAACG AACCGCGCGA CGAATGGAGG GAGCAGTTCC ATCACCTCTT CGCACCGCTT
GTTTCCCCAT GCAAGACGGG GCGGGAAGCC GCCCTCGCCA TCGCCTCCCG CATCCAGAAA
ACCCTGAATG TACGCTATTC CACGGAGAGA AGAGTTCCCC ACCAGGGAGT CAAGGAATCC
CTGCAATCCG GCAAGGTCTC CTGTACGGGC CAAAGCATCC TGCTAATCTG CGCTCTTCGC
TCCGTGGGCA TTCCGGCCCG CATGGCGGGC GTTCTTACCT GGAACCACGT GCGCGGCAAC
CACAACTGGG TGGAAGCCTG GTGTGACGGA GAATGGAAAA TGCTGGAATA CAATGAAAAG
GACTTCAACA CCCCGTGGGT GATGTCCGCC ATCAGCATGC TGGATCCCCG CAAACCGGAG
AACGCCATTT ATGCCACCTC CTGGAAAAAA GAACCTTCAG GAGCCTTTTT CCCTATGATA
TGGGAAGCCC GCTACGACGA CAAACGGCAC GCGCTGGCTT TCCCTCCGGA AAGCCGTACC
GTCCCCGCCG TCAATATCAC GGACCGCTAC ATGAAACTGG CGAATGAATG GGTGGCGGCC
CAACCGGAAT ATGTGCCCGG CAGCCGGCTG ATGCTGGACA TCAGGGAAGA GAGAAAAAAC
GGTGCCAGAA GGCTTCCCTT GCACGTCGTC CTCAAATCGG AAGAAGGGAA AGTTCTGGCG
GAAGGCATTA CACCCGGACC GTCCGACGAC ATGAGGAAGT TTCTTGAGGT ACTCCTGCCG
GACAATATTT CCCGCGGCAT GCTGGAGTTC AAGCTGCCTG ACGGAACCGT GCGCCATGAA
CCTGTGGCAC ACACGGAGGC CCCGGTTCAG ATTTTGAACT TCTTCGTGTC CGCTCCATGA
 
Protein sequence
MNLLPFALLP GLTAAICAAH DAIQDGVEFL RAYMPAQDRG TVTQERLVRE VRLALAARSQ 
FPWAAQVPWE LYENNVLPYA VVNEPRDEWR EQFHHLFAPL VSPCKTGREA ALAIASRIQK
TLNVRYSTER RVPHQGVKES LQSGKVSCTG QSILLICALR SVGIPARMAG VLTWNHVRGN
HNWVEAWCDG EWKMLEYNEK DFNTPWVMSA ISMLDPRKPE NAIYATSWKK EPSGAFFPMI
WEARYDDKRH ALAFPPESRT VPAVNITDRY MKLANEWVAA QPEYVPGSRL MLDIREERKN
GARRLPLHVV LKSEEGKVLA EGITPGPSDD MRKFLEVLLP DNISRGMLEF KLPDGTVRHE
PVAHTEAPVQ ILNFFVSAP
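
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup, with a helper for GC percentage. The function names `clean_sequence` and `gc_percent` are hypothetical and are not part of the source database. Only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAATCTGC TGCCTTTCGC CCTTCTCCCG GGCCTGACCG CCGCTATTTG CGCCGCCCAT"
dna = clean_sequence(first_line)

print(len(dna))          # prints: 60
print(gc_percent(dna))   # GC percentage of this 60-base fragment only
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` would allow a comparison with the Gene Length and GC content fields, assuming the record's GC value is computed over the gene itself.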