Gene Amuc_0623 details


Gene Information

Locus tag: Amuc_0623
Symbol:
ID: 6274203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 731325
End bp: 732635
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 54%
IMG OID: 642612674
Product: glycosyl hydrolase BNR repeat-containing protein
Protein accession: YP_001877241
Protein GI: 187735129
COG category:
COG ID:
TIGRFAM ID:
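
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 731325
END_BP = 732635
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks hold for this record: 732635 - 731325 + 1 = 1311, and 1311 / 3 - 1 = 436.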


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.815723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTGC ACCTATCATC CCTGGCAGCG TTGCTCCTGG CATCATACCT TCCGGCGCAG 
GCCACCGTAC CGGCCCATTC CCCTTCCACA GCGTTCATCC GGAGCGGCCT CCCGATCGTT
GATCTGGATC AATGGACAGA AGCCCAGGTA GTAGTCGACA AGGAAAAAGG AAAATACCTG
GGACATCCAA CGACGCTATT GCTGAAAGAC GGAAAAACCA TTCTGTGCGT TTATCCTAAA
GGGCACGGCT CCGGGGAAAT CATCCTGAAG AAATCCACGG ACGGAGGCAA AACATGGAGC
GAAAGGCTGC CGGTTCCAGA ATCATGGAAA ACCAGCAGGG AAGTACCCAC ACTATATGAA
ACGGAAGACT CCCGGGGCAA GCGCCGCATT CTGCTTTTCA GCGGCATTCA GGGGGGAAAC
AGAAACACAG CCCCCAGAAA CCGGATGGCG GTCAGTGAAG ACAACGGAAC AACATGGTCC
GAGCTGACCC CCATTCCCAA CCAGGTCGGA GGCATTGTTG TCATGAGTGA CCTGATTCCT
CTAAAAACGG GAAAAGGGCA TTATATGGCC TCCTACCATG CCAATGCCCG GGGCAAAGAC
GGACATGGAG AGTTTCACAC CATTGAACAA TATGTCACCT TTACGGAAGA CGGTGGGCTG
ACCTGGACCT CCCCCCAGGT CATTTTCCCA GGGACAAGGG ACATGCACCT GTGTGAGGGA
GGTTTTGTCC GCAGCCCGGA CGGAAAAACA ATCGCGCTGC TGTTACGGGA AAACAGCAGG
CACCATAACT CCCAGATCAT GTTTTCCGAA GATGAAGGTA AAACATGGAC TCCCCCCAAA
GAACTGCCGG CAGCCCTGTG CGGAGACCGC CACCAGATTC TTCCCCTGCC TGACGGAAGG
CTTCTGGTTC AATTCAGGGA TGCTCCCCCG ACCAGGAAGA AAGGGCAGGC CGCCAGCCCG
ACGGAAGGAG ACTGGGTAGC ATGGATAGGC CGGTGGGAAG ACCTGAAAAA CGGCACGGAA
GGCTCATATA AAATCCGTTT TAAGGACAAC CGCAACGGTT GGGACTGCGC CTACCCGGCC
GCCGAACTAT TGCCGGACCA TACCCTGGTA TGCACTACCT ACGGACACTT TGACAAAGGG
GAATTGCCAT ACATCCTCTC CGTCAGATTT AAAATCAGCG ATACGGACAA GATGGTCAAA
CAATATGCGG GGAACAATCA CCCCAAGATC AAAAATGACA CAGGAGCGGG AGAAACCGTT
TTTGACCCCA ATGAGCCGGA CTCCGTTAAT CGCCTTCTGA AACGTCCCTG A
 
Protein sequence
MNLHLSSLAA LLLASYLPAQ ATVPAHSPST AFIRSGLPIV DLDQWTEAQV VVDKEKGKYL 
GHPTTLLLKD GKTILCVYPK GHGSGEIILK KSTDGGKTWS ERLPVPESWK TSREVPTLYE
TEDSRGKRRI LLFSGIQGGN RNTAPRNRMA VSEDNGTTWS ELTPIPNQVG GIVVMSDLIP
LKTGKGHYMA SYHANARGKD GHGEFHTIEQ YVTFTEDGGL TWTSPQVIFP GTRDMHLCEG
GFVRSPDGKT IALLLRENSR HHNSQIMFSE DEGKTWTPPK ELPAALCGDR HQILPLPDGR
LLVQFRDAPP TRKKGQAASP TEGDWVAWIG RWEDLKNGTE GSYKIRFKDN RNGWDCAYPA
AELLPDHTLV CTTYGHFDKG ELPYILSVRF KISDTDKMVK QYAGNNHPKI KNDTGAGETV
FDPNEPDSVN RLLKRP