Gene Amuc_0005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0005 
Symbol 
ID6275245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp5246 
End bp6817 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content57% 
IMG OID642612045 
Producthypothetical protein 
Protein accessionYP_001876633 
Protein GI187734521 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.206267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAA CACCAGACAC CCCCGCCGCG GCCGCAGAAG CCTCCCAGCC CGCCGCCCGC 
AAGGTCAAAC GCCCCTGGAT GACCTGGATC AAGCTTTTCC TGCTGTTCCT TATCGTCGTA
TGCCTGAATT ACGTAGGCTG CCACGAGTAT TACCGCCGGG ACCTGACGGA AGACCAGCGT
TATGAAATTT CCCGTCAGAG CATCAACATG CTCCAGTCCC CGGAAATCCA GAAGCGCAAA
ACCCCCGTCA AAATCACGTT CGCTTTCCTG CGCACCACGC AGAACTACAC CCGCATGCGT
TCTCTGCTTG AGGAGTACGA ACGTTATTCC AACGGCAAGG TGAAGGTGGA GTATGTGGAT
CCCCTCCGCC AGCCGAACAA GGCCCGTGAA ATCTCCAATA TCTACGGAAT TGAATTCAAG
AAGAACCTGG TCATCATTGA TGCCCGGGAG GATACGGAAA AAGCGCTCAA GACGTTTGAA
GGCACCCAGG CGGACGCCGC CCACGTGCGC ATCCTGCCCG GAGACGCCTT CGTAGTATAC
GCACCCGGGC CGGACGGCAA AAGCATGAAG GCAGTGGCGC TCCAGATTGA AGACATGATG
ACTGCCGGCA TTTACGGAGC GGCCAACGGC GAACCTCGTA AAATTTATAT CGCGGCGGAT
AAGAGCAACT TCAACGAGTC CCTGAGCAAC AACCAGGAAG AAAGCATTTT CACGACGCTG
GGCAAAATCT GCCGTTCCGT CAACCTGCAG CTTGTTCCCA TCCGCATGAG CGGTCTGGAA
GAAATTCCGG AAGACGCCGC AGGATTCATG ATTATCGGTT CCAAATATGA TCTGTCCCCG
CAGGAGGCGG AAGTGCTCCA GTGGTACTGG GCGCGCCCGA ACGCCGCCAT TCTAATCATG
CTGGAACCCC AGAATGACAC ACCCAAACAG CTTTACCGCT TTCTCCGCGA ACAGGGGCTA
CGGCCCCAGA ATGACCGCGT GATGCTCCGC AACAGGGGCA ACCGTTCCGT TTTTGAAATT
AACTCCATTT TCGCCCCCTC CCTGAATTGC ACCCGTGAAT TCTGGAATTC CAGCACCGGA
CTGGAAGGGG AGAGCATCTC CCTCATTCTG GATTCCGACA ATGCGGCCAT GGAACAGAAG
CGCATTACGC CATACCCCCT CCTGGTCACA ACGGAGGATT ATTACGGAGA AACCAAATAC
AACCAGTTCC CTGCCCAGTT CGACGCAAGG GAAGACAATC CGGGCCCTCT GATGATCGGC
GCGGCCCTCA TCCGGGGGAA TGCCGGGGAC GTGAACCAGA ACAAGACTAC CGGGCGCCTG
GTTCTGCTTG GCAATACGGA CCTGCTCCAG CCCCGGCAAA TCAAACCGGA ACAGAGGGAT
TTCATGCGTA CGCTGATCGG CTGGATGACG GACCGTGAAG AATTGCGTGG CCTCGGCTCC
CGCCATGACC TGACCGTCAA GCTGAATCTG GATCGCAACG CCCTGGGCGT TTTGGAACTC
CTGACGAATA TCGGACTCCC CCTGCTGGCG CTGCTGATCG CCCTGATTAT CTGGAACACG
CGCCGTCATT AA
 
Protein sequence
MSETPDTPAA AAEASQPAAR KVKRPWMTWI KLFLLFLIVV CLNYVGCHEY YRRDLTEDQR 
YEISRQSINM LQSPEIQKRK TPVKITFAFL RTTQNYTRMR SLLEEYERYS NGKVKVEYVD
PLRQPNKARE ISNIYGIEFK KNLVIIDARE DTEKALKTFE GTQADAAHVR ILPGDAFVVY
APGPDGKSMK AVALQIEDMM TAGIYGAANG EPRKIYIAAD KSNFNESLSN NQEESIFTTL
GKICRSVNLQ LVPIRMSGLE EIPEDAAGFM IIGSKYDLSP QEAEVLQWYW ARPNAAILIM
LEPQNDTPKQ LYRFLREQGL RPQNDRVMLR NRGNRSVFEI NSIFAPSLNC TREFWNSSTG
LEGESISLIL DSDNAAMEQK RITPYPLLVT TEDYYGETKY NQFPAQFDAR EDNPGPLMIG
AALIRGNAGD VNQNKTTGRL VLLGNTDLLQ PRQIKPEQRD FMRTLIGWMT DREELRGLGS
RHDLTVKLNL DRNALGVLEL LTNIGLPLLA LLIALIIWNT RRH