Gene Amuc_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2010 
Symbol 
ID6275769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2440271 
End bp2443576 
Gene Length3306 bp 
Protein Length1101 aa 
Translation table11 
GC content49% 
IMG OID642614069 
Producthypothetical protein 
Protein accessionYP_001878601 
Protein GI187736489 
COG category[S] Function unknown 
COG ID[COG3513] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.320871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.091538 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGTT CTCTCACTTT TTCGTTTGAT ATTGGTTATG CATCCATTGG ATGGGCTGTC 
ATTGCTTCCG CATCCCATGA TGATGCGGAT CCCTCGGTTT GCGGTTGCGG TACGGTTCTG
TTTCCGAAAG ATGATTGTCA GGCATTTAAA AGGCGTGAAT ACAGACGTTT GAGACGCAAT
ATCCGCTCCC GGCGCGTTCG CATTGAGCGT ATCGGCAGAT TGCTGGTTCA GGCGCAAATC
ATCACGCCGG AAATGAAAGA AACTTCCGGG CACCCCGCTC CCTTTTATTT GGCGTCAGAA
GCGTTAAAAG GGCATCGAAC TCTCGCCCCG ATTGAGCTTT GGCATGTTCT CCGCTGGTAT
GCTCATAACA GAGGGTACGA CAATAATGCC TCATGGTCTA ACAGCCTTTC AGAAGATGGC
GGGAACGGTG AGGATACCGA GAGAGTGAAG CATGCTCAAG ATTTGATGGA TAAACATGGG
ACGGCGACCA TGGCGGAAAC CATTTGCCGG GAGTTGAAAC TGGAAGAAGG CAAAGCGGAT
GCTCCGATGG AGGTTTCAAC GCCGGCTTAT AAAAATCTCA ATACCGCCTT TCCTCGCTTA
ATCGTGGAAA AGGAGGTACG GCGCATATTG GAGCTTTCCG CGCCTCTGAT TCCTGGGCTG
ACTGCGGAGA TCATAGAGTT GATTGCGCAG CATCATCCCC TGACAACGGA ACAGCGCGGC
GTGTTGCTTC AGCACGGGAT AAAATTGGCT CGGCGTTATC GTGGAAGTCT TTTGTTCGGG
CAGTTAATCC CCCGTTTTGA TAACCGCATC ATCAGCCGCT GCCCTGTCAC GTGGGCGCAG
GTGTATGAAG CTGAGTTGAA GAAAGGCAAT TCTGAGCAAA GCGCCCGTGA ACGGGCAGAA
AAACTATCCA AGGTGCCCAC GGCGAATTGC CCGGAATTTT ATGAATACCG CATGGCCCGG
ATTTTATGCA ATATCCGTGC AGACGGAGAA CCTCTTTCTG CAGAGATACG CAGAGAATTG
ATGAATCAGG CCCGACAGGA AGGCAAGTTG ACCAAAGCCT CTCTGGAGAA GGCTATTTCT
TCCCGTCTGG GAAAGGAGAC AGAGACTAAT GTAAGCAACT ATTTTACTTT GCATCCTGAC
AGCGAAGAGG CTCTTTACCT GAACCCTGCC GTGGAAGTTC TGCAAAGAAG CGGCATCGGG
CAAATTCTTT CGCCGTCTGT GTATCGAATT GCCGCCAATC GCCTGCGTCG CGGGAAGTCC
GTTACTCCAA ACTATTTGTT GAATTTGCTT AAGTCTCGTG GGGAATCTGG CGAGGCGTTG
GAAAAGAAAA TAGAGAAAGA ATCTAAAAAG AAAGAGGCGG ATTATGCCGA CACTCCGTTA
AAACCCAAAT ATGCGACGGG GCGTGCGCCG TATGCCCGCA CCGTCTTGAA AAAAGTGGTG
GAAGAAATTC TTGATGGAGA AGATCCGACG CGTCCTGCCC GGGGGGAAGC GCATCCGGAT
GGGGAACTGA AAGCGCATGA CGGTTGCCTG TATTGCCTCC TTGATACGGA TTCTTCCGTG
AATCAGCACC AGAAAGAGCG CCGTCTTGAT ACGATGACCA ACAACCACCT TGTGCGTCAC
CGTATGTTGA TTCTGGATCG CTTGCTGAAG GATCTGATTC AAGATTTCGC TGACGGGCAA
AAAGACAGAA TCTCCCGCGT TTGCGTGGAA GTTGGCAAGG AGCTGACGAC GTTTTCCGCC
ATGGACAGCA AAAAAATTCA GAGAGAACTA ACTCTGCGCC AGAAAAGCCA TACGGATGCC
GTCAATAGAT TAAAACGGAA GTTGCCGGGG AAAGCGCTTT CTGCCAACCT GATACGCAAG
TGCCGCATTG CCATGGACAT GAACTGGACA TGCCCGTTCA CCGGCGCAAC GTATGGCGAT
CATGAGCTGG AAAATCTGGA GCTGGAACAT ATCGTGCCCC ATTCTTTCCG GCAGTCTAAC
GCGCTTTCTT CTCTGGTTCT TACCTGGCCG GGAGTCAATA GGATGAAAGG TCAGCGCACC
GGGTACGACT TTGTGGAGCA GGAGCAGGAG AATCCTGTGC CGGATAAACC CAACCTGCAT
ATTTGTTCCC TGAATAATTA CAGGGAATTG GTTGAAAAGT TGGATGACAA GAAGGGGCAT
GAAGATGACC GCAGGCGCAA AAAGAAGCGC AAAGCCTTAC TGATGGTGAG GGGATTGTCT
CATAAACATC AATCACAAAA TCACGAGGCC ATGAAGGAAA TAGGCATGAC GGAAGGCATG
ATGACGCAGA GTTCCCACCT GATGAAACTG GCATGCAAGT CTATTAAAAC CTCTCTGCCG
GATGCGCACA TCGACATGAT TCCCGGCGCT GTTACTGCTG AAGTTCGCAA GGCGTGGGAT
GTTTTTGGGG TCTTTAAGGA ATTATGCCCG GAAGCTGCCG ACCCGGACTC CGGCAAGATT
CTTAAGGAAA ACCTGCGTTC TCTCACTCAT TTGCATCATG CCTTGGATGC CTGTGTGCTG
GGGCTTATTC CCTATATCAT ACCCGCTCAT CATAATGGTT TGCTGAGACG TGTTCTTGCC
ATGCGCCGAA TTCCGGAAAA ACTGATACCT CAAGTCAGGC CTGTTGCGAA TCAGCGTCAT
TATGTCCTGA ATGATGATGG ACGCATGATG TTGCGTGATC TTTCCGCCTC TCTTAAAGAA
AATATTCGTG AACAATTGAT GGAGCAGAGG GTCATTCAGC ATGTCCCTGC AGACATGGGC
GGCGCTTTAC TCAAGGAAAC CATGCAGAGA GTGCTTTCTG TTGATGGAAG CGGGGAGGAT
GCCATGGTTT CTCTTTCCAA AAAGAAAGAT GGGAAGAAGG AAAAAAATCA GGTAAAAGCA
AGCAAATTGG TCGGAGTGTT TCCGGAAGGC CCGTCAAAAT TGAAGGCTCT TAAGGCAGCC
ATAGAAATTG ATGGCAATTA TGGAGTGGCG TTAGATCCCA AGCCGGTGGT GATCAGACAT
ATTAAGGTGT TTAAGCGAAT CATGGCCCTG AAAGAACAGA ACGGCGGCAA GCCGGTGCGC
ATTTTGAAAA AAGGCATGTT GATTCATTTA ACCTCGTCTA AAGATCCCAA GCATGCAGGT
GTATGGAGAA TTGAATCCAT ACAGGATTCA AAAGGTGGCG TAAAATTAGA TCTTCAGAGA
GCGCATTGCG CTGTACCTAA AAATAAGACG CATGAATGTA ATTGGCGTGA AGTAGATCTC
ATTTCTTTAT TAAAAAAATA CCAGATGAAA AGATACCCTA CTTCTTATAC GGGAACTCCA
CGATAA
 
Protein sequence
MSRSLTFSFD IGYASIGWAV IASASHDDAD PSVCGCGTVL FPKDDCQAFK RREYRRLRRN 
IRSRRVRIER IGRLLVQAQI ITPEMKETSG HPAPFYLASE ALKGHRTLAP IELWHVLRWY
AHNRGYDNNA SWSNSLSEDG GNGEDTERVK HAQDLMDKHG TATMAETICR ELKLEEGKAD
APMEVSTPAY KNLNTAFPRL IVEKEVRRIL ELSAPLIPGL TAEIIELIAQ HHPLTTEQRG
VLLQHGIKLA RRYRGSLLFG QLIPRFDNRI ISRCPVTWAQ VYEAELKKGN SEQSARERAE
KLSKVPTANC PEFYEYRMAR ILCNIRADGE PLSAEIRREL MNQARQEGKL TKASLEKAIS
SRLGKETETN VSNYFTLHPD SEEALYLNPA VEVLQRSGIG QILSPSVYRI AANRLRRGKS
VTPNYLLNLL KSRGESGEAL EKKIEKESKK KEADYADTPL KPKYATGRAP YARTVLKKVV
EEILDGEDPT RPARGEAHPD GELKAHDGCL YCLLDTDSSV NQHQKERRLD TMTNNHLVRH
RMLILDRLLK DLIQDFADGQ KDRISRVCVE VGKELTTFSA MDSKKIQREL TLRQKSHTDA
VNRLKRKLPG KALSANLIRK CRIAMDMNWT CPFTGATYGD HELENLELEH IVPHSFRQSN
ALSSLVLTWP GVNRMKGQRT GYDFVEQEQE NPVPDKPNLH ICSLNNYREL VEKLDDKKGH
EDDRRRKKKR KALLMVRGLS HKHQSQNHEA MKEIGMTEGM MTQSSHLMKL ACKSIKTSLP
DAHIDMIPGA VTAEVRKAWD VFGVFKELCP EAADPDSGKI LKENLRSLTH LHHALDACVL
GLIPYIIPAH HNGLLRRVLA MRRIPEKLIP QVRPVANQRH YVLNDDGRMM LRDLSASLKE
NIREQLMEQR VIQHVPADMG GALLKETMQR VLSVDGSGED AMVSLSKKKD GKKEKNQVKA
SKLVGVFPEG PSKLKALKAA IEIDGNYGVA LDPKPVVIRH IKVFKRIMAL KEQNGGKPVR
ILKKGMLIHL TSSKDPKHAG VWRIESIQDS KGGVKLDLQR AHCAVPKNKT HECNWREVDL
ISLLKKYQMK RYPTSYTGTP R