Gene Amuc_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1020 
Symbol 
ID6274097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1211974 
End bp1213614 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content53% 
IMG OID642613069 
Producthypothetical protein 
Protein accessionYP_001877627 
Protein GI187735515 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATT ATATACCCCT GAACAAGGCG GTCGGCCCGA AGCTTATCTG CGACGGGGAG 
CAGGCTCTCA TCCCGCAGAA ACTCGGAGTT GACGCGATGG AGTTCAACTT TGACGGTTCG
GCGTGCGAGC ACGCCACTTG CGCCGGAGCG CCGGAAATGT TCATCAAGGT CGAGGAAGAC
GGCCTGTATT ATTTCGGCGT GGAAGCCGAC GACACAGGCA GCATCTCCAT TGCCAACGAG
AAACTTTGTG AGAAGGATGG AACGAAGCCC AACGGCAAGC TGAATCTGGC AACTGGTGCC
AGATACCTGA AAGCGGGCTA TTACAAGGTG GCTCTGTCGT GGACCAACAA CGCCTACCCC
CCCGTCAGCA ACAACGCTGC CGCGTTCAAC GTCACGATGG ACAGGGAACC CATTCAGGCA
GGGAAGTATG AAGGCAACTC GACGATGAAA CGCGAATTCT CGCCGTCCCC CAAAATCAAA
TTGTGGACGA TTGAAAAGGA ATCCACCATC ACCTGTGAAG AGTCCAGGAA AGTGGAGGTG
ACGCTCGAAG AACCTGCTCC TGTCGTTGAA GAAAGCGAAG GCATATGCGA AACGTATATC
ATTCCCCCCC AGATTTTTGT AACGGCTTGC AAGGATGAAG TGCTCGACGA GTGGAGACTG
CGCGTGCAGC AAGTTTCCGC CGGTTCCAGG ATTCTCTTGC GCACGGGAGG ATACAGGGAT
GCCCTTAAAA ATCCCCCTGT CACCGAAGAA GAAGCTATAG ATGCTGTCAA TGAAATGAAC
CGTTACCAGT CTCATAGACT CGGCGCATGG CATACTCAGG AGGCATCCCT CGCCCACGAG
GAACACCACC GCCGAGAATT TAATGACGCC TTCCAATTCT ACTGGGACAA TTTGAGAATA
CAGGATACGC TTGAAATGAA GCATGAATCT TGTGAAAAAG TTCCGAAAAT GGAAGAGTTT
CTGGAAAATA TGCAGCCATT CGTTACCGAG TTGAGGAATA TGTATTCTGA CGCTGTCTAT
AACTATGTCT TGGTATTGCC GGATGATGCC AATGATCGTC CCTACTGTGC CGGACAAAAA
GTTCTGAACG AGGCGACCCG GAAGATTATT GCCCAGGCAA AGGCCAATGG ATGGAGCAGG
GTACCCGATG AAGTCACGGA ACCGGGAACG ATTGAACCGC CCTGTTTCCT TCCGCCCGTC
AACGGCATGT ATGCCCGCAG CGTGGCTGCT GCGGGGGAAC AGGCTCCCTT GACCCTCTCC
ATTGCGGATA CCTCCCGGTT CAGGGAGGGC AGAATCACAG TCCGCTTCCG CAACGAAGGA
AACGAGCCTG TCCGGATTCC GGACGAAATC AACGACGAAA CAGCCGATTA TTTCTTCCTG
ACGGTGTTGA GGACGGAACG GGGGAACTTC CGCGTCCTGA ACAGGAAAGT CGGGAAAATA
ACCTTCAATC GTTCCTTGAA CTACCGGGAA CTGGCCCCCG GCCAGGAATA CAGCGTCACG
ATTCCGGTGT GTCTGGATGA AGTCGATCTG GAAGGCTGGA AACAATGTTC TTGTGAACTG
GAAACGCGCT ACTATAATCA GCAGGGTAAG GATTGTTTCC GGGGGATGCT CCGGGCGACG
GCCAGGCTCG CGCTGCAATG A
 
Protein sequence
MNNYIPLNKA VGPKLICDGE QALIPQKLGV DAMEFNFDGS ACEHATCAGA PEMFIKVEED 
GLYYFGVEAD DTGSISIANE KLCEKDGTKP NGKLNLATGA RYLKAGYYKV ALSWTNNAYP
PVSNNAAAFN VTMDREPIQA GKYEGNSTMK REFSPSPKIK LWTIEKESTI TCEESRKVEV
TLEEPAPVVE ESEGICETYI IPPQIFVTAC KDEVLDEWRL RVQQVSAGSR ILLRTGGYRD
ALKNPPVTEE EAIDAVNEMN RYQSHRLGAW HTQEASLAHE EHHRREFNDA FQFYWDNLRI
QDTLEMKHES CEKVPKMEEF LENMQPFVTE LRNMYSDAVY NYVLVLPDDA NDRPYCAGQK
VLNEATRKII AQAKANGWSR VPDEVTEPGT IEPPCFLPPV NGMYARSVAA AGEQAPLTLS
IADTSRFREG RITVRFRNEG NEPVRIPDEI NDETADYFFL TVLRTERGNF RVLNRKVGKI
TFNRSLNYRE LAPGQEYSVT IPVCLDEVDL EGWKQCSCEL ETRYYNQQGK DCFRGMLRAT
ARLALQ