Gene Amuc_0137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0137 
Symbol 
ID6274869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp169490 
End bp170860 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content59% 
IMG OID642612182 
Producthypothetical protein 
Protein accessionYP_001876762 
Protein GI187734650 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTTTG CCGGAGCGTG TCAACCCCTC CGGGGGACAG GCGATGATGC ATTCCTTCCC 
CATATTCTTT TCAAGGAACC TGTTCCGGAA GAACGGGAGA CGGAAAAAGG GTTGCGTCCG
CGGGACAATG TGGTAGCTTG TGCGGGCAAA GGGAACCCTA TGACCTTACC TGTAAAGATT
CTGTCCGTAT TGGCAGGGCT GAGCCTGGCG GGAAGCGCCC GGGCGCGCAT TTGGACGAAT
GACCGGGGCG CTACCGTCGT AGCCGTGCTG GTGGCCGTCC GTGATGCGGA GGTGGATTTG
AAGCTTCAGG ATGGTCGCGT GGTGGCTGTT CCCAAGAATA TTTTTTCCGG AGCCGACCAG
GAATATATTC TGGAATGGGT AAAGTCTGGC GGGACGCTCC CCGATACTGA TGTGGGGGCT
GAGAATCCTC CTGTGCAGGG GGATTCCGGC AGACGGTACG TTCCCCTGGA GCCGAATTGG
GATGCTCCCT GGCCGGAGAG GGTGGTTGTT CCCGGTTTCC TGCTGGTGAA AACCGTGCAG
GAGACGGAGG AGCTGAGTGT GTATGAAACG GATCATTTCA TCATGGAGTC TCCCGGAAAG
CTGCCGGAAG CGGAGCGTAT GATCCTTGCC CGGCGTTTTG AAACCATTCT GTCCGCCCTG
GCAGCTGTTC CGTTGAATCT GGCGGTGGCC CGCCGGCCTT CCCGCAAGTA TCTGGTGAGG
GTATGTTTCC AGGAAGAGGA TTTCAACAGG GCTCCGGGCC TGCGGAACGG GCATCTCAAG
TTTTCCCCTA CCAGTTTCAC GGCTCTGCTG TTGCGGGATA AAAAGGGAAA GCTGCTGAAG
CCGGGACTGG ATCCGCGCTT TGCCGTCACG CACTGGGCGG CGCAGTCCAT GGATTGGGAT
CACTGGCTGG TGGACGGCTT TTCCGCGTAC ATGGCTTTTC TTCCCATGGA GAAGGAAGCG
CCGGTATTCC GGAAGATTCC GGAAAGGCTG GCCGCGATGG TTCCGCGCGC CGTCCGCACG
GGCAGAGAAG CTCTGCCGGC CCTGGCGGAC ATGCTGTCCA GGGATTCAGC ACATTCCGCC
GCGGAGCATG GCGTTTCCTC CAGGGGAGGA ACGCTTCAGT ATTGGGCCGA TTTGTTGTGG
ATGGTTTACT GGAGCCACCT GGAAGGGAGC GGAAAGGCGG AGCGCCTCAG AAGCTATTTG
AGGGTTCGTG ATGCGGAGGG CGGCGGGAAG GCGCGTGCTG TTTTGCTGGA TGGGAAGACT
CCGGAGGATG TACAGGGGGA AATGGTCGCG GCCTGGAAAA AAATGGGATT GAGGCTGAGA
TTTTCTCAGC CGGCTTCTGC TGCCGCGGAC AGGGAGTCTG CGGGAAAATA A
 
Protein sequence
MGFAGACQPL RGTGDDAFLP HILFKEPVPE ERETEKGLRP RDNVVACAGK GNPMTLPVKI 
LSVLAGLSLA GSARARIWTN DRGATVVAVL VAVRDAEVDL KLQDGRVVAV PKNIFSGADQ
EYILEWVKSG GTLPDTDVGA ENPPVQGDSG RRYVPLEPNW DAPWPERVVV PGFLLVKTVQ
ETEELSVYET DHFIMESPGK LPEAERMILA RRFETILSAL AAVPLNLAVA RRPSRKYLVR
VCFQEEDFNR APGLRNGHLK FSPTSFTALL LRDKKGKLLK PGLDPRFAVT HWAAQSMDWD
HWLVDGFSAY MAFLPMEKEA PVFRKIPERL AAMVPRAVRT GREALPALAD MLSRDSAHSA
AEHGVSSRGG TLQYWADLLW MVYWSHLEGS GKAERLRSYL RVRDAEGGGK ARAVLLDGKT
PEDVQGEMVA AWKKMGLRLR FSQPASAAAD RESAGK