Gene Amuc_1085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1085 
Symbol 
ID6274019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1296592 
End bp1297809 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content52% 
IMG OID642613136 
ProductCarbohydrate-selective porin OprB 
Protein accessionYP_001877692 
Protein GI187735580 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000116954 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value8.02155e-19 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGAAGC ATAACATAAT AGCAATGGCG TTGACGAGCC TCATGGCGCT TCCTGTAATC 
GCTCAAGACA GCACCCCTGC TGTTGGTGAT ATTTTGCAGG GCATCAACAT GTTCACTCCG
GAAGAAGGTG ATCCTGCCTG CAAGGTGAAG GCCAATGAGA AATACGGCAC CCAATGGGGT
GTGGATATGG CCTACGGGCT TTGGAATACG GAAAAGGTTT CCGATGTTAA GCGCCATAAC
AACCTGGCTC TTCTTCATGC CCAGCTTAAT CAGCGCCTTA TTGAAGACAA GGCCAACGGG
GGAACCTGGT TGCGTGTGGA ATTTTCCGGT TCATGGGGGC TGGATCGTGA ATCCGCACAG
AGCGATACGT TTTTCACTGA AGCCTATGCC ACGGCTTCCG GCTTGCATGC TGACGCGATG
GGGCCTCATG AAGGAATATT CCCTGAAGTG GCGCTGATGC AATACTTTGC AGGGAAGCGC
GCTTGCCTCA TCGCAGGTAT GGTGAATCTC ACCAACTACT TTGATGCTGT CAGCATTGCC
AATGACTCCT TCTCCTCTTT CACCAATGAC GGGTTTGTGA ACTCCACTAT CCTACCTTTG
GTGGACAGCA ATATTGGCGG TATTCTGCAA GTTGAACTCA ACCGCAATAA TTACATGATG
GTTGCCGTTT CCCGCACGGG ATGCGATTCC GGATACAATC CTTTTAATTC CGATTATTGC
GATGGTTATG CCGTGGTTGG CGAATACGGC CATATCTTTG CCGACGGCGC TGCGACTCTC
CGCATCAATC CGTTCTATAC CAGCACGGAT GTGGACATGG ACGACGGGAC CGGGGAACGC
CGCCGCCAAA ATGCCGGGCT TGTCGCGAGC ATCGAATATA CTCCCTGCGA TCCTCTGACC
ATTTACTCCC GCGCCGGATT TGCCGCCAAA CAATACTTGA GCAACTCCGC TGAATTCTCC
GTGGGCGCCA ACATTAAGCT CTTCCCTTCC CGTGAAGATG ACTTCCTGGG CATTTCCTAC
GGTGTGTTCA AGGGGCAGAC CCCCTGTGAC GGAGAGCGCG CTGAGCATAA CCGCGAACAG
GTGCTGGAAG TCATGTACAG CTTCCAGGTG AATGATTATT TCAAAGTTGT TCCTCACTTC
CAGTACATCG CGAATCCGGC TTACAGCACT TCCAGCGAAA ACATTCTCTG GGGCGTTCAG
GCAGTCTTTT CTTTCTGA
 
Protein sequence
MKKHNIIAMA LTSLMALPVI AQDSTPAVGD ILQGINMFTP EEGDPACKVK ANEKYGTQWG 
VDMAYGLWNT EKVSDVKRHN NLALLHAQLN QRLIEDKANG GTWLRVEFSG SWGLDRESAQ
SDTFFTEAYA TASGLHADAM GPHEGIFPEV ALMQYFAGKR ACLIAGMVNL TNYFDAVSIA
NDSFSSFTND GFVNSTILPL VDSNIGGILQ VELNRNNYMM VAVSRTGCDS GYNPFNSDYC
DGYAVVGEYG HIFADGAATL RINPFYTSTD VDMDDGTGER RRQNAGLVAS IEYTPCDPLT
IYSRAGFAAK QYLSNSAEFS VGANIKLFPS REDDFLGISY GVFKGQTPCD GERAEHNREQ
VLEVMYSFQV NDYFKVVPHF QYIANPAYST SSENILWGVQ AVFSF