Gene Amuc_1222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1222 
Symbol 
ID6275585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1468707 
End bp1470575 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content54% 
IMG OID642613278 
Productoligopeptide transporter, OPT superfamily 
Protein accessionYP_001877828 
Protein GI187735716 
COG category[S] Function unknown 
COG ID[COG1297] Predicted membrane protein 
TIGRFAM ID[TIGR00728] oligopeptide transporters, OPT superfamily
[TIGR00733] putative oligopeptide transporter, OPT family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.107355 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCTA CGTCCCCCCA ATCTTCCAGC CCTCTCCCCT CCTGCTTGGA AAAACCGCTG 
CCTCTGGATG GATTCCAAGG TACTCCGGAT GAAGTAGAAC AGCAATGGTA CGATCAAGTT
TATCTGGGTT CCGGAGACAG AATGAAGCAG CTTACCTGGA GAGCCGTCAT CGTAGGCATG
CTTCTAGGCT CCATCCTTTC CCTCACTAAT CTGTACGCCA ACCTCAAAAT GGGATGGTCC
TTTGGGGTGG CTCTGACGGC AGGAATTATC TCTTTTGCCC TCTGGAACGC CTTTGTGCGC
CTGGGCATTT CCAAATCCCC CATGACTATT CTGGAAAATA CATGCATGCA GTCCGCTGCC
AGTTCTGCAG GCTATTCTAC AGGGGGAACC CTCACCTCTG CCGTAGCCGC CCTGCTCCTG
CTGACAGGAC AGCACATGCC TCTTGGCACT ACCTTCGCCT GGATATTTTT CATTGCCGTA
CTGGGTGTCA CCATGGCCAT TCCGATGAAA CGCCAGATGA TCAACATTGA ACAAATAAGG
TTCCCAGACA GTATTGCTAC GGCGGAAACC CTCAAAGTTC TCTATTCTGA AGGCAAAAAG
GCGGCGGGAC AGGCCAAAGC CCTTCTTTAT TCTGCCCTTT TCGCCGCCGC TAATGCCATC
GCCATGGCTG CAGGAGGAGA ACGATGGCTT GGAACGGTCC AGCAACATAT CCTCGGCAAC
TGGTACCAGC GTACTATCTT CTTCAAATGG GATCTCATGT TCGTGGGTGC GGGAGCTCTG
GTAGGCATGA AAACATCCCT CAGCCTCTTC ATCGGAGGAA CCGTTTGCTG GGCTCTTTAC
GTTCCGTGGC TGGAAAGCCA GAAATTGCTT CCCGCGGGAG CAGGTTATCG GGAGAGCGTA
AGCTGGACCC TGTGGGGAGG AACCGCCTGC ATGGTCGTCG CCAGCATTGT GGCTTTCCTA
TTCCAATGGA AAAGCATTGT TCGTTCCTTT TCTTCCCTGG GTGCCATGTT TTCCCTGAGT
AAAAAACGAA AACTGACAGA TGTGGAAAAA ATAGAAACGC CCATGAGCTG GTTTCTAACA
GGCCAGCTTA TCTCTCTGGG AGCTCTCGGC TATCTGGCTC ATACATCATT TAACGTTCCG
TACTGGATGA GCTGCATCGC GGTAGTCATA TCCTTTTTCC TGGCGCTGGT CGTCTGCCGA
ATCACCGGAG AAGCCAATAT TACACCCACC GGAGCCATGG GAAAAGTTAC ACAGCTCATC
TTCGGAGGGA TTGCACCCGG GCACGTAACA GCCAACCTGA TGGCGGCCAA TATCACTTCA
GGAGCGTCCA GTTCCTCGGC AGACCTGCTC GTAGACCTCA AAGTAGGCTA CCTGCTGGGA
GCTAACCCCC GCAAACAATT CATCGCCCAG TTTTCCGGAA TTTTTCTGGG AACCCTCGTC
TCCGTGCTGG CCTTCCGCTC CATGGTTCCG GATGCGAACG CTCTCCAGGC TTTCAATGCT
CCGGGAGCCA GAACATGGGC GGCCACAGCG GAAGCACTGG GCATGGGGTT CAGCCATTTG
CACAGCATCA AGGTGCTTTC CATCATTGCA GGCGGTATTC TCGGACTCAT TCTGGTGCTT
ATTCCCCGCT ATATTCCCCG GACAGGAAAA TGGCTCCCCA CCCCCATCGG CTTCGGCCTG
GCCTGGGCCA TCCAGTGGAA CGACTCCTTC CTTTTCTTTA CAGGAGCTGT GCTCGGCTGG
GCTGCGGACC ATCTTTTCAA GGCCAAATCA CGAGAATATA AAGTCCCCAC CGCCTCCGGC
ATCATTGCAG GCGCAGCCCT CACGGGAATG GCCATTCTGA TGTTCAGCAT TTACCAGGCA
GCCCTCTGA
 
Protein sequence
MASTSPQSSS PLPSCLEKPL PLDGFQGTPD EVEQQWYDQV YLGSGDRMKQ LTWRAVIVGM 
LLGSILSLTN LYANLKMGWS FGVALTAGII SFALWNAFVR LGISKSPMTI LENTCMQSAA
SSAGYSTGGT LTSAVAALLL LTGQHMPLGT TFAWIFFIAV LGVTMAIPMK RQMINIEQIR
FPDSIATAET LKVLYSEGKK AAGQAKALLY SALFAAANAI AMAAGGERWL GTVQQHILGN
WYQRTIFFKW DLMFVGAGAL VGMKTSLSLF IGGTVCWALY VPWLESQKLL PAGAGYRESV
SWTLWGGTAC MVVASIVAFL FQWKSIVRSF SSLGAMFSLS KKRKLTDVEK IETPMSWFLT
GQLISLGALG YLAHTSFNVP YWMSCIAVVI SFFLALVVCR ITGEANITPT GAMGKVTQLI
FGGIAPGHVT ANLMAANITS GASSSSADLL VDLKVGYLLG ANPRKQFIAQ FSGIFLGTLV
SVLAFRSMVP DANALQAFNA PGARTWAATA EALGMGFSHL HSIKVLSIIA GGILGLILVL
IPRYIPRTGK WLPTPIGFGL AWAIQWNDSF LFFTGAVLGW AADHLFKAKS REYKVPTASG
IIAGAALTGM AILMFSIYQA AL