Gene Amuc_1514 details


Gene Information

Locus tag: Amuc_1514
Symbol:
ID: 6275751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1806589
End bp: 1808571
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 55%
IMG OID: 642613573
Product: hypothetical protein
Protein accession: YP_001878116
Protein GI: 187736004
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3064] Membrane protein involved in colicin uptake
TIGRFAM ID:
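
The coordinate and length fields above are consistent with each other if the start and end positions are read as 1-based and inclusive, which is an assumption here rather than something the record states. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1806589, 1808571)
print(length)                     # 1983
print(protein_length_aa(length))  # 660
```

Both values match the Gene Length and Protein Length fields listed for Amuc_1514.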


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.767692
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.00000248553
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGAAAT GGGAAAGATT CTTTCCCAAT CCCGATTACA TGAACATTCC CTCTTTTTCC 
CGTAGCGTTT CCAGAGGTTC CGCCGTGGGC TGGTTTTTAG TGTTGCTGCT GGTTTGCGGT
GCCGGAGCCG GATATTATCT TTACCAGGAT AACCTGGCAA AAAGAAAAGC GGCCCAGGAA
TTGACCGCCG AACGTAAATT GAAGGAGAAG AAGGCCAGGG AAGCTGCGGA GAAACAGCGA
ATAAAACGGG AACGGGAAAT CAGGGAGAAG AAGGAAAAGG AACGGCTGGC GGCCCAGAAG
GCTTATGAGG AAGCGCAGGA AGAGAAGGCG CGGCAGGCGG CGGAAGCGGC CCGTAAGCTT
CAGGAGCAGG CCGAACGCGA AGAACGGGAA AAAAGGAGAA GGGAGGAGCT GGAACGGCGC
GAACGTGAGG AAGAGGCCCG CAGGCAGGAG GAAGATACTC CTGTGGAGGA AGAGCCTGAA
CCGGAAGGGC GTTTTCCCCA ACCCGTGAAA AACCGGATGC CGGAGCTTTC CGTTTATTCT
ATTCCTTGCA GGGATGATAT CCAGACGGAA AAAGACAAGC TGCTGGAAAC ATGGTCCTGG
GACAAGGCGG AAAAAATGGA GGGTATGGAG GAATTCCCCA CGGGTTCCTC TCCGTGGAAA
AAAGGGAAAG ACGCCGGACG CATGCAGGCG CTTCTGGAAA AATGCCGGGA ATGGAAGGAT
GCCAAACTTG CCTCGCTGAA GGCGTGCCCG GCGGCGAAGG ATTTTCCCGG CGTTCCGGAG
AATGGGGCCC AGACGGTCAG GCGGACGGTG GAGATAGATT CCAATATTGG AGGCTGGCAT
AGTACGGGAC TGTATGCGCC GCCCGGGGCG GAAATTTCGT GTTCCCTGTC CGGCGCTCCC
AAAGACGGTT CGATCAGCGT CCGCATAGGA TGCCATACGG ACAGCCTTCA TAAGCTGGAT
GAATGGAAAA GAGTGCCGGA AATAACCATG CAGGTTCCGG CTGGCCGTGG GCGCGTGAAA
ATGGTGAATC CGATGGGCGG CCTTGTTTAT GTGAATGTAG GCCAGCGTCC CAGACGGGGA
AAGGTCTTCA AGGTTCAGAT TTCCGGAGCC GTGCCTTCAC CTCTGTTCGT AATGGGGAAG
ACCACTCCGG AACAATGGGC CGAACAATTG GAAAATACCA AGGCCCCGTG GGGGGAAATC
CGCATGCCCC GGCTTATTGT CACGATGCCC GTGGAACAGC TGAAACAATG TCCGGATGTT
CAGAAGACGG CGGAATTTCT GCAAAAAAAC ATGGCTCTTC AGGACTGGAT TATGGGATGG
GATACCAAGC CGGACCGCCT GCATCATCCG ATGCGCTTTG TCGTGGACAG GCAGATATCC
GCCGGGGCCG GGCATTCCGG TTATCCCGCC ATGGCCACGA AGGACTGGAC GAATTCCATT
GCCACCGGTT CCATCATCCA TTCCGGAAGC TGGGGTTTGT GGCATGAACT GGGGCATAAC
CACCAATCCC CTCCCTTTAC GATGGAAGGC CAGACGGAGG TATCCGTCAA CATATTCTCC
ATGGTGTGTG AAGTGATGGG GACTGGAAAA GACTTTGAAT CCTGCTGGGG CGGCGGCATG
GGGCCGTACG GCATGAGCGC GGAAATGAAA AAATATTTTT CAGGCACCCA GACTTACAAT
GAGGCTCCCA ACAAGGTGCA GCTCTTCTTC TGGGTGGAGC TGATGTACTA TCTGGGGTTT
GACGCCTTCC GCCAGGTGGC TCTTCAATTC CATGACAAGC CTTATGACAA CGGCGAACTG
AGTGATGAAA AGAAATGGGA ATGGGTCATG AACGCTTTTT CCAAGGTCAC GGGAAAGAAC
ATGGGGCCTT TCTTTAAGAT TTGGCGTACG CCGGTTTCCG AACGCGCTAC GGGCAGAATG
AAAGACCTTC CCGCCTGGCT TCCTTCCAAG GATTATCCGG CCTGTTATAC CGCAGAGGAA
TAA
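
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be stripped of whitespace before any computation. The sketch below shows one way a GC percentage such as the GC content field could be derived. The function names are hypothetical, and how the database itself computes or rounds the figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCAGAAAT GGGAAAGATT CTTTCCCAAT CCCGATTACA TGAACATTCC CTCTTTTTCC"
print(len(clean_sequence(first_line)))  # 60
print(gc_percent(first_line))
```

Passing the full 1983 bp sequence to `gc_percent` gives the value to compare against the reported GC content.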
 
Protein sequence
MQKWERFFPN PDYMNIPSFS RSVSRGSAVG WFLVLLLVCG AGAGYYLYQD NLAKRKAAQE 
LTAERKLKEK KAREAAEKQR IKREREIREK KEKERLAAQK AYEEAQEEKA RQAAEAARKL
QEQAEREERE KRRREELERR EREEEARRQE EDTPVEEEPE PEGRFPQPVK NRMPELSVYS
IPCRDDIQTE KDKLLETWSW DKAEKMEGME EFPTGSSPWK KGKDAGRMQA LLEKCREWKD
AKLASLKACP AAKDFPGVPE NGAQTVRRTV EIDSNIGGWH STGLYAPPGA EISCSLSGAP
KDGSISVRIG CHTDSLHKLD EWKRVPEITM QVPAGRGRVK MVNPMGGLVY VNVGQRPRRG
KVFKVQISGA VPSPLFVMGK TTPEQWAEQL ENTKAPWGEI RMPRLIVTMP VEQLKQCPDV
QKTAEFLQKN MALQDWIMGW DTKPDRLHHP MRFVVDRQIS AGAGHSGYPA MATKDWTNSI
ATGSIIHSGS WGLWHELGHN HQSPPFTMEG QTEVSVNIFS MVCEVMGTGK DFESCWGGGM
GPYGMSAEMK KYFSGTQTYN EAPNKVQLFF WVELMYYLGF DAFRQVALQF HDKPYDNGEL
SDEKKWEWVM NAFSKVTGKN MGPFFKIWRT PVSERATGRM KDLPAWLPSK DYPACYTAEE
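
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It builds the codon table from the usual compact 64-letter string. NCBI translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumes the sequence starts in frame and contains only A, C, G and T.
    Whitespace (the 10-base blocks used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCAGAAAT GGGAAAGATT CTTTCCCAAT"))  # MQKWERFFPN
```

Translating the full gene sequence this way should yield a 660-residue string to compare with the protein sequence above.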