Gene Amuc_1143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1143 
Symbol 
ID6273894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1365798 
End bp1371773 
Gene Length5976 bp 
Protein Length1991 aa 
Translation table11 
GC content59% 
IMG OID642613195 
ProductYD repeat protein 
Protein accessionYP_001877750 
Protein GI187735638 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0172483 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACCA ACGATCAACA AAATGACGCC CCCTCTTTGA ACAGGGGTTC CGCAAACATG 
ATTAACTCCA ACCCCGCCGC CGGGCTTCCC GATGCGGCCA GCCAGCCCGG CGCGCCGAGC
GCCGCAGCGC CCCTCAATGC CATGTCCACA CCCATTCAGC CATACGGCGC GGATTCCGTT
ATTTATGAGA AATCCGACAA CTTCGTTCAG ACGTCTGCCG GAGCCGATGT CTTCATGTCT
CCAGTCAACG ACACCTTTAC CGTCCCGGAA GGCGGGGCCA CGGCTGTGGC CAGCCTGACG
GTGGATGACT GGGGCAAGCT GACCATTTCC GGCCCGGGCG GCACGTTTGA GCTGGACCTG
ACCTCCGCCG CCGATGAACC CGGAGAACTG GGAGGCCACC AGGAATGGTC CAAATCAGGC
TCCTTTGAGT TATCCGAAGG CACGTACACT CTCTCCATCA CGCACCAGAA CATCGACATG
CCGCACAACG AATACAACCA GTCCGTGTGC AGGTACTCCG TGACGGTGAC CGCGCATGGG
GGCTCCAGCA GCAGTTCAAG CAGTTCGGAC ACCCCTCCTT CTTCCCTGTC GAGCAGCTCC
TCTTCGGACG TGCCGCCGGA GGAAAAGGAA ATCTGCTGCC GTTGCGGTTG TTGTACAGAC
GCGGAAGGCA ATGAATACAC CATTGCGGCG GAAAAACTGC CGGGTGACCC CGGCGTGGAA
ATCTGCATGT CCCAGGCCGA ATTCCTGGCC AGGGGAGGGG CTTCGGCTCC GTCTCCCGCG
GCCTTCAGCC TCCGCTCCGC GGAAAACGCC CGTGAAACGG CGGAAGCCTG CGGCGGCCTG
AAATACGTGA GCCCGTGGGC CTGGCGCGCC CATCTGGACG AAACGTCCGG CCTCATCACC
ATGGTGCCGC CGGCGGGCGC CGCGCTTTAC TTCAACGTGC AGGCCGGTTC CGATACGGCC
CTGCCTGCGG GCATCTCCCG CAAGCGCGAC TTCAGGGTGC AGCTGCTGGA TGAAACTCTG
GCTCCTGCCG CTTCCGGCGC CCCCGCCTAC CTTTCCCTGG TGGACGCGGA CGGGCAGAAA
ATCCGCTTCT CCGCGGAAAC CGGCGCTGTG GTCGGCATGA CCTCCGCCTC CGGCAGGGTT
CTTCTGGCGG AAGACTACTT CCGGAATGTG AGCAATACGT ATGACCATGA GGGCAGCCTG
GTAAGCAGCT ACAGCGCCGC GGAGGGGCTG ATGCGCACCC GGACCGGAGC GGACGGAGAA
CTCGTCATGG AATGGTACGC TCCCGCCGCC GTCACGGTCC TGGCCGACGG AACATACGAG
GTAACGGGGG AACCTTATAA AACCTCCTCC TTCCTGTCCT CGGAAGAAAA CGGCGTGCGG
ACCACCGTCA TCACGCGCCA GCAGCGCGGA CTGCCGGCCC ACACCATCAC CCGTACGGAA
GAACCCGGCA GAGTCAGCAT CGCCAAAGGC CAGGGGGACG ACACCATCAT CCGTACCATT
GAAACCAACC GCCTCTACGG AGGCCTCTCG GAACGCATTG AGACCGTCAG GGGCATCAAC
GATGCCGAGC CTGTTTCCTG CAGCCGCAGC GTCAGGCAAT ACACGGACGG CGGCTGGCTG
CTGGTAAGCG AAACGGAAGC CTTCAACACG CCGCTGGCGC GGACGACCTC CTACGAATAC
AACAGCCAGT ACCGCGTCTC CCGGATCAAC CGCCCGGACG GAGGCTACAC GCGCTATGAA
TACGACGGCG AAGGACGGGT CACGCTTGAG GCGGCGCCCT GGGCCGGCGG CGGAGAACAG
GTAACCCGGA CCGAATACGC CGGCCTGCGC TTCTACGACA ACCGTCCGGT GCGTGTCGCC
GAATCCCGGG TGCTGTCCGA CGGCACGGAA ATCGAACTGA CGGCCGTCGC TTACGCCTAT
GAGCAGTCTC CCCTCATGGA ACGGGTTGTC AAAACCGTGA CTGCCGCCGG TTCCAGCCAG
GAACAGACAA GCGTGGAAGA AACGTATGGA GAAGCCGCCG CCTATCCCTA TGCCGCCGGG
CAAATGAAAT TCACCCGGGA CATTGCCGGA GTGGAAACCT CCTATGACTA TGAGGCGGCC
GCGGAACACG GCGCCGCGCA TAAAAAGACG GCCATCACCA AAGCCGGCGG CGGACTGGTG
GCCGGACAGA GCCGCAAGAC GGAATCGTTC ATTGCCGCCA ATGATACGGT GCTTTTTGAG
CAGGAAAGCA TCTGGGATGG TGAAAACTGG CTGCTGCTCT CAAGCGGCGC CCATGAATAC
GATGAAGAAG GCCGCCGCAC GAAAACCACG CGGGGCAACG GCCGCGTCAG CGTCACGTCC
TGGATGTGCT GCGGCAAGCT CTCGGAAACG GACGAAGATG GTGTCCTGAC GTCGTACGGT
TACAACAGCG CCCACCAGCT GGTGGAAACC ATCCGTTCGG AAATCAGCGA CGGAGACACG
GTCGTTACCC CGGAAACCAT CACCACCTAC ACCCGGGACG CCTCCGGGCG CGCCCTGCAG
ACGCGCCGGG ACAGGGGAGC CATGACCACG ACGGAAAGCG TGGAATACGA CAGGCTCGGC
CGTATCGTCA GGCAAACGGA TGTGCTGGGT CGGGTGACGG CGACAGCCTA CAGCGAAGAC
GGCCTTACGG AAACCGTCAC GACGCCCTCC GGCGCTACCC TCGTCACGGA ATATCATGCC
GACGGCTCCG TACTTCATGA ATACGGCACG GGACAGCGCG AACGCTGCCA TGTCTATGAC
ATTGACAATA ACTGTTTGAG GGAAACCGTT ACCCTGGCCG GCCAGACCAT CATCCTTTCC
CGGACCCTGG TCAACGGCTT CGGACAAAGC GTCGTGCAAG TGACGCCCAC GACTGCCGGG
TTCCTGTATG ACCGTTCCGA ATACGATGAA CAGGGGAGTC TCATCCGCTC ATGGAGGGAT
GCGGGAACGC AGGAGGGGGC CGTCGCCATG GCGCCTGCGC TCTATGAATA CGATGCCTTT
GGCAACATGA CCAGGGAAAC GCTCGCCCTG GCGGAGCAGC CCGCTCCGGA CAACAGCCCC
ATCCGGGAAT ATGCCTTCAG CGTGGAAAAC GCGGAGGACG GCGTCTATAT GGTGACGGCG
CAAATCCGCT ACAATGCTGA GGGACAGCCG CTTGTCTCCG TACGGAAGCA GCTCCTGTCC
GAACTCTCCG GAGTTCTGGA AACAAAAACG GTCATCGTTA ACGAACGCGG CTTGACTTCG
GCGGAATGGA CGGAGTATGC TGGAAATACG AAAAGAATCC AAAAGAGCGT TATCCCCTCT
TCCAGCGTCA CGGCTCAAAC GGTGGCGATG GATGGTTGGG TGCTCTCGCA GCAGAACCAC
GCGGGCATCA CGGAGACGGC CGCCCGCGCC TATACGGCCT CGGGCATGAC TCTGACCCGC
ACGGACGGCC GCGGCAACAC GGTCACGACC CGGACCGACC TGGCCGGTCG GGCCGTCAGC
GTGACGGACG CCGCGGGGAA TGAAACCGTG ACGCAATACG ACTCCTGCCA CGACCTGGCT
GCCGTAGTGA CGGACGCGCT GGGCAATACG AAATGCGCCA GATACGACGC CAGAGGCCGG
AAAACGGCCG AATGGGGGAC GGGGACGCAG CCCCTGCTCA TGGGCTATGA CGAGGCCGAC
CGCCTGGTGA GCCTGACCAC CTTCCGCGCG GCGCAGGAAG GCGACATCGC GGAGGACCCC
TCCGAGCGCG CGGACGGCGA CACCACCACC TGGAACTATG ACGAAGCCAC GGGGCTGGAA
ACGCGCAAAA CCTATGCCGA CGGAACGCAC GTGGACAAAA CCTGGGACGC CTTCAACAGG
CTTGCTACGG AAACAAACGC CCGCGGCATC GTCAAGACCT GCACTTACGA ACAGCCACGC
GGGCTGCTGG TGGGAATCAG CTACTCAGAC GCCACGCCCG GCCAGAGCTT CGCCTACGAT
CACCTCGGTC AATTGACGCA AATCACTGAT GTTGCCGGAA CGCGAACCTT CGCCTACAAT
CTCTACGGAG AACCGGAAAC CGACAGCCTT GCGGCAAACG GCATCGCCTG GCAGGTCTCC
GAGCGCTATG ACGGGCTTGG CCGTCAGGCG GGGTACGAAT TAAGCGCGGA CGGCCGCCGC
GTCCAGCAGA CGCACCTGTC CTATGACGGG AAAGGCCGCC TCTCCACCCT CACGGCGGAA
GGCATGGAAA CGCCCTTCTC CTGGACTTAC TCCGAACATG GAGGGCTTGT GGAACAACTC
GCCTACCCCA ACGGCATGAC CCGGGTCAAC ACCTATGAAG ACAGCCGCGA CCTCCTCTCC
GTCATCGACT ACCAGAGGCC CGGAAGCGCC AACCCGCCGG CAAGGCACGA ATACGACTAC
GACGCGCTGG GCCGTCCTGC ACGGCGCAGG GACACGTGGA ACACGGCGGC GCCCAAAACG
ACGCGTTTGT TCACCTACAA CAGCCGTGGC GAACTGGTCG GAGATCAGCT CAGGCCCGGC
GGCCGCTTTG GCTATCAGTA CGACAACATC GGCAACCGGA AAGAAGCCTT CGAATTCGGC
AGCACCACGG ACTATGAAAC CGATGAACTC AACCGGTATG CGGGCATCGT CAGAAATAGA
GGGGAAGCCT TTACACCCCA ATACGACGCG GACGGCAACC AGACGCTGGT AAAAACATCC
ACGGGCATCT GGGAAGTCAC CTACAACGCG GAAAACCGGC CCGTGAAATT CGAAAGCGAA
GACGGAGGGA CAACCGTGGA ATGCGCCTAC GACTCCATGG GCAGGAGATT CGAGAAAAAA
GTGACGGTTG GAGGGACAAC GGGCTTCCAC GCGCGCTACC TCTACCGTGA CTACCTGCAG
GTGGCGGAGT GCGACTTGAC CGGGGAAACG CCGGAGGTTG TGCGCAGTTA CATCTGGGAC
CCCTCGGAAC CTGAGGCCAC GCGCGTCCTG TCCATGACGC GCTGGGAAGC GAACGGGACG
CAGGAGAAAG AGCATCTCTA CTGCATGCAC GACGCGATGA AAAACGTCAC CTCCCTCTTC
GGGGAAGCGC GCGGACGCCG CGCCCTGTAT GAATACCGGC CGTACGGAGG TCTGATCACG
TCGGAAGGCA ACATGGCGGA AGAGAACAAA TTCCGCTTCT CCAGCGAATA CATGGACGAC
GAACTTGGGC TGGTCTACTA CAACTACCGG CATCTCAATC CGCTTGACGG CAGGTGGATC
AGCCGCGATC CCATTGAGGA AGAAGGTGGT TGGAATTTGT TCGCGTTTGT AGGAAATAGA
ATTTTTAATC AAGCTGATAT TTTAGGGTTG TGGCCATGGT CCCAGAAACA ACCAGATCCT
CCAACCTTTA CAACAGAAAC AAAAAAATGT CCAGATAAAA ATACGATAAG CGTAGTTGTG
CGTAGAAGTA ACGAAATTAC GGTGGATGCA GACGGTTCTC CTCGTGCGTA TCATCCAAAA
AACATAGGGT TAGATGATAA TAGAAATGGA GGAATAGGAA AAGATAATTA CGGTATTGTT
AGTCCTGATG TTATTCAAGG GAAAAATGAT CCTGCTCCAG GTTATTATGT ATCAGTTACA
GCATTATTCG ATCCCCGGAA AAAGAAAACA GACCCTCGTA GATATGTAAA TTCAGAAGTA
ATTCCATATC TTGTTTTTAA TAAAGAGGAT AGAAAAAAAG GTGCTAAGGC CGGTGATTAT
GCAACAGTTA CTAAAAAGAT GCCAAATGGT GATCTTTTAA TTGTTCACGC TATTGTTGCA
GATTATAACC CTTATTCTAA AGGGGAAGGT TCTATAAAAT TAGTAAAGGA ATTAGGAGGA
AATCCGGATC CTAGAAGAGG AGGGGTAAAA TGTAAGGAAG GTTTTACTAT TTACGTGTAT
CCTGGGACTG CAGAAAAATT TGATAGCGAT AAAGTTTCTC ATGAAACTAT TCAAAAAAAA
GGTAAAGAAA TTTGGGATAA GCAGCATAAC AAATAA
 
Protein sequence
MFTNDQQNDA PSLNRGSANM INSNPAAGLP DAASQPGAPS AAAPLNAMST PIQPYGADSV 
IYEKSDNFVQ TSAGADVFMS PVNDTFTVPE GGATAVASLT VDDWGKLTIS GPGGTFELDL
TSAADEPGEL GGHQEWSKSG SFELSEGTYT LSITHQNIDM PHNEYNQSVC RYSVTVTAHG
GSSSSSSSSD TPPSSLSSSS SSDVPPEEKE ICCRCGCCTD AEGNEYTIAA EKLPGDPGVE
ICMSQAEFLA RGGASAPSPA AFSLRSAENA RETAEACGGL KYVSPWAWRA HLDETSGLIT
MVPPAGAALY FNVQAGSDTA LPAGISRKRD FRVQLLDETL APAASGAPAY LSLVDADGQK
IRFSAETGAV VGMTSASGRV LLAEDYFRNV SNTYDHEGSL VSSYSAAEGL MRTRTGADGE
LVMEWYAPAA VTVLADGTYE VTGEPYKTSS FLSSEENGVR TTVITRQQRG LPAHTITRTE
EPGRVSIAKG QGDDTIIRTI ETNRLYGGLS ERIETVRGIN DAEPVSCSRS VRQYTDGGWL
LVSETEAFNT PLARTTSYEY NSQYRVSRIN RPDGGYTRYE YDGEGRVTLE AAPWAGGGEQ
VTRTEYAGLR FYDNRPVRVA ESRVLSDGTE IELTAVAYAY EQSPLMERVV KTVTAAGSSQ
EQTSVEETYG EAAAYPYAAG QMKFTRDIAG VETSYDYEAA AEHGAAHKKT AITKAGGGLV
AGQSRKTESF IAANDTVLFE QESIWDGENW LLLSSGAHEY DEEGRRTKTT RGNGRVSVTS
WMCCGKLSET DEDGVLTSYG YNSAHQLVET IRSEISDGDT VVTPETITTY TRDASGRALQ
TRRDRGAMTT TESVEYDRLG RIVRQTDVLG RVTATAYSED GLTETVTTPS GATLVTEYHA
DGSVLHEYGT GQRERCHVYD IDNNCLRETV TLAGQTIILS RTLVNGFGQS VVQVTPTTAG
FLYDRSEYDE QGSLIRSWRD AGTQEGAVAM APALYEYDAF GNMTRETLAL AEQPAPDNSP
IREYAFSVEN AEDGVYMVTA QIRYNAEGQP LVSVRKQLLS ELSGVLETKT VIVNERGLTS
AEWTEYAGNT KRIQKSVIPS SSVTAQTVAM DGWVLSQQNH AGITETAARA YTASGMTLTR
TDGRGNTVTT RTDLAGRAVS VTDAAGNETV TQYDSCHDLA AVVTDALGNT KCARYDARGR
KTAEWGTGTQ PLLMGYDEAD RLVSLTTFRA AQEGDIAEDP SERADGDTTT WNYDEATGLE
TRKTYADGTH VDKTWDAFNR LATETNARGI VKTCTYEQPR GLLVGISYSD ATPGQSFAYD
HLGQLTQITD VAGTRTFAYN LYGEPETDSL AANGIAWQVS ERYDGLGRQA GYELSADGRR
VQQTHLSYDG KGRLSTLTAE GMETPFSWTY SEHGGLVEQL AYPNGMTRVN TYEDSRDLLS
VIDYQRPGSA NPPARHEYDY DALGRPARRR DTWNTAAPKT TRLFTYNSRG ELVGDQLRPG
GRFGYQYDNI GNRKEAFEFG STTDYETDEL NRYAGIVRNR GEAFTPQYDA DGNQTLVKTS
TGIWEVTYNA ENRPVKFESE DGGTTVECAY DSMGRRFEKK VTVGGTTGFH ARYLYRDYLQ
VAECDLTGET PEVVRSYIWD PSEPEATRVL SMTRWEANGT QEKEHLYCMH DAMKNVTSLF
GEARGRRALY EYRPYGGLIT SEGNMAEENK FRFSSEYMDD ELGLVYYNYR HLNPLDGRWI
SRDPIEEEGG WNLFAFVGNR IFNQADILGL WPWSQKQPDP PTFTTETKKC PDKNTISVVV
RRSNEITVDA DGSPRAYHPK NIGLDDNRNG GIGKDNYGIV SPDVIQGKND PAPGYYVSVT
ALFDPRKKKT DPRRYVNSEV IPYLVFNKED RKKGAKAGDY ATVTKKMPNG DLLIVHAIVA
DYNPYSKGEG SIKLVKELGG NPDPRRGGVK CKEGFTIYVY PGTAEKFDSD KVSHETIQKK
GKEIWDKQHN K