Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_2164 |
Symbol | |
ID | 6275449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2636285 |
End bp | 2639986 |
Gene Length | 3702 bp |
Protein Length | 1233 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642614224 |
Product | glycoside hydrolase family 18 |
Protein accession | YP_001878752 |
Protein GI | 187736640 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0772652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.613153 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAAGG TGTTTTTCAA GAGGGCAGCC CATTTGCTGA CAGCGTTTTT TGCGGGAGGC GTCTGTGTCC TGCAGGCGGC GGAGACGTCC GGCTTCGGCA TAGACCTGGG TAATGGTTCC GGAACGGCGC AGGAGGGATG GACCAATGTA ACCATGCCAG CAACGGCATC GGGAGGCGCC AATACATTTG TTGCCGTTCC GTTAATCAGG AATGGTGCGG CTTCTTCTTC TTCGTCCGTG GCAGTGGGAA GCCTGTTCGG AAGGCTGGTT TCCCTGACCG TCTCCGCCAG ATGCAGTTCT GGAGGTTCTT ATACGCTGGC GGAACCCAAT ACCCGGTATA CCTGGTATGA AGGGGGGAAG CACAGGCACA ACGGTACGAA CGCTGACGCG CCTTCTTCTT TTGAATTCCC CAATGGCTGC GAAGCTTTCA ATACCAGCAT GCGTCTGAGC GCCAACGGGA TGCCGTCCGC CTCCGCCGCG GCCAGGCTTT CCTTCTCCGG ATTCCAGGCG GGCAAGGAAT ATACGGTATC TTTCTTTTGC GGGCACAATG CGCCTGCTTA TGAATCCATG ACGCTGGTCA GCGGAACTCT GGTGGATGTG CGGGGGCTTC AAAGTTCCAT GGGGAGCCTT TCCGGGAATC AATTTTCCGG TCTTTCCAAT TCATCGGAAC AGGACGCTTA TCTGGCTGTG GAGTGGAGGG CCTGTGCGGA TGAACAGGGC AGGCTGGTGT TTGATGTGAC CAAGGAGGCG GACAGCAGCG CTTCCGGCCG CATGGAACTG AATGCCGTTA CCGTCAGTAC GGACGATTCC CCGGCTCCGG AAGTTCCGGA TCCCCGGCCT GTCAGCGTCT TGCACAATAA GGCGCTCATT TTTCTTCCTT CTTCTCTTGC GTTCCCCGTT CCTTCTTCCA CATGGGCGGC TGAAGGCCAG TTAACCTGTG AATTCTGGGT CAGGCCCAAT GATGATGCCG TCACGGGAAA AATCATTTCT TTGGGAAATG CGTCTGTTTC CCTGGAACAC GGTTGTCTGG CTCTGTCTGC TCCGGATGGG GATTCCGTCG CTTCCGTGAA AGCTGCCGGG AAGGAATTGG CTGCGGGGGA ATGGCACCAT GTAGCCGTCT CCGTCGGGGA GGAGGTTATC AGTCTGTATG TGGACGGAGG CCTGGCCGTT TCCTCGCCGC GGTCGCCTTA CCTGCAGGCC ATGAAATCAG GCTGGAAAGG GATTGTGCTG GAGGCGGGGT TCAAGGGTGC TCTGGATGAA CTGCGTTTCT GGAATGCGGC TCTGGGCGGG GAGGAAGACG ACTTTTTCTT GAACGGACCG TTGCCCCTGT CCCATCCCAG GTATGGGAGG CTGGTGGGCT GCTGGCGTCT GGATGGGGAT TTCCGTGATG CCAAGTGGAC GGAATTTGCG GAAAAGACAG GGAGCGCCTT TGTGCGCCCT TACCAGGCGG TTGCCCCTGA AGGGACTGAA TTTTCCATTG TCACGGATAA TGAAACGTTC CGTTACATGC TGGTGACGGC GTATGTGAGG AATGTGCATG TTATTTATGA CTGGCCGGCG CGCGCCCATC TCATCAATAA TTCCGATTTG ATCTACATCA ATTCCGTTAC GCCGGCCGCG GACGGTTCTC TCAATTTTCA ATATCCGGAC AATGACGTGA CGGAAAGCTC CGGTGTGGTT CTGCTGCCGT CTGACGGGGA ATGTTCCAAT GTCCTGGACT TCTCCGGGGA AGGGGCTTAC ATGAACGTGG GCGGAGGTTT GCTGGGCGGC GGCAGTTCCG GAGCGTTTAC CGTGGAAGTC CGCATGGCTT TGGACGGAGA CGGACAGGAG GCGGTGCTGT TTGAAAATGA TAACGTTTCC ATGAGGCTGG AATGGAAGGA AGACCATTAC AGGGTCCGTA CCCAGGCAGG CCCGGATAAA GCCTGGGAGG CGGATTTGCC GCCTGTGGAA GCCGGCAGGT ATTTCTGGCT GGCCTTCGTC AGGAATTCTT CTGCGGATAC GGCGTCCTTT TATGTGGACG GGGAGGCCGT AAGCACGGCC GCCGCGGATG CGGGGGAAAT GGAGGGAACT GCCAATGCCG TCATAGGAAG GAATTTGGAT GGCAGAATTG ATGAAATGCG CGTGTGGCAT GAGGCCCGTG CGGCGGCCCG CCTGGGGGCG GCTGTCCAGC ACAGCTGGGG AGACAGGTTG TTGGTGGGCC GCTGGGGGAC AAGTGACCAA TTTGGCCATG ATACAGCTTC CTGGGTGGAA CATGTCCGTA TCCTCCGCAG GCTGACTGAA GGCGTTTCCG GAATGCGGAT ACGGCTGGGT GTCTCCGGCG GGAACTGGTC CGCCATGCTG TCTGATGCAA ATGCCCGTGA GGCTTTTGCC GAAAATGTGG CGGAAGTGGT ACGCAAGCAT CAATTGGACG GACTGGATCT GGACTTCGAA TGGATTGACC AGAACGATAC GGCCGCCTGG AATAATTACG GAGAATTGGC CAGGGCTATC CGGGCGGCTT CCCCGGATAT GTTTTTTACC ATTTCCCTGC ATACCTACTA TTACAAATTC CCGGCGGCCT GCATGCGTTA TGTGGATTAT TTCACGTTCC AGAATTATGG CCCCCAGATT GATGTAAACG GATATAGCAG CATGGTTTCC GCATGCGGAA CATACCGTTC ATGGGGCTAT CCGGACTCCA AGATCATGCT CAGCGCTCCG TTCCAGGGGA CTCCGGGCGC GGGGCAGGGC GCCGATATCC GGGCTTACCG GGACATTGTC TCAGCCTGCG CCGGAGTTCG GGAGGATCCT TCTCTGGATT CCGCCAGCTT CAATTATGGC GGAGGCAAGG TTAAGACGCT GCACTTCAAT GGTGTGGATA CGGTCAGGAA AAAGGCCCGC TACATCAGTG AACAGAAAGT GGCTGGGTTC ATGTACTGGG ATTTGGGAAT GGATGTGGCG GATTCCTCTG GGAAGAACAA CTACTTTGAC GAATGCTGCC TGCTTCGGGC CGCCAACCGT TATGTTTCCT CCACCGCTTA TCCGGATACT CCGGCCCCCT TTGCCCTTTC TTCTGCGGGG GAAACCGTCC CGGCCGGGGG CGGTGCCGTG GCGGTGGAAG TGCAGTCGGA AGAGAAGGCT CTGGGCTGGG TGGTCGCAGA TTGTCCGGAC TGGATTTCCG CCTCTACCGT TTCGGGAATC GGCCGGACTA CAGTCATTTT GACGGCGGCG GAAAACAAAT CCGCTGACGG ACGGTTCGGG ACGGTAATCT TCCGTTCTTC CGACAAACAG GAATGCTCTG TCATCATAAC GCAGGATGGC GCCGAATTGA CGGGCTACGA CAAGTGGGTG CAGGACTCCT TCCCTCCGGA TGCTGCCGCG GACCGGACGG CTGCGGATGC CGTTCCTGCC GGGGACGGTA TCCCCAACCT GATGAAATAC GCCACAGGAC AGGATCCGTT GAAACCCTGC GGGAGCGTTA CGAAAGTAAC GCTGGAAGAG GGGGAGGACG GATGCATGCA TCTGGTGCTG CGCTGGCCTG TAAATCCGCA GGCAACGGAT GTGAAGCATG AAGTGGAAGC CTCCACGGAC CTGGTCGACT GGATTTCCCT GGGAGAAGTG GAAACCGCCG GAAAGACGGC TGCCGAATTT TGGGATGCGG AACCCGTACG GGAAAGCGGG ATGGAACGCC GGTTTTTGCG GTTGAAAGTG ACTCGGGAAT AA
|
Protein sequence | MVKVFFKRAA HLLTAFFAGG VCVLQAAETS GFGIDLGNGS GTAQEGWTNV TMPATASGGA NTFVAVPLIR NGAASSSSSV AVGSLFGRLV SLTVSARCSS GGSYTLAEPN TRYTWYEGGK HRHNGTNADA PSSFEFPNGC EAFNTSMRLS ANGMPSASAA ARLSFSGFQA GKEYTVSFFC GHNAPAYESM TLVSGTLVDV RGLQSSMGSL SGNQFSGLSN SSEQDAYLAV EWRACADEQG RLVFDVTKEA DSSASGRMEL NAVTVSTDDS PAPEVPDPRP VSVLHNKALI FLPSSLAFPV PSSTWAAEGQ LTCEFWVRPN DDAVTGKIIS LGNASVSLEH GCLALSAPDG DSVASVKAAG KELAAGEWHH VAVSVGEEVI SLYVDGGLAV SSPRSPYLQA MKSGWKGIVL EAGFKGALDE LRFWNAALGG EEDDFFLNGP LPLSHPRYGR LVGCWRLDGD FRDAKWTEFA EKTGSAFVRP YQAVAPEGTE FSIVTDNETF RYMLVTAYVR NVHVIYDWPA RAHLINNSDL IYINSVTPAA DGSLNFQYPD NDVTESSGVV LLPSDGECSN VLDFSGEGAY MNVGGGLLGG GSSGAFTVEV RMALDGDGQE AVLFENDNVS MRLEWKEDHY RVRTQAGPDK AWEADLPPVE AGRYFWLAFV RNSSADTASF YVDGEAVSTA AADAGEMEGT ANAVIGRNLD GRIDEMRVWH EARAAARLGA AVQHSWGDRL LVGRWGTSDQ FGHDTASWVE HVRILRRLTE GVSGMRIRLG VSGGNWSAML SDANAREAFA ENVAEVVRKH QLDGLDLDFE WIDQNDTAAW NNYGELARAI RAASPDMFFT ISLHTYYYKF PAACMRYVDY FTFQNYGPQI DVNGYSSMVS ACGTYRSWGY PDSKIMLSAP FQGTPGAGQG ADIRAYRDIV SACAGVREDP SLDSASFNYG GGKVKTLHFN GVDTVRKKAR YISEQKVAGF MYWDLGMDVA DSSGKNNYFD ECCLLRAANR YVSSTAYPDT PAPFALSSAG ETVPAGGGAV AVEVQSEEKA LGWVVADCPD WISASTVSGI GRTTVILTAA ENKSADGRFG TVIFRSSDKQ ECSVIITQDG AELTGYDKWV QDSFPPDAAA DRTAADAVPA GDGIPNLMKY ATGQDPLKPC GSVTKVTLEE GEDGCMHLVL RWPVNPQATD VKHEVEASTD LVDWISLGEV ETAGKTAAEF WDAEPVRESG MERRFLRLKV TRE
|
| |