Gene Amuc_1164 details

Gene Information

Locus tag: Amuc_1164
Symbol:
ID: 6273806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1396118
End bp: 1398697
Gene Length: 2580 bp
Protein Length: 859 aa
Translation table: 11
GC content: 61%
IMG OID: 642613215
Product: von Willebrand factor type A
Protein accession: YP_001877770
Protein GI: 187735658
COG category:
COG ID:
TIGRFAM ID:
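
The coordinate and length fields above can be cross-checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1396118
end_bp = 1398697

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.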


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.366027
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCAAT TCAACACCCC GGAATGGTTC TTCCTGATCC CCCTGCTTCT GGCGGCAGGC 
TGGAAATACA GGCGTCTGCG CCTGGCCTCC CCCCTGCGCC TCCTTCTGCA GGCCTGCCTG
CTGCTGGCCC TGGCTCAGCC TGCCCTTGAC AGGGGCGGCA AAGGCATGGA CTTGTGGGTG
CTGGTGGACC GATCAAATTC CACGGCGGGA ATTCCGGCAG CCGGGGCAGA GGAAATCCAG
GCTATTCTGG AACGGCACAA ACATGCCGGG GACCGCGTCC GATTGGTGGA TTTCGCGCAA
TCCGCCGTGC TCCGGGGACG CGGAGACCCC GTCTTTGCCG GGGGAACCGC CGGAACGGAC
ATGGAACAGG CCCTGGCCTA TACCATGGCC CTGATGTCCC CGAACCGGAC GAACCGCATC
CTGATGTTGA CGGACGGCTG GCCCACCACG CCGCTGGACC AGGCGGCGGA ACAGCTCATC
CAGTCCCGGA TTCCGGTGGA CTACCGCCTG TCCGTCACCT TCCGGGAAGC GGACATCCGC
ATAGAACAAA TCCGAACCCC GGCGCGTATC CGGCCCGGCG AAGCCTTTAT TTTAGAAGCA
ACCCTGGCAG GCCCTCCCGG CCCGGCCATC ACCGTTCCGT GGCAAATCAG CAAAAACGGA
GGGGCTCCGC TGCGGGGGAC GGCTACCCTG CTCAACGGCA AAGCCACGGT GCAGCTGGCG
GACAGGCTCT CCACACCCGG CTGTTCCGCT TATGAAATGA CCATAACGCC GCAAAACGAT
CCAATCCCTG AAAACAACCG TGCCGTCAGC TTCCTTGAAG TGACGGGAGG AAACGGCGTC
CTGCTGCTCT CCGGCTACGA CCGCGACCCC CTGGCGCCTT TCCTGGAAGC GCAGGGATTC
CGCGTCCAAC AACCGTCCGG CCTGAACCAG CTTGATGCCC GCCACCTCTC CGGCGTGGGG
CTGGTCATCA TCAACAACCT CTCCGCATCC TCCGTCCCGC CGGACTTTCT GCATGCCCTG
GACTATTACG TGAGGGAACA GGGCGGCGGC CTGCTCATGT GCGGGGGCAG GCATAGTTTC
GGCTCCGGCG GATACTTCTC CTCCCCCATT GACGAACTGC TCCCCGTTTC CATGGAAATG
AAGAAGGACA AGATGAAGCT CATGACGGCC ATGAGCATCG TTCTGGACCG TTCCGGATCC
ATGTCCTGTT CCGTTCCCGG AGGAAAGACG AAAATGGATC TGGCCAATGC CGGAACCTGC
CAGACCATTT CCCTGTTGTC CGACCAGGAT CTCATCTCCG TCCACGCCGT GGACAGCGAA
CCGCACCCTA TTGTCACGCT AAGCAGTCTG GGCCCTAACC GGAAGAAAAT GATATCCAGC
GTCTCCAGGA TTGCCTCCAT GGGCGGAGGA ATCTTCATCG GCGCCGGGCT GAAAGCGGGC
TGGCAGGAAC TGCAACGTTC CGTGGCCGGA ACACGGCATC TGCTCCTCTT TGCAGATGCT
GACGATTCCG AAGAACCTGC CGACTACCGG GAAACTCTGA AAGAAATGGT TAAGGAAGGG
GTTACCGTAA GCGTCATCGC ACTGGGAACG GAAAAAAGTG CCGATGCCGG ACTGCTCAGG
GAAATAGCGG AATTGGGACG GGGGCGCATC TTCTTCTGCG ACCGTCCGGG GGATATCCCC
AGCATCTTTG CACAGGAAAC CGTCAGTGTG GCCCGGGCCG CCTTCATCAG GGAACGTACC
CTCCTGCGCG GCACGGCTGG CTGGCTTCAA ATTGCAGCCG GCCAGCCGGA ATGGCCCCCT
GCCGTAGACG GTTATAACCT GTGTTATTTA AGGAATGGAG CAACAGCAGC CTGCGTAACG
GAAGACGGAA ACGCGGCGCC CCTGGTCTCC TTCTGGAACA GGGGAACGGG ACGGACGGCC
GCCGTTACGT TCGCCATGGG AGGAGAACTC GGAAAAAACA TCCAGCAATG GGATAGCTAC
GGAGACCTTA TCCAAACCCT GGCCCGGTGG CTCAACAGGA AAAATCCCCC ACAGGGATAC
TCCGTGCGGG CGGACACGGC AGGAGACCGG TTACGGATTC GCCTTTACTA CAGTGAGGAA
AACATCCCGC GCCTGGCGGA ACGCATGCCG GAAATATCCC TGGAACTCTC CGGAAAAGAA
AGCTCCCGTA CACAAAACGG TATCTGGGAA CACCTGCAGC CGGGAATATT CCAGTGCAGC
TTTCCCCTCC CGCACGGAGT CATGGCGCGG GGCGCGGTCC GCATCGGCGG CAGCGTTATC
CCCTTCGGCC CTGTCAGCCA GCAGGTGGAC CCGGAATGGG CCATGCCGCC TGAATCCGGC
AACGCTTTCC TGGACCTGGT GAACCGGACC GGAGGCAGGG AAAGGATGGA CCTTCCCTCC
ATTTTCCGGG AACCGAGGCC CGGAACAAGC CTCCAACTCC GTCCCATCCT GCTGTGGAGC
ACCTGCGTCC TCCTTGTTCT GGACGCTCTG TTTACCAGGA CCGGCCTGCT TCCCCGGGGA
CAACCGCGGC AGGGGGGTAC GCGCTCCCGG CCTTCAGGAG AAAACAATCA ATCTATCTGA
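
The gene sequence above is printed in blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of that step together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as input.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGATTCAAT TCAACACCCC GGAATGGTTC TTCCTGATCC CCCTGCTTCT GGCGGCAGGC"
seq = clean_sequence(first_line)

print(len(seq), gc_fraction(seq))
```

Applied to the full gene sequence, the same function could be compared with the GC content field; that whole-gene figure is not recomputed here.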
 
Protein sequence
MIQFNTPEWF FLIPLLLAAG WKYRRLRLAS PLRLLLQACL LLALAQPALD RGGKGMDLWV 
LVDRSNSTAG IPAAGAEEIQ AILERHKHAG DRVRLVDFAQ SAVLRGRGDP VFAGGTAGTD
MEQALAYTMA LMSPNRTNRI LMLTDGWPTT PLDQAAEQLI QSRIPVDYRL SVTFREADIR
IEQIRTPARI RPGEAFILEA TLAGPPGPAI TVPWQISKNG GAPLRGTATL LNGKATVQLA
DRLSTPGCSA YEMTITPQND PIPENNRAVS FLEVTGGNGV LLLSGYDRDP LAPFLEAQGF
RVQQPSGLNQ LDARHLSGVG LVIINNLSAS SVPPDFLHAL DYYVREQGGG LLMCGGRHSF
GSGGYFSSPI DELLPVSMEM KKDKMKLMTA MSIVLDRSGS MSCSVPGGKT KMDLANAGTC
QTISLLSDQD LISVHAVDSE PHPIVTLSSL GPNRKKMISS VSRIASMGGG IFIGAGLKAG
WQELQRSVAG TRHLLLFADA DDSEEPADYR ETLKEMVKEG VTVSVIALGT EKSADAGLLR
EIAELGRGRI FFCDRPGDIP SIFAQETVSV ARAAFIRERT LLRGTAGWLQ IAAGQPEWPP
AVDGYNLCYL RNGATAACVT EDGNAAPLVS FWNRGTGRTA AVTFAMGGEL GKNIQQWDSY
GDLIQTLARW LNRKNPPQGY SVRADTAGDR LRIRLYYSEE NIPRLAERMP EISLELSGKE
SSRTQNGIWE HLQPGIFQCS FPLPHGVMAR GAVRIGGSVI PFGPVSQQVD PEWAMPPESG
NAFLDLVNRT GGRERMDLPS IFREPRPGTS LQLRPILLWS TCVLLVLDAL FTRTGLLPRG
QPRQGGTRSR PSGENNQSI
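
The relationship between the two sequences can be illustrated with a small translation sketch. Translation table 11 assigns sense codons in the same way as the standard genetic code, so a standard codon table is used here; alternative start codons are not handled, and the function name `translate` is hypothetical. The input is the first 30 bases of the gene sequence.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# sense-codon assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
print(translate("ATGATTCAAT TCAACACCCC GGAATGGTTC"))  # prints MIQFNTPEWF
```

The result matches the first ten residues of the protein sequence listed above.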