Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1470 |
Symbol | |
ID | 6274447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1761383 |
End bp | 1762330 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613530 |
Product | eight transmembrane protein EpsH |
Protein accession | YP_001878073 |
Protein GI | 187735961 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02602] eight transmembrane protein EpsH (proposed exosortase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.537051 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTCCC CCGTTCCTGA TGGCAAAGAG AGATTTCTGA CCGCCCCCAT GATCGCCGCC CTGGCCGTGA GCACGCTGCT GCTGGCGCTG AACTATCTGG CATTCCCGGA ATTCGGTTCA TTCGGAAATG AAAACAGCAT CCAGTGGCTC ATTTCCTCCT GGAACAAGCA AACGGATTAT GAACACGGCT GGCTGGTGGT TCCCATCATC ATCTTCATGC TGTACCATGC CAGAAAGATA ATTGCCCAGG CACCCAGACG CATGGACTGG CGCGGCCTGA TTCTTTTCAT CCCGGCGATC ATGCTTCTGA TGCTTTCCTT CCGCGTGGGG CAGCCTCGCG TGGCCGTAGG CGCTCTTCCC CTGATTTTAC TGGGCGGGGC CTGGTACCTG GCCGGACCGC AGACCGCCAG GCTGTGCGCC TTCCCGCTGC TCTTTTTCTG GCTGTGCATC CCCCTGCCCT CCTTCCAGCA GGCTACGGTG GGGCTTCAAA TCATCGCGAC GGAACTGGGG CATTGGGGGG CAAGCATCTT CGGGGTGGAC ACCTATCTGC AGGGCACCAA TATCCGCTCT ACCGGCGGAC ACTGGGATGC CTTCAATATT GCAGGAGGGT GCAGCGGCAT GCGTTCCCTG ATGGCACTGC TCATGCTGTC CGCAGCCTGG GCCTACCTGT CCGACTTGAA ATTCTGGAAA AAATGCGTGC TCTTCCTCAG CGCGATTCCC CTGGCCGTCA TCGGCAACGG TGTCCGCATC ACCAGCATCG TGGTGATGGC GGAATACGGA AATCCCGAGT TTGCGTCAAA AACCTGGCAT GACTGGTCCG GCCTGCTGTT TTTTTTCCCT ATCAGCCTTT TCGGCCTGGC AGCCGTCCAC TCCCTGCTGG CCGGGGAACT CATCTGGAAG CCGTCACAGC GCAAGAAGCT GGTTGTTAAG ATGAACAAGT CCCATTAA
|
Protein sequence | MDSPVPDGKE RFLTAPMIAA LAVSTLLLAL NYLAFPEFGS FGNENSIQWL ISSWNKQTDY EHGWLVVPII IFMLYHARKI IAQAPRRMDW RGLILFIPAI MLLMLSFRVG QPRVAVGALP LILLGGAWYL AGPQTARLCA FPLLFFWLCI PLPSFQQATV GLQIIATELG HWGASIFGVD TYLQGTNIRS TGGHWDAFNI AGGCSGMRSL MALLMLSAAW AYLSDLKFWK KCVLFLSAIP LAVIGNGVRI TSIVVMAEYG NPEFASKTWH DWSGLLFFFP ISLFGLAAVH SLLAGELIWK PSQRKKLVVK MNKSH
|
| |