Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0018 |
Symbol | |
ID | 6275222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 23308 |
End bp | 24450 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642612058 |
Product | hypothetical protein |
Protein accession | YP_001876646 |
Protein GI | 187734534 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTCCT ATTGTACCAA TATTCATCCC GCAGAATCCT GGGCGGAAAC CAGGGAGGCT CTTTTCACCT GCGTTCCCCG CATCAGGCAG GAACTCGCGG CGATGGACTC CCCTCTGAAG GATCTCCCCC TGGGCATCGG CCTGCGCCTG TCCGCCAGGG CGGCAGCGGA GCTTCTGGCA ACGCCGCACG CCGCGGAAAC CCTGAAATCA TGGCTGGAAG ACCAGGGCGC GCGCGTAGAA ACCCTTAACG GGTTCCCTTA CGGAAATTTC CACGGGCAGC GCGTGAAAGA ACGCGTTTTC CAGCCGGACT GGACTACACC GGAACGTTTT GAATACACCT GCAACCTGTT CCGCATTCTG GCACTCATTG GTGACGAACA GGCTGACAGG CTGACCGTCA GCACGCTCCC TGCCTCGCAC AGCTGGTTTC ATGCGGATGA AGAACGCATC TTCTCCCGGC TGGACGCCAT GAGCGGATTC CTGGATGTGC TGGGCAGGCA GACCGGCTGC CTGATGCAGC TGGGGCTGGA ACCGGAACCC TTCGGCCATT TTCACGATAC GGATGGAGCC ATCCGTTTTT TCAACGGCCT CCGCAACCGT TCCCGCCGTC CCGAACTTAT CGAACGCCAC CTGGGGCTGA CATACGATAC CTGCCATTTC GCCATTCTCC GGGAAGAACC GGAATTCACC CTCTCCGCCT GGGAGGAAAA CAACATCGCC CTCTGCAAAG TGCAATTTTC CAACGCCCTG GAATGCCGCA TATGTGGGGA GGAAGACCTG GAACGCCTCC GGCAGTTTGA CGATGGCGTT TATTTCCATC AGACCAGCAT CCTCCACCGG GAAGGCGCCA TGCTTTTCCC GGACCTGCCC AATGCCCTGG CCTATGGGCG GGATTATGCA GAGGAAATAC GTGATTCCCA ATGGCGCATT CATTACCACA TTCCCCTGTA CGCTTCACCG GAACCACCCT TGAAAAGCAC GGAAGAATTC ATCCAGAAAA CGCATAATTT CCTCCGGAGC CGCAAAGGCC CGCAACCGCA TCTGGAGGTG GAAACCTATA CCTGGAGCGT CCTGCCGGAC CACATGAAGA TCCCCCTGGC AGCCCAGATT GCCCGTGAAC TGCATTATAT TGAAACCCTG TAA
|
Protein sequence | MLSYCTNIHP AESWAETREA LFTCVPRIRQ ELAAMDSPLK DLPLGIGLRL SARAAAELLA TPHAAETLKS WLEDQGARVE TLNGFPYGNF HGQRVKERVF QPDWTTPERF EYTCNLFRIL ALIGDEQADR LTVSTLPASH SWFHADEERI FSRLDAMSGF LDVLGRQTGC LMQLGLEPEP FGHFHDTDGA IRFFNGLRNR SRRPELIERH LGLTYDTCHF AILREEPEFT LSAWEENNIA LCKVQFSNAL ECRICGEEDL ERLRQFDDGV YFHQTSILHR EGAMLFPDLP NALAYGRDYA EEIRDSQWRI HYHIPLYASP EPPLKSTEEF IQKTHNFLRS RKGPQPHLEV ETYTWSVLPD HMKIPLAAQI ARELHYIETL
|
| |