Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0823 |
Symbol | |
ID | 6274354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 968155 |
End bp | 969216 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642612873 |
Product | hypothetical protein |
Protein accession | YP_001877437 |
Protein GI | 187735325 |
COG category | [S] Function unknown |
COG ID | [COG3528] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.26689 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.0377182 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATGTA CGGCTTTGTG CATGTGCGCC GCCCTGCTGC CGGTGGGCTT CCTGCAGGCA GGAACCCAGC AGATAGAAGC GCCGCAGGAG GGCTCCGTTA TCAGTTTCCA TCTGGAAAAC GATATGTTCG TGGGGGATGA TGATAATTAT ACCAACGGCG TCCGCTTTGC GTGGATGTCC GGCACCACGT CCCGGAGCCA TACGTTTTCC GGCATGCTGG GAACAGTGCT GGGCGGCACG AACGCCTCGG ATTCCTGGCG GCGGTTCATG GGCATGAACG GTTCCGCCAA CCTGCGCCAG CAGTGGGGTT TGGACCTGAC CCAGCTCATG TACACCCCGG AGCAGAAGGC CACCTATCCC ATCTACAACC AGCACCCCTA TGTGGGCAAC CTGACGCTGG GGCTGACCTC CCTGGTCAAG AATGAAGACC GGGCCAATTC CCTGGAGCTG CAACTCGGCA CCACGGGCAC GAATTCCCTC GCCAAGGGCT CCCAGCATTT CATCCATAAG CTGTGGGGTA TGGAGCAATG GCCCGGCTGG GCCAACCAGC TCCCCGGAGA GATGACCGCC AATTTGTTTT TCAAGCGGTA TTACCGCCTG CGCGGACTGG AGAAGCGCTA CGGCTCCGGT TTTGAAACGG ATGCCCTGGC TTACTGGCAT GCGGACGCCG GCACAGTAAA GGTGCAGGCG GGGGGCGGCA TGTCCTTCCG CTTCGGCTAT AATCTGGGCA ATACTTCTCC GGAGAACAGC ATTCGCGGAG CGACCAGTGC AGCACCTCCC TTCGTTTATA ACAGGATGTC CGTTTCCAAT TGGGGGTATT ACGGTTATAT TCATGCTGCC GTGCGAGCCG TGGCTCATGA CCTGTATCTG GATGGTACGG TGTTTCGTTC CTCCCCCAAG TATGTGAACA AGTATCCCGT AGTGGGAGAA TGGGGTTATG GCTTCGGCTT CCGGTACAAG CGCTCGGAAT TGCTTTTCGG CCTGCATTAC ATGACCAAGG AATACACCCA GCAGGAATCC ATGCAGTGTG TGGGCATTCT CCAGCTTCGG CATACTTTTT AA
|
Protein sequence | MKCTALCMCA ALLPVGFLQA GTQQIEAPQE GSVISFHLEN DMFVGDDDNY TNGVRFAWMS GTTSRSHTFS GMLGTVLGGT NASDSWRRFM GMNGSANLRQ QWGLDLTQLM YTPEQKATYP IYNQHPYVGN LTLGLTSLVK NEDRANSLEL QLGTTGTNSL AKGSQHFIHK LWGMEQWPGW ANQLPGEMTA NLFFKRYYRL RGLEKRYGSG FETDALAYWH ADAGTVKVQA GGGMSFRFGY NLGNTSPENS IRGATSAAPP FVYNRMSVSN WGYYGYIHAA VRAVAHDLYL DGTVFRSSPK YVNKYPVVGE WGYGFGFRYK RSELLFGLHY MTKEYTQQES MQCVGILQLR HTF
|
| |