Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_2050 |
Symbol | |
ID | 6274762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2494850 |
End bp | 2495860 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642614111 |
Product | protein of unknown function DUF1568 |
Protein accession | YP_001878641 |
Protein GI | 187736529 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.633564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.705399 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCA CCCTGAGAAT AGAATACCCC GGAGCAGTCT ATCACGTGCA GAGCGAGGGC AACCGCGGAG ATGCCATTTA CCTGGATGAC GAAGACAGGG AAACCTTTTT GCGAACGTTT CAGGAGGCAG CCCGCAGAAG CGGCTGGACT GTGTACGCCT ACGCCCTGAT GGCCAATCAC TACCACATTC TTTTCCAAAC CGCGCGCGCC AACCTGGTGG ATGGCATGAA ATGGCTCCAG ACTGCCTATA CCCAGCGTTT TAACGCACGC CACCGAATGC GGGGTCATCT GTTTGCGGGC AGATACCACT CCATGGTGGT GGAAGCGGAT AACGCCCACT ATTTTTCAAC CATCATTGAC TACATCCACC TGAATCCGGC CCGGTCCGGC TTGGCGCGAC GCCACACGTT CCTTTCCGGC TGCAAATGGA CGAGCCTGCC CGCGTGGCTG GCCTCTCCCG CCAAAAGGCC CAAATGGATT CATCCGGAAC GCGGCCTCGT CTGCTTCGGC TGCGACGACA CGGAAGACGG CCGCCAAAAA TACCTCAACC ATCTGATGGG CCGTTTTGAA GCGGAACGCA TGGATGAACG CTCCCTGCTG CCCGCCGGCC ACGTTGGCCC CGGCACCGTC CAGCGGGGCT GGTGCTATGG CTCCAGCGCC TTCCGTGCCA GGCTGGTGGG GGAACTTCCC CGTCTGGCGC GCAGAAAGCC CGTCACGACG GGCATGCGAG CGTCGGAAAT TGGGGAGTAC CAGGCGGAAA TTATTGTAAA AAACGGTCTG AAGGCCTTCG GCCTCTCGGA GGAGGAGCTG CTTGTCACGC CTTACAGCCA CCCGTCCAAG CTCATCATCG CCCTGGCCGT CCGGCAAAGC ACGCTGGTAC CGTATGCCTG GATCAGCAAC CGGCTGCACA TGGGCATTCC CAAATCCATG GGAACCTTGC TCCACCGGGC AAAAAAAATG GCGGAAACGG ATTTGAAAAC GAGGGCGTGG ATCGAGCGCC TGAGCTCCTG A
|
Protein sequence | MSRTLRIEYP GAVYHVQSEG NRGDAIYLDD EDRETFLRTF QEAARRSGWT VYAYALMANH YHILFQTARA NLVDGMKWLQ TAYTQRFNAR HRMRGHLFAG RYHSMVVEAD NAHYFSTIID YIHLNPARSG LARRHTFLSG CKWTSLPAWL ASPAKRPKWI HPERGLVCFG CDDTEDGRQK YLNHLMGRFE AERMDERSLL PAGHVGPGTV QRGWCYGSSA FRARLVGELP RLARRKPVTT GMRASEIGEY QAEIIVKNGL KAFGLSEEEL LVTPYSHPSK LIIALAVRQS TLVPYAWISN RLHMGIPKSM GTLLHRAKKM AETDLKTRAW IERLSS
|
| |