Gene Information |
Locus tag | Amuc_0627 |
Symbol | |
ID | 6274195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 735567 |
End bp | 737087 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642612678 |
Product | hypothetical protein |
Protein accession | YP_001877245 |
Protein GI | 187735133 |
COG category | |
COG ID | |
TIGRFAM ID | |
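
The coordinate and length fields above can be cross-checked with simple arithmetic: an inclusive span from Start bp to End bp should equal Gene Length, and dividing by three and dropping the stop codon should give Protein Length. A minimal sketch follows, assuming 1-based inclusive coordinates and a single terminal stop codon; the function name `check_cds_lengths` and the result keys are hypothetical and not part of the database.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    codons, remainder = divmod(span, 3)   # whole codons and leftover bases
    expected_protein = codons - 1         # drop the stop codon
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "in_frame": remainder == 0,
        "expected_protein_aa": expected_protein,
        "protein_length_matches": expected_protein == protein_length_aa,
    }


# Values taken from the Amuc_0627 record above.
result = check_cds_lengths(735567, 737087, 1521, 506)
print(result)
```

For this record the span is 1521 bp, which is 507 codons, so 506 amino acids after removing the stop codon, in agreement with the reported values.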
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.300902 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGAA TCCTTCCATT ACTATCCCTT CCCCTGCTGT CGCTCGCAGC GGTATTTGCC GCTAATACGC CGGAACACAT CGGCAACGAC CTCAAACTGT TCAAAGACTC CTCCTGCACC TCCCTGAAGC CGGATGTGAA AAACACCTCC GCCTTCCAGT CTGACGCGAT GAAGGAACTC GCGACCAAAA TCCTGGCAGG CCATTACAAG CCGGACTACC TGTATGCCGA ATACCGGGCC CTCCCCTCAC CGCGCCAGAC GGGGAAAAAT CTCAGAATCG GCGACGGATT CAGCAAATAC GACAACATGA CGGGCGTCTA TCTGGAAAAG GGGAGGCATG TCGTCCTGGT CGGCAAGACG GAGGGGCAGG AAATCAGCCT CCTTCTGCCG AACCTGATGC GCAAGCCGGC CGAAGGAGTG CAGCCCACAA AAGACCCCAA CGGCTGGGGA TTGCATAAAA AGCAGATTCC GCTCAAGGAA GGAATCAACA TCATTGACGT GGAAACGCCC GCCAACGCCT ATATCAGCTA TTTCACCGAA GACGCCGGCA AGGCGCCGAA AATCCCCGTC CATTTCGTAA CCGGCAAAGC CAACGGCTAC TTTGACACCA CCCGGGGCGA CACCAACAAG GACTGGGTTC GCCTGCTTGA CCAGGCCGTC TCTCCGATCA TGGATGCCCG GGGAAAATAC ATCCAGGTTG CCTACCCCGT AGAATTCCTG AAAAAATTCA CCAAAGACCG CGGAACCGAA CTCATCAACG CCTACGACAA GCTCATCGGC ATCCAATACC AGCTGATGGG CCTGGATAAA TACGGCAAAA TCCCGGAAAA CCGCGTCCTG GCCCGCGTGA ACTTCAACTA CTACATGTTC CGCGACGGAG ACGGAGTCGC CTACCTCGGA AACGACGGGA CAATGCGCAT GGTAACCGAC CCTGAAAACG TTCTCAAGGG CGATGCCTGC TGGGGATTCT CCCACGAAGT CGGACACGTC ATGCAAATGC GCCCGATGAC CTGGGGCGGC ATGACGGAAG TCAGCAACAA CATCTTTTCC CTGCAGGCCG CAGCCAAAAC AGGCAATGAA AGCCGCCTGA AGCGCCAGGG CAGCTACGAC AAGGCGCGCA AGGAAATCAT CGAAGGGGAA ATCGCCTACC TGCAATCTAA GGATGTCTTC AATAAGCTGG TTCCCCTGTG GCAGCTCCAT CTTTACTTTA CCAAAAACGG ACACCCCGAC TTCTATCCTG ACGTCATGGA GTACCTGCGC AACAACGCGG GAAACTACGG AGGGAACGAC ACCGTCAAAT ACCAGTTCGA ATTCGTCAAG GCATGCTGTG ACGTCACAAA AACCGACCTG ACAGACTTCT TTGAGAAATG GGGATTCTTC AAACCCGGAA AATTCCACAT CGGCGACTAT GCCCAGTACG ACTTTAACGT CACCCCTGAA ATGGTGGAGG AAACGAAAAA GTGGATTGCC GGCAAAGGCT ACCCGAAACC CGAAACCGAC ATCACCGAAC TAAGCGAGTA A
|
Protein sequence | MNRILPLLSL PLLSLAAVFA ANTPEHIGND LKLFKDSSCT SLKPDVKNTS AFQSDAMKEL ATKILAGHYK PDYLYAEYRA LPSPRQTGKN LRIGDGFSKY DNMTGVYLEK GRHVVLVGKT EGQEISLLLP NLMRKPAEGV QPTKDPNGWG LHKKQIPLKE GINIIDVETP ANAYISYFTE DAGKAPKIPV HFVTGKANGY FDTTRGDTNK DWVRLLDQAV SPIMDARGKY IQVAYPVEFL KKFTKDRGTE LINAYDKLIG IQYQLMGLDK YGKIPENRVL ARVNFNYYMF RDGDGVAYLG NDGTMRMVTD PENVLKGDAC WGFSHEVGHV MQMRPMTWGG MTEVSNNIFS LQAAAKTGNE SRLKRQGSYD KARKEIIEGE IAYLQSKDVF NKLVPLWQLH LYFTKNGHPD FYPDVMEYLR NNAGNYGGND TVKYQFEFVK ACCDVTKTDL TDFFEKWGFF KPGKFHIGDY AQYDFNVTPE MVEETKKWIA GKGYPKPETD ITELSE
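
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to strip the spacing and compute a GC fraction that could be compared with the GC content field; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Return the fraction of G and C bases in a nucleotide sequence.

    Returns 0.0 for an empty sequence to avoid division by zero.
    """
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGAACCGAA TCCTTCCATT"
print(clean_sequence(first_blocks))
print(gc_fraction(first_blocks))
```

Passing the full Gene sequence value to `gc_fraction` would give the fraction for the whole gene, to be multiplied by 100 for comparison with the reported percentage.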