Gene Information |
Locus tag | Amuc_1933 |
Symbol | |
ID | 6275249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2345449 |
End bp | 2346867 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642613993 |
Product | protein of unknown function DUF1111 |
Protein accession | YP_001878527 |
Protein GI | 187736415 |
COG category | [C] Energy production and conversion |
COG ID | [COG3488] Predicted thiol oxidoreductase |
TIGRFAM ID | |
|
|
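The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; both are assumptions, though they are consistent with the values in this record. The function names are hypothetical and chosen only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Amuc_1933
gene_length = span_length(2345449, 2346867)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)  # 1419 472
```

Under these assumptions the computed values agree with the Gene Length (1419 bp) and Protein Length (472 aa) fields.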
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.242314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.0976167 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCTT CATGCGTGAG CTGCCATGTA AACCGCGGCC GTGATTTCAC GCCCCCCGTC CTCAGGACGA AAGAAGGCGA GCCCGGGCCG GCTTACTATC TGGGCGGTGA AAAGGGAACG GTGTTTAATG CCACCAGTAA AGCGTTTGAG CAGGAATCGC CGGCAATTAC TGAAGCCGGC CTGACGAATC GTTTCAAGGC GGGCGAAATT ATTTTTGAAG GGAACTTCGT ACCCGTGAAG CGCAATCGCT TCGGCGGTCT CGGCCCCACT TACATTAAGT CATCCTGTCT GGCCTGCCAC CCCGGTTATG GCCGCGGACA GCGCACCGGC AATTTTGACA GGCAGTACGG CAACGGTTAT CTGGCCTTTG TACATAATCC CGATGGCACC CCAGTTAAGG GGTACACGGG CATGCTGCAG ACGAAAGCGG TTCCTCCTTT TGTGCCTTAT GCCAAGGGGG TGAAAATAGA ATGGCATGAT TTTGTTGACC AGTACGGAAA CAAATACCCG GACGGAACGC CCTACAATGC GGGCAAGCCG ACGGAAGGCA CGTTGACTTA TCCCACGGCA GATGTGATTG AGCCGTTGCT TCCGCTTCCG GCCGATTACC GCGTGTCAAT CGAATCAACA ATCGGCATTT ACGGAACGGG GCTGCTTGAT GCCATCCGTG ACGAGGATAT TATTGCCGAA TACAGGCGCC AGCAGAGCAT GACAGGCCCG GTGAAGGGGA TTCCTGGCAA ATGGATTGAC GAGCCCGACG GTACCCGGCG CCTCGGAAAG TTCACGTGGG ACTGCTCGCG CGCCACACTG GAAAACGGTC CTGGCGCCAA TGCGCTTTGG AACGTGACGA ATGTAACGCG CAAGAACCGT CCGAACATCT ACATGACGCC CGAATGGCTC GAAAAACAGA AGGAACTTGG CATTGATGTA AGCGGTCTTG AAGGCCCGCA GGAAGAGGAA CTCTCAATGC AGCAGTATGA AGATTTCATG GTCTGGCATC GCGGGCTAGC CGTGCCTGCC GCCCGCAACC TGGACAAGCC TGACGTGCGC CGCGGACAGG AACTCTTCAA TAAACTGGGG TGTGCCGGTT GCCACAAGCC TGAATGGACA ACGGGGGAAT ATAAGCCGCT TCCCGGTTAT GCAAACCAGA CCATCCGCCC CTATACGGAT ATGCTGCGTC ACGATATGGG GGAAATCAAC CGCGGACGTT CTCGTTTCTG GCGTACGCCG CCCCTCTGGG GAAGGGGGCT GATGCACAAA ACCGCCAATC ATACAGATAT GTTCCATGAC CTGCGCGCCC GCGACTTTGA AGAGGCTATC CTGTGGCATT TCGGTGAAAG CGAATTCTCC CGTGAAATGT TCCGCCATCT CTCCGCCGAA GAGCGCGGCC AACTGATTCA ATTCCTGAAA GCACTTTAA
|
Protein sequence | MTASCVSCHV NRGRDFTPPV LRTKEGEPGP AYYLGGEKGT VFNATSKAFE QESPAITEAG LTNRFKAGEI IFEGNFVPVK RNRFGGLGPT YIKSSCLACH PGYGRGQRTG NFDRQYGNGY LAFVHNPDGT PVKGYTGMLQ TKAVPPFVPY AKGVKIEWHD FVDQYGNKYP DGTPYNAGKP TEGTLTYPTA DVIEPLLPLP ADYRVSIEST IGIYGTGLLD AIRDEDIIAE YRRQQSMTGP VKGIPGKWID EPDGTRRLGK FTWDCSRATL ENGPGANALW NVTNVTRKNR PNIYMTPEWL EKQKELGIDV SGLEGPQEEE LSMQQYEDFM VWHRGLAVPA ARNLDKPDVR RGQELFNKLG CAGCHKPEWT TGEYKPLPGY ANQTIRPYTD MLRHDMGEIN RGRSRFWRTP PLWGRGLMHK TANHTDMFHD LRARDFEEAI LWHFGESEFS REMFRHLSAE ERGQLIQFLK AL
|
| |