Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0120 |
Symbol | |
ID | 6274912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 148008 |
End bp | 149318 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642612165 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_001876746 |
Protein GI | 187734634 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAACG AACAACATCA AAATACCCAC ACTTCCGCCG CCCCCGCCAT TCCCTCCCCC TTCCAGAAGA AAACCTGCTG GCATGCCCTG ACAGGCGTAT CCATTCTGGT GATGCTGGGC ATTGCGGCCT TCGTCATTTT TGAAGTAGTG GAACTTCTGG GGTTTCTGGA ACCGGTGCTC CTGCCCATCC TGATTGCCGC CGTCGTGGCG TATTTGCTGG AACCCATCGT TTCCTGGCTG GTGCGTCTCA AATTCTCCCG TCCATGGGCC GTAGTCACGG TCATGTTTGC GGCTCTGGCC GTTCTGGTAG GCTTTGGAGC CACGATTCTC CCTCCCCTGA TCAGGCAGAC GGATGAACTG ATCGACAACC GGATGGAGCT ATGGGACAAA ACCTCCGAAC TGATTGACTC CACCATTGAA ATCCCCTTCA TTTCCCGCAC CATTGACAGT GTTTACAGCA CCAGCCTGCG GGAACTGAAT GCCAGCCATT ATACGGAAGC GGAAGTCCAT GACCTGAGAA ACGCCCGGAC CGCCCGGGAA AAGCTGGGAG CCTACATGAC CATCAATTCC TCCTTTTACC AGGACAAGCT GATGAGCTGG CTCACTTCCG GGGGACGGGC CCTGTACAGC ACCATAGGAA TCATGGTCAG CATCCTGATC ACGCCCATTT TCGCTTTTTA CTTTCTGCTG GAGGCGGATA AAATCAAAGA GAAATGGCCC AGCATCCTGC CCCTGAAAGT CTCCAAATTC AGAAAAGACG TGGTGGACAC CATGGAGGAA ATCAACGGCT ACCTGATTTC CTTCTTCCGC GGCCAAATGC TGGTAAGCAT CATTGAAGGC ATTCTGATTG CCATCTGCCT GAAACTGATA GGCCTGCCGT ACGCCATCAC CATCGGCGCC GCGGTCTGCG TGCTGGGCAT CGTTCCCTAC CTGGGCATCA TCACCGCCTT TATCCCTGCG GTGCTGCTGG CCTGGTTCAC ATGGGGGGAT TTCCAGCACG TGCTGATTGT TTCGGGCATC TTCCTGGCCG TCAACCAATT TGACGGATGG ATCATCCAGC CGAAAATCGT GGGGGATTCC GTGGAACTCC ACCCGCTCAC GGTCATGTTT TCCGTGTTGA TCTGGACACT CATCCTGGGC GGCTTGATCG GCGCCCTGCT GGCTGTCCCC CTGACAGCCG CCATCAAGGT CCTTTACAAG CGGTACATCT GGCAAAATGC CAGCATGCGC CCCATGACGG ACCCCGTGCT TCCGCCGGAA CATCCCGGCG AACAGCCTCC GGACCCCCCC GAAGGTTCGG CCCACGCTTA A
|
Protein sequence | MNNEQHQNTH TSAAPAIPSP FQKKTCWHAL TGVSILVMLG IAAFVIFEVV ELLGFLEPVL LPILIAAVVA YLLEPIVSWL VRLKFSRPWA VVTVMFAALA VLVGFGATIL PPLIRQTDEL IDNRMELWDK TSELIDSTIE IPFISRTIDS VYSTSLRELN ASHYTEAEVH DLRNARTARE KLGAYMTINS SFYQDKLMSW LTSGGRALYS TIGIMVSILI TPIFAFYFLL EADKIKEKWP SILPLKVSKF RKDVVDTMEE INGYLISFFR GQMLVSIIEG ILIAICLKLI GLPYAITIGA AVCVLGIVPY LGIITAFIPA VLLAWFTWGD FQHVLIVSGI FLAVNQFDGW IIQPKIVGDS VELHPLTVMF SVLIWTLILG GLIGALLAVP LTAAIKVLYK RYIWQNASMR PMTDPVLPPE HPGEQPPDPP EGSAHA
|
| |