Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1145 |
Symbol | |
ID | 6273890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1372509 |
End bp | 1374116 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642613197 |
Product | hypothetical protein |
Protein accession | YP_001877752 |
Protein GI | 187735640 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0989818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0000170395 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAAAT ATTCATCCTG CCTTATCCAC CGGAATTTTC TGGTGGGAAT TAGCTATTCG GACGCCACGC CCGGCCAGAG CTTCGCCTAC AATCCCTACG GAGAACTGGA AACCGACAAC CTGCGAGTGG GCAACCACAC CCACCTCATC ACCGAGAAAA AAGACGCCTT CGGGCGCAGC ACAGGCTACA TCTACGCGCG GAGCGGCGCG GCGCAGCACA CCGTAAGCAT CGGCTACGGG GAAGACGGCC GCATCGCCAC GGCCGGGTTC CTGCAGGGCA GCGCGCCCCA GACCTTCACC TGGCAATACA TGGAAGGAAG CGGCCTGCCC TCCGTGATGG CCATGCCCAA CGGCATGACG CTGGAATGGG GCTATGAGGA AAAGCGGAAT CTGGTCTCTA CCATGGTCTA CAAACGCGGC GCCACAAGAG TTGTGGAAAG GGAATACGTT TACGACAGCC TGGCAAGGCC CGTTACGCGG CGCACGGCCC GCCAGGGGAA TACCGTCAAC GACAGCTTTG CCTGCAACAG CCGGAGCGAA CTGACCGCCG CCACCGTCAG CGGCAAGACA TACGGCTACA GTTACGACAA CATAGGCAAC CGGAAAACCG CGCGGGAAGA CGCGGAAGAA GCGACCGCCT ACACAACCGG CCCGCTCAAC CAATACACCG CCATCGAGCG CGGAGAAGAA GCCGCCTTTG AGCCGGTTCA CGACGCGGAC GGCAATCAGA CATTGATCAG GACATCCACC GGCATCTGGC AAGTCGCCTA CAACGCGGAA AACCGCCCGG TGCGCTTCGT CAATGAAAGC GCCAAAACCG TGGTGGAATG CACCTACGAC TACATGGGCA GGCGGCACAC CCGCAAAGTG AGCGTCAACG GGACGGTGAG CAGCTACCTG CGCTACATGT ACCGTGGCTA CCTGCAAATA GCCGCCATAG ATGCCGTCAG CGGAGTCTTT CGATGGTTCC TGTTCTGGGA CCCGACGCAG CCTGAGGCTG CGCGTCCGCT GGCCATCCGC AAAGACGGCG CCTGGTACGC CTACGGGTGG GATCTGACCG GGAACGTCAC GGGAATCTTC GGGAAAGCCG GTTACCTGCG GACGGTCTAC ACCTACACGC CTTACGGAGA AGTCACCGCC GAAGGGGATG TCACCCAGCC CATCCAGTGG AGCAGCGAAT ACAGTGACGA AGAACTGGGG TTGGTCTACT ACAACTACCG GCATCTCAAT CCGCACGACG GAAGGTGGAT CAGCCGCGAT CCCATCGAGG AAGAAGGTGG TTGGAATTTG TTCGCGTTTG TAGGAAATAA AATTTTTAAT CAATCTGATA TTTTAGGGTT GATATGTACA ATAGAATATA GTATAAAATT ACATACAATA TTAATAAGGA AGGTAGATAA AGATAGTAAT ATACTTCGTT TAACGACGAG TAGAGTTTTT TCTGGAAATG GTGATGGGAA AAATAATCCG GACAATGTTG GAAATAAAGA TAACGGTCCT ATACCACCAG GGAAATACTA TGTGATAAAA AGACAATCTG GAGGAATTCG GAGCCAAATT AAAGATTGGA CCTACAAATT ATGGAATGAT AATGATAAGA ATCAATAG
|
Protein sequence | MSKYSSCLIH RNFLVGISYS DATPGQSFAY NPYGELETDN LRVGNHTHLI TEKKDAFGRS TGYIYARSGA AQHTVSIGYG EDGRIATAGF LQGSAPQTFT WQYMEGSGLP SVMAMPNGMT LEWGYEEKRN LVSTMVYKRG ATRVVEREYV YDSLARPVTR RTARQGNTVN DSFACNSRSE LTAATVSGKT YGYSYDNIGN RKTAREDAEE ATAYTTGPLN QYTAIERGEE AAFEPVHDAD GNQTLIRTST GIWQVAYNAE NRPVRFVNES AKTVVECTYD YMGRRHTRKV SVNGTVSSYL RYMYRGYLQI AAIDAVSGVF RWFLFWDPTQ PEAARPLAIR KDGAWYAYGW DLTGNVTGIF GKAGYLRTVY TYTPYGEVTA EGDVTQPIQW SSEYSDEELG LVYYNYRHLN PHDGRWISRD PIEEEGGWNL FAFVGNKIFN QSDILGLICT IEYSIKLHTI LIRKVDKDSN ILRLTTSRVF SGNGDGKNNP DNVGNKDNGP IPPGKYYVIK RQSGGIRSQI KDWTYKLWND NDKNQ
|
| |