Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1142 |
Symbol | |
ID | 6273896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1363989 |
End bp | 1365677 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642613194 |
Product | Glycosyl transferase, family 31 |
Protein accession | YP_001877749 |
Protein GI | 187735637 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.954949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.0746922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACGA GAACCGTTAC AAGCCTCTGG GTGGGCGGGG AACTTCCCCT GATGTCCGTT CTGTGCATCA AATCGTTCCT GGACCATGGC CATGCTTTCC AGCTTTTCAC CTACCGGAAT TACGACAATA TTCCCGCGGG AACGCTTGTG CGCGATGCGC GGGATATTCT CCCGGAGGAG GCGATTTTCC ATGATTCCCA CAATAGCCTG GCGCCGTTTT CCGATTGGTT CCGCATGAAA TTCCTTTCAC AGGAAGGCGG CTTCTGGGTG GATATGGACG TCATCTGCCT GGGTGATGAA CTTCCTGCCT CTCCTCTCTG GTTCTGCAGG GAGTGGGCGG AGGTGGTGGC CGTAGGCGCC ATGGCCTTTC CTCCCGGTCA TTCCGTTCCC GCAACCCTGT GCCGCCTTGC TGAGGATCCG GCGCTCCGCG TCCCCTGGGA CTCTCCGGAA GAAGTCCGGG CCAAGGAGGA ACTGCTACGC CGTGTGCCGG ATATCGCCGA TCGCCGGCGC CAGGTTCCAT GGGGATTTTG CGGCCCCACC GGGATGACGC GCGCGTTGCG CCACTGCGGC CTGTTTGACC GGGCCGCTCC GTCTTCCCAC ATGTATCCGG TCCCCTGGAC GAGATGGCGC GACTGCTACA ACGGCAGCAT ACGCCTTGCC GGGCCGGAAT TGTCCAATGC CTGGTGCGTC CACCTCTGGG GAGAGATGGC CAGGCGGGAG CCGGACGCCT GGGAAAATAT GAGCCGCAGC AGCATGGCAG GCGAGCTGCT GGACAGGCAT CTGCCGGGCC ACGCCTGGAA GCCTGCCCCC GGGCCGCGTA AAAAAGTGAA TATCCTGGTG GGCATCTGCA GCTGCACAGG CGCGGCGAAC CGCCGCAAGG CGTGCCGGGA GACCTGGCTT TCCCATCCTC AGGAGGGTGT GGAATGCAGA TTTTTCCTGG GGCGGCGCAC TCCTTTGCCC AATGAGCCCG ATGTAGTGGC CCTTTGGGTG GAGGACGATT ACAGGCACCT GCCCGCCAAG GGGCTTGCCT TTTATCAATA TGCCCTGGAA CATTATGACT TTGACTGGCT TTTCAAGTGC GACGACGATA CCTGGCTGGC GCTTGACCGC CTGGAAAGCC TCTGCGACGG CCGCTATGAC CTTGTGGGCG ACATGTCCCT GGCGGACAGG GGGTTCCCCA GCGGCGGAGC GGGCTACCTG ATGAGCCGGG CGCTTGTGGA GGGTATTGTG GCGCACGGCG GCCGGGTTCC CGCCGTCGGG GCGGAGGACG TCATCTTCGG CCGGCTGGCG CGGGAACTGG GCGCGCGCGT CCATGCCACG CCGCGCCTCT TCCTCAGCCA TGCTCCGGCG CCCCACCGCC TGAATGACCA GGTGAGCGCC CATTGGTGCT CTCCGGGCAG GATGCACGGC ATTGAGGCCC TTTTCCATGA TGAACCGGTG GCCGTTTATG ACGCCGTGCA TCCCCATTGG AGGGACGAAC TCCTGTTTTT TGCCCGGGGC CGTTTCATGC GCGGCGCCGG CGGCTGCACC GGGCGCTACG TCCTGCAGGA CGGGCTTCTC ACGCTGTTCT GGGATGACTG GGCGCCGGAA GCTCTGGAAA AAAACGGCAG CGGATTTTCC CGCGGTCCGT TCTCCCTGAC CCCTGCCGCC GGCAGCCGGC AGCTTCCTTT TCCGGAGTCC GTGTCCTGA
|
Protein sequence | MNTRTVTSLW VGGELPLMSV LCIKSFLDHG HAFQLFTYRN YDNIPAGTLV RDARDILPEE AIFHDSHNSL APFSDWFRMK FLSQEGGFWV DMDVICLGDE LPASPLWFCR EWAEVVAVGA MAFPPGHSVP ATLCRLAEDP ALRVPWDSPE EVRAKEELLR RVPDIADRRR QVPWGFCGPT GMTRALRHCG LFDRAAPSSH MYPVPWTRWR DCYNGSIRLA GPELSNAWCV HLWGEMARRE PDAWENMSRS SMAGELLDRH LPGHAWKPAP GPRKKVNILV GICSCTGAAN RRKACRETWL SHPQEGVECR FFLGRRTPLP NEPDVVALWV EDDYRHLPAK GLAFYQYALE HYDFDWLFKC DDDTWLALDR LESLCDGRYD LVGDMSLADR GFPSGGAGYL MSRALVEGIV AHGGRVPAVG AEDVIFGRLA RELGARVHAT PRLFLSHAPA PHRLNDQVSA HWCSPGRMHG IEALFHDEPV AVYDAVHPHW RDELLFFARG RFMRGAGGCT GRYVLQDGLL TLFWDDWAPE ALEKNGSGFS RGPFSLTPAA GSRQLPFPES VS
|
| |