Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1195 |
Symbol | |
ID | 6273838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1431708 |
End bp | 1432556 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642613246 |
Product | 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
Protein accession | YP_001877801 |
Protein GI | 187735689 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000267147 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAGTT GCAAGGCTCC GTGCAAAGTG AACGTTTCCC TCCGCGTTCT GGGAAAGAGG CCGGACGGCT TCCATGAGGT GGATACCGTG ATGGTTCCCT TGGATTTGTG TGATGTGCTG GAGTTTTCGC CGGCGGCCTG TCTGGAGATG AGCTGCGACG CCCCGGGCGT GCCTCTGGAT GAGAGCAACC TGGTCATGAA AGCGGGCCGG CTGATGGAGC GGGAGCTGGG AAGGCCCATG CCGTGGCATG TCCGCCTGGT GAAAAAGGTG CCCCATGGCG CGGGGCTGGG CGGCGGCAGC AGTGACGCCG CATGCGTGCT GTGCACCCTC AATGAGCTGG AACGCGGAGG TTTGTCCCGG GAGCGCCTGG CGGAGCTGGG CGGGGAAATC GGTTCCGACG TGGGGTTTTT CATTTATGGC GCGGCGAGCC GCTGCACCGG GCGCGGAGAG AAGGTGGAGC CCTTGCCGGA GTGGAAGGGG TGGCGTCCCC GGGTTGTCTT GTTGAAGCCG TCGTTTGGTG TCTCCACGCC GGACGCTTAC CGCCGCTGGT CCGGTTCCAG GGAATTGCCC GGCATTCCCT ATGGGGAGCA GGATGTGGAC GGCCATGTGC TAGTCAATGA TCTGGAAAGG CCTGTGTTTG AAAAGCATTT ATTCCTGGCG GAAGTGAAGC GCTGGCTGAT GGGGCGGCCC GGGGTGCGCG GCGCCATGAT GTCCGGTTCC GGTTCCACCA TGTTCGCCGT GGTGGAGGAT GAAGGAACCG GACGCGCGCT GATGGAGGAT GCCGCCCGGG AGCTGGACCC CACTTTATGG ATGTGGTCCG GCCTGGTGAT GCAGGATGAC GCGCGGTAA
|
Protein sequence | MISCKAPCKV NVSLRVLGKR PDGFHEVDTV MVPLDLCDVL EFSPAACLEM SCDAPGVPLD ESNLVMKAGR LMERELGRPM PWHVRLVKKV PHGAGLGGGS SDAACVLCTL NELERGGLSR ERLAELGGEI GSDVGFFIYG AASRCTGRGE KVEPLPEWKG WRPRVVLLKP SFGVSTPDAY RRWSGSRELP GIPYGEQDVD GHVLVNDLER PVFEKHLFLA EVKRWLMGRP GVRGAMMSGS GSTMFAVVED EGTGRALMED AARELDPTLW MWSGLVMQDD AR
|
| |