Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1232 |
Symbol | |
ID | 6275700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1480903 |
End bp | 1482393 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642613289 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_001877838 |
Protein GI | 187735726 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.271029 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.25171 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTTT GCTCCACGGA AAATATGCAG ATCGCGGAAC GCGAGCTCAT CGTTAGCGGA ACACCAGCCA GAACCCTGAT GAAACTGGCC TCTTCCGGTA TTGCGGAATC TCTCATGCAG TTCTTCCCCG ACCCGGGATT GTGCGTAGCC TATGTGGGCA AAGGGAATAA CGGGGGAGAT GCTCTCACCG TCCTTAACAT ACTGAAACAA CATGGCTGGG AAATAGGCTT CCGAACCGCT TACCCGCGCA ACGAATGGAG CGAACTTTCC GTGCGGCAGT TGGCGGAAAT ATCTCCTCCC CCTCAGGAAT ACCAGGCCCC TCCCCTTCCC CGTACGGGAA AACCTATGAT CCTGCTGGAC GGCCTGCTAG GAATAGGGGC AAAGGGAATG CTCCGCAGGG AAATCTCCGC TCTCTGTGCG GAAATGAACT ATATAAGAAA CCGTTGCGGG GCTATACGTA CCGTGGCTAT TGATATCCCC ACCGGAGTGG ATCCAGATAC GGGAATGCCT CAGCAAAATG CAGTGGAAGC AGATTTCACC ATGTGCATAG GAGCCGTTAA GCAGGGCCTG CTGGACGATG ACGCCACCCT GTTTGCCGGA CGCTTAGTCT GCATTGACCT TCCCGGTCTT CACGTACAGG CGCTCCCCGC CACGGAACTT ATTACCTCCT CACGGCTTAC CAAATTCCTC TCTGCAAGAC CCTATACGGA TTATAAGAAC AAACGCGGGC ACATTGGCGT CATAGCTGGC TCTGAAGGAA TGCTGGGTGC GGCCCGGCTC TGCTGTGAAG CAGCTCTCCG GGCAGGAGCC GGTCTGGTTA CACTGCACGT TCACAAAAAC GTCTATCCCT TAATCGCTCC ATCCATGCCT CCGGAAATCA TGGTCAGACC CGTGGACAGT TACGCAGATA TCTCCATTCG CACATTCAGT GCTTTCCTCA TTGGCCCAGG TATCGGTTCC GTATCGGAGG AAGATGCGGA AGCCATCCGC CTCATTCTGG AAACAGGTAC CCCCACTGTT CTGGATGCGG ACGGGCTGAA TCTGGCCGCG GCCATGCAGT GGAACTTGGG AGAACACGTT CTGGCTACTC CCCACCATGG AGAAATTCGC AGACTTCTGC CAGATGCAGA CAACTATGCC ATCCGGGCAG ACATTGCCGA CTGCTTTCTG GCAGAACATG AAGCCGCCCT GGTTTACAAG GGAGCACGCA CCATCGTCAC CCAACGGGGA AAACCTCTTT TTTATAACAT CACTGGGGAT CCCGGCATGG CAACTGCCGG TCAGGGAGAC GTGCTGGCAG GTATTTGCGG AGGCTTTATA AGCCAGGGGG AATCCCTGCT GGTTTCCGCC GTTCTGGGCG TCTACCTGTG TGGCCGAGCT TCCGAAATGG CAATCTCCGC CGGAGAAGCC ACCCAGCAAA CATTGACGGC AGGGGATACG CTGCGACATC TGCCTTCCGC CATTCTTTCT ACCGCACGCC TCTGTTATTG A
|
Protein sequence | MKVCSTENMQ IAERELIVSG TPARTLMKLA SSGIAESLMQ FFPDPGLCVA YVGKGNNGGD ALTVLNILKQ HGWEIGFRTA YPRNEWSELS VRQLAEISPP PQEYQAPPLP RTGKPMILLD GLLGIGAKGM LRREISALCA EMNYIRNRCG AIRTVAIDIP TGVDPDTGMP QQNAVEADFT MCIGAVKQGL LDDDATLFAG RLVCIDLPGL HVQALPATEL ITSSRLTKFL SARPYTDYKN KRGHIGVIAG SEGMLGAARL CCEAALRAGA GLVTLHVHKN VYPLIAPSMP PEIMVRPVDS YADISIRTFS AFLIGPGIGS VSEEDAEAIR LILETGTPTV LDADGLNLAA AMQWNLGEHV LATPHHGEIR RLLPDADNYA IRADIADCFL AEHEAALVYK GARTIVTQRG KPLFYNITGD PGMATAGQGD VLAGICGGFI SQGESLLVSA VLGVYLCGRA SEMAISAGEA TQQTLTAGDT LRHLPSAILS TARLCY
|
| |