Gene Amuc_1094 details


Gene Information

Locus tag: Amuc_1094
Symbol:
ID: 6274008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1305302
End bp: 1306300
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 58%
IMG OID: 642613145
Product: ROK family protein
Protein accession: YP_001877701
Protein GI: 187735589
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID: [TIGR00744] ROK family protein (putative glucokinase)
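
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length; the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1305302
END_BP = 1306300
GENE_LENGTH_BP = 999
PROTEIN_LENGTH_AA = 332


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```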


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000124678
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.000000000129961
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCTTTT CAGAACCCTG TGCCTTGGCC GTTGATTTCG GCGGCACGAG CATCAAAATG 
GGCGTAACGG CGGGGGATCG TATTCTGGCG ACGGCCGACC GCATCCCTAC TGCCATGTTC
GAAAGCCCGC AGGCAATCAT TGATGCCATG ATTGCGTCCG CCCGCACCCT GCGCGGACAA
TTCCCCTCCG CCTGTGTGAT GGGCATGGGA ATGCCGGGAT GGTGTGATTA CCAGCGGGGA
GTGCTTTACC AGCTTACCAA TGTGAGGGTC TGGGATAGGG AAATTCCGGT GAAAGAGATG
ATGGAGCAGG CCCTGGGCCT CCCCGTCGTG CTGGATAATG ACGCCAACTG CATGGCTTAT
GCGGAATGGA AGCTTGGCGC CGGGCGCGGC ATGTCCAGCC TGGTGTGCCT GACGATGGGA
ACGGGGATAG GCGGGGGAAT CGTGGTGCAT GACCGCATGC TGCGGGGAAG GCGGCTTTCC
GCTGCGGAAC TGGGCCAGAC CAGCATTCAT TACCAGGGGA AAACGGGACC GTTCGGCAGC
CGGGGAGCCA TTGAGGAATA CATCGGCAAC AACGAACTGG CGGCGGAGGC GGTTAAACGG
TATGCCGGGG CGGGAATCAT CAAGACGGTG GATGAATGCA CGCCCAGGCA TCTGGACGAG
GCTGCCCGGT CCGGATGTCC TATAGCCCTT CAATTATGGG AAGATACGGC GGAAATGCTG
GGCTGCCTGA TCATGAACCT GATGTATACG CTGGTGCCGG ACGCCTTCAT CATCGGGGGC
GGTGTGGCCA AGGCAGGGGA TTTGTTGATG AAGCCGCTGC TGGAGAACCT CAGGAAACAG
TTGTTTCCTC TCCTGATGGA GGATTTGAAA ATTCTGCCTG CCAGATTTGG AGCGGAGGCG
GGGTTGCTGG GAGCGGGAGC CATGGCGATG GATGAATTCA TGGGGCTGGG GATTTTGGAA
CGGTTTAAGA ACCAGAAATC AACGCAGACT TTTTGTTAG
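
A GC content figure like the one listed for this gene can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is simply the share of G and C bases among all bases; `gc_percent` is a hypothetical helper, and the record does not state how its own value was derived or rounded.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: paste the gene sequence (10-base groups and line breaks are fine).
first_line = "ATGAGCTTTT CAGAACCCTG TGCCTTGGCC GTTGATTTCG GCGGCACGAG CATCAAAATG"
print(round(gc_percent(first_line), 1))
```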
 
Protein sequence
MSFSEPCALA VDFGGTSIKM GVTAGDRILA TADRIPTAMF ESPQAIIDAM IASARTLRGQ 
FPSACVMGMG MPGWCDYQRG VLYQLTNVRV WDREIPVKEM MEQALGLPVV LDNDANCMAY
AEWKLGAGRG MSSLVCLTMG TGIGGGIVVH DRMLRGRRLS AAELGQTSIH YQGKTGPFGS
RGAIEEYIGN NELAAEAVKR YAGAGIIKTV DECTPRHLDE AARSGCPIAL QLWEDTAEML
GCLIMNLMYT LVPDAFIIGG GVAKAGDLLM KPLLENLRKQ LFPLLMEDLK ILPARFGAEA
GLLGAGAMAM DEFMGLGILE RFKNQKSTQT FC
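
The protein sequence follows from the gene sequence by codon translation. Below is a minimal sketch, assuming the amino-acid assignments of translation table 11 match those of the standard genetic code (the two differ in permitted start codons, not in amino-acid assignments); `translate` is an illustrative helper, not the tool used to produce this record.

```python
# Amino acids for the 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(sequence: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if any(base not in BASES for base in codon):
            protein.append("X")  # ambiguous or unknown base
            continue
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first six codons and the last three codons of the gene sequence above.
print(translate("ATGAGCTTTT CAGAACCC"))
print(translate("TTTTGTTAG"))
```

Applied to the full gene sequence, the same function can be compared against the listed protein sequence once the grouping spaces are removed.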