Gene Amuc_0530 details

Gene Information

Locus tag: Amuc_0530
Symbol:
ID: 6275284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 622491
End bp: 624524
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 60%
IMG OID: 642612580
Product: transketolase
Protein accession: YP_001877149
Protein GI: 187735037
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0021] Transketolase
TIGRFAM ID: [TIGR00232] transketolase, bacterial and yeast
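
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(622491, 624524)   # 2034, matches "Gene Length"
length_aa = protein_length(length_bp)     # 677, matches "Protein Length"
print(length_bp, length_aa)
```

Under those assumptions, 624524 - 622491 + 1 gives 2034 bp, and 2034 / 3 - 1 gives 677 aa, in agreement with the record.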


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTGA TCCGCTCATA TTGCCCGGGC ATGAATCTTG ACCTTCTCCA AAAAGCCGCC 
AACCAGGCGC GGGGACTCGC CATGGATGCC GTTCACGACT GCGCATCCGG CCACCTGGGC
CTGCCGCTGG GGTGCGCCGA AATCGGGGCC GTCCTGTTCG GCGATTTGCT CAACATCTGC
CCTTCGCAGC CCCGCTGGCT CAACCGTGAC CGCTTCATTC TTTCCGCCGG ACACGGCTCC
ATGTTTCTTT ACGGCTGGCT CCACCTGTCA GGATTCAACA TCGGCATTGA AGATATCAAG
AATTTCCGCC GCAAAGGTTC CATCACGCCC GGCCACCCGG AATTCCGGGA TACGGAGGGT
GTAGAATGCA CCACGGGACC GTTGGGCCAG GGCATTGCGA ACGCCGTAGG CTTCGCCCTC
TCCGCCAGGC GCGCGGCTGC CCGGTTCAAC AGGCCCGGCA TGGATATCTT CACCCAGCAC
GTCTTCTGCC TCACGGGAGA CGGCTGCCTG CAGGAAGGCG TCGCCCGGGA ATCCCTCGCC
CTGGCCGCCG TGCTCAAGCT GGATAACCTT ATTCTCATTT ACGATTCCAA CGACATCACG
CTGGACGCTC CCGCGGAACG CACCCAGCTC ACAGATCCGC GCGCCGTATA CGAAGCCCTG
GGCTGGGATG TGCGCCAGAT TGACGGGCAC GATATCAGGG CCATTGAGTC AGCGGTTGAA
GCCGCCAAAA ACGCCAAAAA CGGGAAACCC CAGCTCATCA TTGCCAAAAC GGTCATCGGC
AAGGGCATTC CCGGCATTGA AGGCACCACA AAAGGCCATG GGGAAGGCGG AGCCAAGCTT
CAGGAAGAAG CGCACGCCAA CTGGGGAATT CCCGCCGGAG AACGCTATTA CGTCTCCGAA
GACGTCCGTA CCGCTTTCGC AAACCTGAAA GCCCAACGGG AAAAAGATTT CAATGCCTGG
AACGCCATGT ATGAACAATG GCGCCGGGCT TATCCGGAAC TGGCAGAGGA ACTGGACGCA
GGCATCAACG CCTGCTCCTG CGGCGTCAAT CCGGCAGACT CGGACAAGGC AATACCGCCC
TTCCCGCAGG ACTATGGCGA TGCTACCCGT TCGGCCGGAG CCGTTGCCAT CAACGCCATC
GCTAAAGCGA ATCCCTGCTT CCTGACCACC AGCGCAGACC TTTACAGCTC CAATAAAAAC
TACCTTTCCG GCGCAGGAGA CTTCTCTGCG GAAACCCCGG AAGGGCGCAA CTTCTGGTTC
GGCATCCGCG AACACGCCAT GGCCGCCATC TGCAACGGCA TTGCCTACGA CGGCCTGTTC
CGCGTAAGCG CCGCCACTTT CTGCGTCTTC GTAGACTACA TGCGCGCCTC CATCCGTGTA
GCCGCCCTCA GCGGACTCCC CGTCACCTAT ATTCTGACGC ATGACTCCGT AGCCGTGGGA
GAAGACGGCC CCACCCACCA GCCGGTGGAA ACAATTTCCG GCCTCCGTGT CATTCCGAAC
CTGGATGTCA TCCGCCCGGC GGACCCGGAA GAAACCGCAG GAGCCTGGAT GGCCGCCATG
CAGCGTGCCG ACGGCCCTAC CGCCCTCATC CTGACCCGTC AGAAAGTGGC TACGCTGAAC
GGAATCCCCG TTGAAACGCG CCGGGAAGGC GTGCTGAAAG GCGCCTACAT CGCCCGGAAA
GAACAGGGAG CCTTGAAAGC CATTATTCTT GCCAGCGGTT CCGAACTGGA ACTGGCTCTG
AAAGCGGCGG AAAAAACAGG GGAAGGAATC CGGGTCGTCT CCATGCCAAG CTTCTGCCGC
TTTGACGCGC AGCCTGCCGA ATACCGGGAA AGCGTGCTTC CCTCCTCCTG CATGAGGAGA
GTTTCCGTAG AAGCCGGAGT CACGGACCTC TGGTGGAAAT ATCTGGGCTG CCAGGGGGAA
GCCGTGGGCA TCAACCGTTT CGGCTTCTCC GCTCCCGGAA CACAGGTGCT GGAAGAACTC
GGCATGAATG TGGACAACGT CGTTGCCGCC GTCCACAAGG TTCTGGCCAA ATAA
 
Protein sequence
MKLIRSYCPG MNLDLLQKAA NQARGLAMDA VHDCASGHLG LPLGCAEIGA VLFGDLLNIC 
PSQPRWLNRD RFILSAGHGS MFLYGWLHLS GFNIGIEDIK NFRRKGSITP GHPEFRDTEG
VECTTGPLGQ GIANAVGFAL SARRAAARFN RPGMDIFTQH VFCLTGDGCL QEGVARESLA
LAAVLKLDNL ILIYDSNDIT LDAPAERTQL TDPRAVYEAL GWDVRQIDGH DIRAIESAVE
AAKNAKNGKP QLIIAKTVIG KGIPGIEGTT KGHGEGGAKL QEEAHANWGI PAGERYYVSE
DVRTAFANLK AQREKDFNAW NAMYEQWRRA YPELAEELDA GINACSCGVN PADSDKAIPP
FPQDYGDATR SAGAVAINAI AKANPCFLTT SADLYSSNKN YLSGAGDFSA ETPEGRNFWF
GIREHAMAAI CNGIAYDGLF RVSAATFCVF VDYMRASIRV AALSGLPVTY ILTHDSVAVG
EDGPTHQPVE TISGLRVIPN LDVIRPADPE ETAGAWMAAM QRADGPTALI LTRQKVATLN
GIPVETRREG VLKGAYIARK EQGALKAIIL ASGSELELAL KAAEKTGEGI RVVSMPSFCR
FDAQPAEYRE SVLPSSCMRR VSVEAGVTDL WWKYLGCQGE AVGINRFGFS APGTQVLEEL
GMNVDNVVAA VHKVLAK
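
Both sequences are printed in groups of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, along with a GC-content calculation and a stop-codon check. The helper names (`join_blocks`, `gc_fraction`, `ends_with_stop`) are hypothetical, and the stop codon set assumes the standard bacterial codes TAA, TAG and TGA used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks between 10-character groups."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(1 for base in dna if base in "GC") / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the last three bases form a stop codon."""
    return dna[-3:] in STOP_CODONS


# Last printed line of the gene sequence, used here as a short excerpt.
tail = join_blocks("GGCATGAATG TGGACAACGT CGTTGCCGCC GTCCACAAGG TTCTGGCCAA ATAA")
print(ends_with_stop(tail))
```

Applying `gc_fraction` to the full joined gene sequence would be one way to compare against the GC content reported in the record.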