Gene Acid345_2237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2237 
Symbol 
ID4072982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2655644 
End bp2657713 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content61% 
IMG OID637984253 
Producttransketolase 
Protein accessionYP_591312 
Protein GI94969264 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.822135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATCA TTGACTCCGC CACCAAGACC GTTGTCCGCG ATTACACCGT TGAGGAACTG 
AAAGACCAGG CCGCATTAAT GCGCGGCTAC AACCTCGTCG CGCTTTGCGC CGCCGGTTCC
GGCCACGCCG GGGGCACGCT GTCCATCATG GACATTACTG CCGCCCTTTA TCTCAGCGTT
GCCAATCATG ATCCCAAGAA TCCGAACTGG GCCGAGCGTG ATCGCATCCT CTGGTCGGGT
GGACACAAGG CGCCTGCGCT TTACGTGGGG CTGGCGTTCG CGGGCTTCTG TAACAAGGAA
GAGCTGGTCA CGTTGCGCAA GCTGTATTCG CCATTCCAGG GGCATCCGCA CTGGCTCAAG
CTGCCGGGAG TGGAAGCCTC GACGGGATCG CTTGGACAGG GACTGAGCGT GGCGGTGGGT
TCGGCGCTGG CGTCGCGGCT GGACGGCCGC CGCAACAAAG TGTTCTGCAT TATGGGCGAT
GGCGAGCAGC AGGAAGGGAA CATCTGGGAG GCTGTGATGG AGGCCGCCCA CTACAAGCTC
GACAACGTGA TTGGCATCAT TGACGAGAAC CGGCTGCAGA TCGATGGCCC GGTTTGCGAA
GTCATGAACG TTGCGCCGCT CGCCGATCGC TATCGCAGCT TCGGCTGGCT GGTGATCGAG
TGCGATGGCC ACGACATGGA GCAGGTGGTG AATGCGCTGA ACCAGGCGGT GCGCAACGAA
GGCAGCGGCA AACCGACTGC GATCATCGGG AAGACCGTCA AGGGCAAGGG CGTCAGCTTC
ATGGAGAACA TCGCCGGATG GCATGGCAAG GTCCCGAACT TCGATGAGCT CGTGAAGTCG
CTCAAGGAAC TCGGCGTAGA AGAGAAGATT CCGTATCTCG CGCTGCTCGA TAAGGCAAAG
GGCTACCAGA CCGAAGTCGA ACGGAAACTC GACGCCAAGA TGCCGAAGTT CTCGAAGGAT
TACTGGTGGA ACACCTCGAC CACCATGCAA GCTGCCATGA AGCCAACCCG GATGGGCTTC
GGCGAGAGTC TCAGCGAGAA CGGCAAAGAC GAGCGCGTGG TCTGCCTCGG GCTCGACATC
TCGGGTTCCA TCACGATCAG CGAGTTCTAC GCCAAGCATC CCGAGCGCAA GAACCGATGG
TTCAGCATGG GCATTGCCGA ACAGTCGGCA ACCGCGGCGG CAGCGGGCCT GGCGCGTGAA
GGCAAGCTGC CGGTACTCGG CACCTACGCC ACATTTGCAG CCGCACGCAA CCTGGACCAG
ATTCGCGTTT CGATTTGTTA CGGCAACTTC AACGTGATGA TCGCCGGTGC GCACGGCGGC
GTTTCCGTCG GCCCAGATGG TGCGACACAC CAGGCGCTCG AAGATCTCTT CGCGATGTGC
GGTCTGCCAA ACATGACGGT TGTGGTACCG GCCGACTCGG TCGAGACGCG AAAAGCTACA
GACTTCCTGT TGCTGAAGCA TGTCGGCCCG AAGTACATCC GCTTCGCGCG CGAGGCGACG
CCGATCATCA GCACCACCGA CACACCGTTT GAATTCGGCA AAGCGAACGT CATCCGGCTG
CGCAATGAAT CGAAGAACTT CATTGAGGCC TTCGCTACGG AACTCGCGGA CGACTATCGC
AACGAGAACG AAGACCTCTC GATCGTCGCT TGCGGACCGA TGGTGCCTGA AGCGATGCGC
GCCGCTTGGA TCCTGAAGCA GGAGTTCGGC TACGAGACGC GCGTGATCAA CATGCACACG
CTGAAGCCGC TTGATCGCCG CACCATCCTG AAGGCCGCGC TCGATACCCG CGTGGTCATC
ACTGCCGAGG AGCACCAGAT CGGCGCGCTG GCCTGGCAGG TTTCGCATGC CATCATTTCC
AGCCCGGCGC TGTTTGATGT GCCGGTGATT ACCGGCGCCA TCGGCGTGAA GGACCGCTTC
GGCGACTCCG GCGCCCCATG GGAGCTGATC AAGGAATTCG AGGTCTCCGC CGAGCACATT
GCACAACAGG CAGCGGCCCT GGTGGCGCTG AAGAAGCAGC GCATCGAAGC CACGGAGAAG
GAACTCGAGC CGACTCTCAG CCTTCGCTAG
 
Protein sequence
MSIIDSATKT VVRDYTVEEL KDQAALMRGY NLVALCAAGS GHAGGTLSIM DITAALYLSV 
ANHDPKNPNW AERDRILWSG GHKAPALYVG LAFAGFCNKE ELVTLRKLYS PFQGHPHWLK
LPGVEASTGS LGQGLSVAVG SALASRLDGR RNKVFCIMGD GEQQEGNIWE AVMEAAHYKL
DNVIGIIDEN RLQIDGPVCE VMNVAPLADR YRSFGWLVIE CDGHDMEQVV NALNQAVRNE
GSGKPTAIIG KTVKGKGVSF MENIAGWHGK VPNFDELVKS LKELGVEEKI PYLALLDKAK
GYQTEVERKL DAKMPKFSKD YWWNTSTTMQ AAMKPTRMGF GESLSENGKD ERVVCLGLDI
SGSITISEFY AKHPERKNRW FSMGIAEQSA TAAAAGLARE GKLPVLGTYA TFAAARNLDQ
IRVSICYGNF NVMIAGAHGG VSVGPDGATH QALEDLFAMC GLPNMTVVVP ADSVETRKAT
DFLLLKHVGP KYIRFAREAT PIISTTDTPF EFGKANVIRL RNESKNFIEA FATELADDYR
NENEDLSIVA CGPMVPEAMR AAWILKQEFG YETRVINMHT LKPLDRRTIL KAALDTRVVI
TAEEHQIGAL AWQVSHAIIS SPALFDVPVI TGAIGVKDRF GDSGAPWELI KEFEVSAEHI
AQQAAALVAL KKQRIEATEK ELEPTLSLR