Gene Acid345_2378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2378 
Symbol 
ID4071376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2810993 
End bp2813701 
Gene Length2709 bp 
Protein Length902 aa 
Translation table11 
GC content60% 
IMG OID637984394 
ProductBeta-glucosidase 
Protein accessionYP_591453 
Protein GI94969405 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0896649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.197642 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGACGGA ACTGGACTCC TGAAGTTTCG GCGGTCTTTG CGTTGGCGCT TGCTTTGCAG 
GCGCTTCTGG CTCCCATCCA ACTCCCGGCG CAATCGCCGG CCACCCCTGT GTATCGCGAT
GCGACCCGCC CGGCGAATGA GCGTGCGCAT GACCTTGTGC AGCGCATGAC GCTCGACGAA
AAGGCAGCGC AACTGGAGGA CTGGGCAACG GCGATCCCGC GGCTTGGGGT GCCTGATTAC
CAGACATGGA GCGAGGCGCT CCACGGTGTC GCCCGCGCGG GTCATGCCAC GGTTTTCCCG
CAGGCGATCG GCATGGCCGC CACGTGGGAC ACTGAGATGG TGAAGCAGAT GGGCGACGTC
ATCTCGACAG AAGCCCGCGG CAAATACAAC GAGGCCCAGC GCGAAGGGAA CCATCGCATC
TTCTGGGGAC TGACATTCTG GTCGCCGAAC ATCAACATCT TCCGCGATCC ACGCTGGGGG
CGCGGCCAGG AGACATACGG TGAAGATCCG TTCCTGACCG GCAAGATGGG CATCGCGTTC
ATAGACGGTG TGCAGGGGCC AGATGCGGCG CATCCCAAAG CGGTGGCCAC GAGCAAGCAT
TTCGCGGTGC ACAGTGGACC GGAGTCCTTG CGCCACGGGT TTGACGTAAA GGTGAGCCCT
CGCGATTTGG AAGAGACGTA TCTTGCCGCC TTTCGAGCAA CGGTAACGGA TGGTCACGTG
AAGAGCGTGA TGTGTGCCTA TAACGCGGTC GATGGGATGG GCGCCTGCGC GAACAAGATG
CTGCTCGAAG AGCACCTGAA GCAGGCATGG GGATTCAAGG GATTCGTTGT GTCCGATTGC
GGTGCGATTA TGGATGTAAC CCAGGGCCAT AAGAATGCGC CGGACATCGT GCACGCGGCT
GCGATTTCGT TGGCGGCGGG AACAGACCTT TCGTGCAGCA TATGGGAGCC TGGATTCAAC
ACGCTCGCGG ATGCAGTGCG TAAGGGGCTG GTGACAGAAG ATATGGTTAC GCGTGCGGCC
GAGCGGTTGT ACGCGGCGCG TTTCGAACTC GGAATGTTCG ACGAGCCGGG ATCGAACCCG
AACGACAAAA TTGACATGTC GCAGGTGGCT TCGGAGGAAC ATCGCGCAGA GGCGTTGAAA
GCGGCGGAAG AGTCGATAGT GCTGCTGAAG AACGATGGCC TCCTGCCGCT CAAGAACGCG
AAGACGATTG CCGTGATTGG GCCGACAGCC GAACTGTTGG CCTCGCTTGA GGGCAATTAC
AACGGGCAAC CGGTTCGTCC GGTGACTCCG CTCGATGGCA TCGTGAAACA GTTCGGCGCA
GAGAACGTGC GTTACGCGCA GGGATCGAGC TTGGCAGCGG GTGCGCCCGT TCCCGTACCG
CGGACGGCGT TTGAGGGTGG TCTGAAGGCT GAATATTTCG CGACGTCGGA TTGGACGGGG
CGTCCGGTGG CGACGAGCAC CGAGCCACGA ATTGACTACG ACTGGGTGTA TGCGACGCCG
GTGCCGGAGA TCCACACGCA CGATTACTCA GTGCGATGGA CTGGGACGAT CCGTGTGCCT
GCTCCTGGCA AGTATCGTTT TGCGACGGAG ACCCAGAGCG GGTTCCCATA TTCGCCGCGT
GAGAGCTATC GCGCAATGGT GGACGGCAAA CTGGTTTCGC AAGGGAAGAC CGAGGGCGAA
ACAAAACCAG CGGAGGGAGC ATCGCCAACT TCGCCGCCGC ACATGGCGAA TATGGCAAAG
GGCCAATTCG AGGTCACGTT CAGCGACACC AATCCGCACG CATTTGAATT CGGCTATAGC
CACGCTGGAG ATGAATCGGG AGGTGGAATC ACGCTGAGTT GGGAAGCGCC TCCGGAAGCG
CAGATTGCAG AAGCGGTGAA TGCTGCGAAG GCAGCGGACG TGGTGGCGGT TTTTGTGGGA
TTGTCTCCAA ATCTCGAAGG CGAAGAAATG CCGATCAAGA TTGAAGGCTT TTCGGGTGGC
GATCGAACCA GCATCGATCT TCCGGCGACG CAGGAAAAAC TGCTCGAGGC CCTTGGGGCT
GCAGGTAAGC CGGTCGTTGT CGTGAACTTG AGCGGAAGCG CAGTGGCGCT GAATTGGGCG
AACCAGCACG CAGGCGCGAT TCTGCAGGCA TGGTATCCGG GCGTGGAAGG TGGCACTGCA
ATTGCGAAGA CGCTCGCAGG CGAGAGCAAT CCGGCGGGGC GGTTGCCGGT CACGTTTTAC
GCTAGCGTGC AAGACTTGCC GGCCTTCACC GAGTACGCAA TGAAGAACCG CACCTATCGC
TACTACGCAG GCAAGCCGCT GTGGGGCTTT GGATTCGGGC TTAGCTACTC GACATTCAAG
TATGGCGAGG TGAAGCTGGC ATCCACTTCC GTAGATGCAG GGAAATCTTT GACTGCGACG
GTGACGGTGA CGAATACGTC TCAGGTCGCT GGCGATGAAG TGGTGGAAGC GTACCTCAAG
ACTCCTCAGA AGGGTGGGCC CTCGCATTCG CTCGTCGGAT TCCAGCGCGT CCCTCTGAAT
CCGGGTGAGA GCCGCGAAGT GGCGATCGAA GTCAGCCCGC GATCGTTGTC GGCAGTGGAT
GACAGCGGGA AGCGGTCGAT TCTTGCGGGC GAATATCGGC TGAGTATCGG TTCAACACAG
CCGCAGGAGA CGCAGGCAAA AAGTGAGGCG AACTTCACGG TGAAGGGAAG CGCGGAGTTG
CCGAAGTAG
 
Protein sequence
MRRNWTPEVS AVFALALALQ ALLAPIQLPA QSPATPVYRD ATRPANERAH DLVQRMTLDE 
KAAQLEDWAT AIPRLGVPDY QTWSEALHGV ARAGHATVFP QAIGMAATWD TEMVKQMGDV
ISTEARGKYN EAQREGNHRI FWGLTFWSPN INIFRDPRWG RGQETYGEDP FLTGKMGIAF
IDGVQGPDAA HPKAVATSKH FAVHSGPESL RHGFDVKVSP RDLEETYLAA FRATVTDGHV
KSVMCAYNAV DGMGACANKM LLEEHLKQAW GFKGFVVSDC GAIMDVTQGH KNAPDIVHAA
AISLAAGTDL SCSIWEPGFN TLADAVRKGL VTEDMVTRAA ERLYAARFEL GMFDEPGSNP
NDKIDMSQVA SEEHRAEALK AAEESIVLLK NDGLLPLKNA KTIAVIGPTA ELLASLEGNY
NGQPVRPVTP LDGIVKQFGA ENVRYAQGSS LAAGAPVPVP RTAFEGGLKA EYFATSDWTG
RPVATSTEPR IDYDWVYATP VPEIHTHDYS VRWTGTIRVP APGKYRFATE TQSGFPYSPR
ESYRAMVDGK LVSQGKTEGE TKPAEGASPT SPPHMANMAK GQFEVTFSDT NPHAFEFGYS
HAGDESGGGI TLSWEAPPEA QIAEAVNAAK AADVVAVFVG LSPNLEGEEM PIKIEGFSGG
DRTSIDLPAT QEKLLEALGA AGKPVVVVNL SGSAVALNWA NQHAGAILQA WYPGVEGGTA
IAKTLAGESN PAGRLPVTFY ASVQDLPAFT EYAMKNRTYR YYAGKPLWGF GFGLSYSTFK
YGEVKLASTS VDAGKSLTAT VTVTNTSQVA GDEVVEAYLK TPQKGGPSHS LVGFQRVPLN
PGESREVAIE VSPRSLSAVD DSGKRSILAG EYRLSIGSTQ PQETQAKSEA NFTVKGSAEL
PK