Gene Acid345_3246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3246 
Symbol 
ID4072581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3843696 
End bp3846341 
Gene Length2646 bp 
Protein Length881 aa 
Translation table11 
GC content60% 
IMG OID637985267 
ProductBeta-glucosidase 
Protein accessionYP_592321 
Protein GI94970273 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.190497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.145992 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATTC GTGCATTCCC TATTTGCCTG CTGTTGCTGG CCTTCCTTGT CCCTCTGACC 
ACGGCGCAGA ATGCTGAACA GCCTGCGTAT CTAAACCCTT CCCTGGCGCC GGAAAAGCGA
GCGGCAGACC TCGTCCATCG CATGACCGTG GAAGAGAAGG TCAGCCAACT CACCAACGAT
TCGCGTGCCG TTCCTCGTTT GAATGTTCCT GACTACGACT GGTGGAGTGA AGCGCTCCAC
GGCGTGGCGC AACCGGGCGT CACCGAATAT CCGCAGCCCG TGGCGCTGGC AGCCACCTTC
GACAACGATA AGGTCCAGCG CATGGCCCGC TTTATCGGCA TCGAGGGCCG CATCAAGCAC
GAAGAAGGCA TGAAGGACGG CCATAGCGAT ATCTTCCAGG GCCTCGATTT CTGGGCGCCG
AACATCAACA TCTTTCGCGA CCCTCGCTGG GGACGCGGCC AGGAGACCTA CGGCGAAGAT
CCGTTCCTGA CCGCGCGCAT GGGCGTGGCT TACGTGAAGG GCCTGCAAGG CGACGACCCC
AAGTACTACC TCGCCATCTC TACGCCCAAG CATTACGCGG TGCACAGCGG CCCTGAGACC
ACGCGCCACT TCGCGGACGT GAAGGTCAGC AAGCACGACG AGCTCGATAC CTATCTGCCC
GCCTTCCGCG CCACCGTCAC CGAAGCCAAG GCCGGCTCCG TCATGTGTGC CTACAACAGC
ATCAACGGAC AGCCTGCTTG CGTGAACGAA TTCCTGCTGC AAGACCAGTT GCGCGGCAAG
TGGAACTTCC AGGGCTATGT CGTCTCCGAT TGCGAGGCGA TCATCAACAT CTATCGCGAC
CACAAGTTCA CCAAGACTCA GGCGGAAGCC TCCGCGCTCG CGGTACAGCG CGGCATGGAC
AACGAATGCG TCGACTTCGG CAAGCAAAAG GACGATCACG ATTACCGTCC CTACTTCGAC
GCCTACAAGC AGGGCATCTT GAAAGAGAGT GAGATCGATA CCGCGCTCGT TCGCCTCTTC
ACCGCCCGCA TGAAGCTTGG CATGTTCGAT CCGCCGGAAA TGGTCCCCTA TTCCAAAATC
GATCCCAAGG AATTGGAGAG CGCCGAGCAT CGCGAGTTGG CACGAACCCT GGCGAACGAA
TCCATGGTGC TTCTGAAGAA CGATGGCACG CTGCCGCTGA AGAAGTCGGG GCTGAAGATA
GCGGTGATCG GCCCATTGGC GGAACAGACG CGCTATCTCC TCGGCAATTA CAACGGCACA
CCGTCACACA CCGTCTCCGT GCTCGAAGGT CTCAGGGCGG AGTTCCCCGA CGCGCAGATC
ACCTTCGAAC GTGGCACCCA ATTCCTCGAT CAGAACGGCG AAGCGGTTCC CTCCAGGGCG
TTGACTACCG AGGACGGAAA GCCTGGCTTG AAAGCGGACT TCTCCACCGG TGAGTTCTTT
GGCGACAAGA TTCCGCTCAC GTCCGCGCAG GCGAGCAATG TTGATTTCAC CAACAAAGAC
ATCCCGCAGG CTGCCGCTGG AAAGTTCCCG CTCAACGTGG AATGGAGTGG CTTCCTCACC
GCCAGTGAGA CCGGGCGGTA CAGCCTCGGC GTGCGTGCGC TAGGAAACGC CGCCATCGTC
GAAGTAGATG GCAAGCCCCT CGCCAGGGAG TGGCTCGACG GACAGCACGT GCAAACCGCG
GTCGGCCACA TTCATCTCGA GCAAGGCAAG AAGATCGCCA TCAAGGTGCG CTACTCCATC
CGGCAAGCCG GCCCCATGCA GGCGCAATTG ATCTGGTCGA AGTTCGATCC GACGCCGAAT
CCCGCAGCTG TGACGGCAGC CAAGAATGCC GATGTCGTCA TCGCTGTGCT CGGCATCACC
AGCGACCTGG AAGGCGAAGA GATGCCTGTC AGCGAGGAAG GCTTCAACGG CGGCGATCGG
ACCAGCCTCG ACCTTCCAAA ACCGGAGCAG CAACTGCTCG AGTCCATTTC GGCCGCGGGC
AAGCCGGTGG TCCTCGTGCT GTCGAACGGC AGCGCGCTGT CGGTGAATTG GGCGCAGCAA
CACGCCAACG CGATTCTCGA AGGCTGGTAT CCCGGCGAAG AAGGCGGGAC CGCCATTGCC
CAAACGCTCT CCGGCAAAAA CAACCCGGCA GGACGCCTCC CGGTCACGTT CTACACCGGC
ACTGAGCAAC TGCCGCCCTT CGAAGATTAC GCGATGAAAG GACGCACCTA TCGCTACTTC
GAAGGCAAGC CGCTCTACCC GTTCGGATAT GGCCTGAGCT ACACCACGTT CTCTTATCGC
GACCTCGCGC TTCCCAAGGC TCCGTTGAAC GCAGGCGACC CGGTGACAGC GCAAGTCACA
GTCACCAACA CCGGAAAAGT GGAAGGCGAT GAAGTGGCGC AGCTCTACCT CTCGTTCCCA
AACATCGCGG GAGCACCGCT GCGCGCACTG CGCGGATTCC GGCGTATTCA CCTCAAGGCG
GGCGAATCGC AGACGATAAA GTTCGAACTG AAGGACCGGG ACTTGAGCAT GGTCAATGAA
GCGGGCGATC CCATCATCGC GGAGGGCGAG TACTCCGTCT CGGTAGGTGG CGGCCAACCG
GACACCGGCG CACCGACGGT CTCTGGAAAA TTCCAGATAC AGGGGACGAA GAAGCTGCCG
GAATAA
 
Protein sequence
MRIRAFPICL LLLAFLVPLT TAQNAEQPAY LNPSLAPEKR AADLVHRMTV EEKVSQLTND 
SRAVPRLNVP DYDWWSEALH GVAQPGVTEY PQPVALAATF DNDKVQRMAR FIGIEGRIKH
EEGMKDGHSD IFQGLDFWAP NINIFRDPRW GRGQETYGED PFLTARMGVA YVKGLQGDDP
KYYLAISTPK HYAVHSGPET TRHFADVKVS KHDELDTYLP AFRATVTEAK AGSVMCAYNS
INGQPACVNE FLLQDQLRGK WNFQGYVVSD CEAIINIYRD HKFTKTQAEA SALAVQRGMD
NECVDFGKQK DDHDYRPYFD AYKQGILKES EIDTALVRLF TARMKLGMFD PPEMVPYSKI
DPKELESAEH RELARTLANE SMVLLKNDGT LPLKKSGLKI AVIGPLAEQT RYLLGNYNGT
PSHTVSVLEG LRAEFPDAQI TFERGTQFLD QNGEAVPSRA LTTEDGKPGL KADFSTGEFF
GDKIPLTSAQ ASNVDFTNKD IPQAAAGKFP LNVEWSGFLT ASETGRYSLG VRALGNAAIV
EVDGKPLARE WLDGQHVQTA VGHIHLEQGK KIAIKVRYSI RQAGPMQAQL IWSKFDPTPN
PAAVTAAKNA DVVIAVLGIT SDLEGEEMPV SEEGFNGGDR TSLDLPKPEQ QLLESISAAG
KPVVLVLSNG SALSVNWAQQ HANAILEGWY PGEEGGTAIA QTLSGKNNPA GRLPVTFYTG
TEQLPPFEDY AMKGRTYRYF EGKPLYPFGY GLSYTTFSYR DLALPKAPLN AGDPVTAQVT
VTNTGKVEGD EVAQLYLSFP NIAGAPLRAL RGFRRIHLKA GESQTIKFEL KDRDLSMVNE
AGDPIIAEGE YSVSVGGGQP DTGAPTVSGK FQIQGTKKLP E