Gene Acid345_4061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4061 
Symbol 
ID4072483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4805289 
End bp4807484 
Gene Length2196 bp 
Protein Length731 aa 
Translation table11 
GC content57% 
IMG OID637986092 
ProductBeta-glucosidase 
Protein accessionYP_593135 
Protein GI94971087 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.905446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0160169 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTT CTCGCGTATT TCTTGCCACC ACTTTAGCGC TGTCAATGTT TGGTTCTGCG 
CAGCGCCCGG ATCCCAGGGC CGAACTCATG AAGAAGCCCT GGATGGACAA AAACAAATCC
GCCGATGAGC GCGCCGAACT CGTGCTGAAG GAAATGACTG TAGACGAGAA GATCAACCTC
ATCCACGGCC AGGGCATGGA AGAAATGGCA GAGTTCGGGA TTCCGATGCC GAACAAGGCG
CTCGGAAATG GCGGCGCCGG ATTCGTCCTC GGTGTGGAAC GGCTTGGAAT CCCGCTCATC
CAGATGAGCG ATGCGGCGTA TGGCGTTCGT AGCAGCGCAA AGAATGGCCG GTATTCCACA
GCGTTACCGT CGAACCTGGG CTCAGCCGCG AGTTGGGACC CGCAGGCCGC GTGCGAGTAT
GGCGCACTCA TCGGTCGTGA ACTGCGCGCG CAGGGCTACA ACATGACACT CGGCGGAGGG
GTGAACATCA CCCGCGAACC GCGCAACGGC CGCACCTTCG AATACATGGG CGAAGACCCG
ATTCTCGCTG GAACTTTGGT TGGCAATCGG ATCAAGTGTG AGCAGGCGCA GCACGTCATC
GGCGACACGA AGCACTATGC CGTCAACGAC CAGGAGAGCG GTCGAACGGA AGTTGACGTC
GTGATCAGCA AGCGCGCGAT GCGCGAGACC GACCTTCTCG CGTTTGAAAT CGGTATCGAG
ATCGGACAGC CGGGCGCAGT GATGTGCTCG TACAACCTGG TGAACGGCAC GTACGCTTGC
CAGAACAAGT ATCTGCTTAC CGATGTTTTG AAGAAGGACT GGAACTTTAA GGGCTTCGTA
GTTTCCGATT GGGGGGCCAC CCACAGCACG ATCGAATCGT CCGCTGCCGG CCTAGACAAC
GAACAGCCAT TCGGCATTTT CTATTCCGAC AAATTCAAGG CCGGTCTCGA CTCGGGCAAA
ATCCCAATGT CGGAACTCGA CGACCATGTT CGGCGCATTC TGCGCACCGA GTTTGCGTCA
GGCATCGTCG ACAATCCGGT TGTGAAAGCT GTCGTTGATG TTGAAGCAGG TCTCGAAACC
GCGCGCAAGA TCGAAGAGGG GAGCATCGTT CTTCTAAAGA ATGGCAACAA CATTCTTCCG
CTCGACAAAG ACTCGATCAA GTCTGTCGCA ATTATCGGAG AGAAGGCCGA TTTCGGAATG
ATCTCTGGTG GTGGCTCCGC GCAGGTGGAT CCTCCGGGTC CGTCGCATGA GTGGCAAGCC
CATGTCTGGT TCCCAACCTC GCCGCTGAAG GCTGTGCAAG CCAAGCTACC GAATGCGAAA
GTTGACTGGC GCTCCGGCTC GGACAAGAGC AAAGCCGCCG CGCTTGCCAA GCAGTCCGAC
ATCGCGATCG TGTTTGTACA TCAATGGATC AGTGAAGGCA TGGACCTGCC TGATCTGACA
CTGCCTTTCG GCCAGGATGA ACTGATCGAG CGCGTTGCCG CTGCAAATCC GCGGACGATT
GTTGTTCTAG AATCCGGTAC TGCGGTCACC ATGCCGTGGC TCGATAAAGT CAGCGGCGTA
GTGGAAGCCT GGTACGGAGG CAGCAAGGGT GCCGACGCCG TCGCGAACAT TTTGTTCGGC
GATGTGAATC CCTCCGGAAA GCTGCCGATG ACTTTCCCGC GCAGCGTAGA AGACGTGCCG
CATCCGAAGC TCGCGGTGCC TCCCCCGAAC CCGAATCCAA TGGAAACCTA TATGCACCCC
GAACTCGCGA AGGCTACGGT CACAGCGACA TACGACGAGG GCGTAAAAGT CGGGTACAAG
TGGTATGACG CCGAGAAGAA GCCGGTCCTG TTCCCGTTTG GCTTCGGGCT CTCCTACACC
ACGTATCAGT ACAGCGGATT GAAGGTTTCC AGCGGGAAGA CGACAACTGT GAGCTTCACG
GTGAAAAATA CCGGAAAGCG AGGGGGCGAC GAGATCGCGC AAGTCTACGC ATCGCTTCCG
GCGGAAGCGC AGGAGCCTCC GAAGCGTTTG GTGGGCTTCA CCAAAGTCCA TCTTAATGCG
GGTGAATCGA AGGAAGTGAG CGTAACCGTG CCAGCGAAAT ATCTATCGGT CTTCGATGAA
GCGTCGAACG GATGGAAGCT GGTTCCGGGC AGCTATAGCT TTATGGTCGG CGGATCGTCG
CAAGACCTTC CGCAGAAACA ATCGGCTACG TTCTAA
 
Protein sequence
MKISRVFLAT TLALSMFGSA QRPDPRAELM KKPWMDKNKS ADERAELVLK EMTVDEKINL 
IHGQGMEEMA EFGIPMPNKA LGNGGAGFVL GVERLGIPLI QMSDAAYGVR SSAKNGRYST
ALPSNLGSAA SWDPQAACEY GALIGRELRA QGYNMTLGGG VNITREPRNG RTFEYMGEDP
ILAGTLVGNR IKCEQAQHVI GDTKHYAVND QESGRTEVDV VISKRAMRET DLLAFEIGIE
IGQPGAVMCS YNLVNGTYAC QNKYLLTDVL KKDWNFKGFV VSDWGATHST IESSAAGLDN
EQPFGIFYSD KFKAGLDSGK IPMSELDDHV RRILRTEFAS GIVDNPVVKA VVDVEAGLET
ARKIEEGSIV LLKNGNNILP LDKDSIKSVA IIGEKADFGM ISGGGSAQVD PPGPSHEWQA
HVWFPTSPLK AVQAKLPNAK VDWRSGSDKS KAAALAKQSD IAIVFVHQWI SEGMDLPDLT
LPFGQDELIE RVAAANPRTI VVLESGTAVT MPWLDKVSGV VEAWYGGSKG ADAVANILFG
DVNPSGKLPM TFPRSVEDVP HPKLAVPPPN PNPMETYMHP ELAKATVTAT YDEGVKVGYK
WYDAEKKPVL FPFGFGLSYT TYQYSGLKVS SGKTTTVSFT VKNTGKRGGD EIAQVYASLP
AEAQEPPKRL VGFTKVHLNA GESKEVSVTV PAKYLSVFDE ASNGWKLVPG SYSFMVGGSS
QDLPQKQSAT F