Gene Acid345_2110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2110 
Symbol 
ID4069536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2523252 
End bp2525936 
Gene Length2685 bp 
Protein Length894 aa 
Translation table11 
GC content61% 
IMG OID637984125 
Productglycoside hydrolase family protein 
Protein accessionYP_591185 
Protein GI94969137 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCGAA CACAGGGATG GAAACTGGCA GTTGGGTGCG CGTTATTGGC GGGGGCAACG 
CTTGCGGTCG CGCAGGATGC GCCCGTGGTT ACGGGCGACA AGCGCGTGGA CAAGCTGCTG
AGCCAGATGA CGCTCGAAGA GAAGATCACG CTGATCCATG GGACGCAGGA AGATCCCAAG
GTTTACCAGG GGCAGGCTGG GTACCTGGCG GGCGTGCCGC GGCTTGGGAT TCCCGGGCTG
CGCCTTGCCG ATGGGCCGCC GGGGGTGCTG ACGCGGCATC CGTCGCAGGC GGAGACGGCG
ACCATGGGGG TGGCTGCGAC CTTCAGCGAA AAAGATGCGG AAGCCAACGG GCTGGTCATC
GGGCGCGAGG ACCGCGCGCT GGGGATTGAT GTGAGCTTGC AGCCGTTTGT GAATATCGAC
CGCGACCTCG AGTTCGGGCG CGGGTACAAC ACCTTCGGCG AAGATCCGTA CTTGACGAGC
GTGATGGGCG CGGCGGAGAT CAAGGGCATC CAGTCGCAAC ACGTGATGGC GCAGGTGAAG
CACTACGTGG GCTACGACTC GGACGGGACC AGCACCTACA TTGACGATCA GACGCTGCAC
GAGGTGTATG TGGCGCCATT CGACGCGGCG GTGAAGGCCG ATGTGTCGTC GATTATGTGC
TCGTACAACC GGCTGAATGG GACGTTTGCG TGCGGCAACA AAGATTCGCT TACGACCATT
CTGCGCGACC AGATTGGCTT TAAGGGCTTT GTCACTTCCG ACTGGGGCGC GACCCACGCA
GTCAACTTCA TCAACGCCGG TCTCGATATG GAGATGCCTG GCGAGCCGGC GGAGAATGCG
CCGTTCTCAC TTCCCTCCTT CTTCGATCTG AAGCCGGTGC CGGCTGCGCC GGATATGTCG
AAGCTGAGCG CGATGATTGA AGACAACGAC AACATGTTCG GCAACCATAT TCCCGAAGAG
CCCGCAAAGC AGCCCGGCGA TCTCGGCGAC TTTGGGACGA AGCTGGATCC GAAGAAACTG
AAAGAAGCTC TTGCCGATGG CACCGTAACC GAGGCCGCAA TCACGCGCGC TGCGGGACGC
GTGTTGTACG AGATTGTGCA CTTTGGCTAC ATGGATGGAC AGTCGAAGCA CGACGTCACT
ACGCAGGCGA TTGAAGCCAA CGCGAAGATC ATCGAGAAGA CGGGCGAAGA TTCTGCGGTG
CTGTTGAAGA ACGATGGCGC GGCGCTGCCG CTGAAGGACC TCGATAGCGT GGTGCTGATT
GGGCCGACGG CGGCCCAGGT GGATGCGATC GGCATCAATG GCGAACGCAG CGTGGGATTG
CCGGAGAGGC AGATTGGTCC GCTGGCAGCG ATGAAGAAGA TTAGCGGCAA GAACATTCAA
TTCGCCGTCG CCGACGACAT GACCGGCACG ACGATTCCTG CGGCGATGCT TACGCACGAT
GGCAAACCCG GCCTGCTCCG TACTACGGGC GATAAGCAGC AAACCGACGC ACAACTCGAC
TTCACGAAGA AAAACGGCAA AGCACTGGCG GCAAATTCCA TCGTCAAGTG GACGGGTGAG
ATCAACGTTC CGGCTGCGGG GAATTATTGG ATCTATCTAC AGGCGCTGGG AGCGAATGCG
GTTATTAATC TCGATGGCAA GAAGCTCTCT GCGACCGGCG CGTTCCAGGG CGGCGTGCAT
GGCGACATCC TGCAGGCGAA CCAGGACAAC GTGATTCCTA CGCCGGATGG TTTGGACAAC
GTGCGTCGCG CGGTGGATTT GACGGCGGGT GCGCACAAGG TGGAGATCAC GACGTCGGAC
GATACGTCGA AGGCGCCGGT GCAGATGCGA CTGAACTGGT ACACGCCGCA GCAGCGCCAG
GCCGATCACG ATGCGGCAAT CGCGGCGGCG AAGAAAGCGA AGACCGCCGT GGTGTTTGTG
TGGACGCGTC TAGAACCGGT GTTTGGGCTG CCTGGCGATC AGGACAAGCT TGTCGAAGAG
ATTGCGGCGG TGAATCCAAA TACGGTCGTG GTGCTGAATA CCAGCCAGCC GGTTGCGTTG
CCGTGGGTGG ATAAAGTTAA AGCTGTACTT GAAATGTGGT GGCCCGGCGA TGAAGGCGGC
TGGGCAACGG CGAACATCCT ACTCGGCAAG ACCAGCCCTG CCGGACGGCT GCCAGTGACC
TGGGCGAAGA AGCTGACGGA TTATGCGGCG ACGAATCCGC GTTTTCCCGA GCGCAGCAAA
AAAGGTGTGG GCCACAAGAC GACCTACAGC GAGGGCGTGC ATCTTGGGTA TCGTTGGTTC
GATAAAGAAA ACGTCGAACC GCTCTTCGCG TTCGGGCACG GACTGAGCTA CACGACGTTC
GAATACTCCG GGCTGAAGAT CGCGAAAGCG GCAGACGGCG GTTTGGATGT GTCGCTCACC
ATCAAGAACA CGGGTGGTGT GGACTCGGAC GAAGTGCCGC AGGTTTATCT CGGAGCGCCG
GGGTCTGGGC CGCAGGATGC GCAGTTCCCG GTTCGCAAAC TCGTGGCGTT CGATCGCGTG
CAGATCGGGG CGGGCAAGTC GCAGACGGTC TCGCTGCACG TGCCGGAGCG GCAGTTGCAG
TACTGGTCCA CGAAAGACCA GAAGTGGGTG ACGCTGACGG CGTCGCGCAC GCTGAGCGTG
GGCGGGTCGT CGCGGGCGTT GCCGCTGAAG CAGCCCGTAG AATAA
 
Protein sequence
MGRTQGWKLA VGCALLAGAT LAVAQDAPVV TGDKRVDKLL SQMTLEEKIT LIHGTQEDPK 
VYQGQAGYLA GVPRLGIPGL RLADGPPGVL TRHPSQAETA TMGVAATFSE KDAEANGLVI
GREDRALGID VSLQPFVNID RDLEFGRGYN TFGEDPYLTS VMGAAEIKGI QSQHVMAQVK
HYVGYDSDGT STYIDDQTLH EVYVAPFDAA VKADVSSIMC SYNRLNGTFA CGNKDSLTTI
LRDQIGFKGF VTSDWGATHA VNFINAGLDM EMPGEPAENA PFSLPSFFDL KPVPAAPDMS
KLSAMIEDND NMFGNHIPEE PAKQPGDLGD FGTKLDPKKL KEALADGTVT EAAITRAAGR
VLYEIVHFGY MDGQSKHDVT TQAIEANAKI IEKTGEDSAV LLKNDGAALP LKDLDSVVLI
GPTAAQVDAI GINGERSVGL PERQIGPLAA MKKISGKNIQ FAVADDMTGT TIPAAMLTHD
GKPGLLRTTG DKQQTDAQLD FTKKNGKALA ANSIVKWTGE INVPAAGNYW IYLQALGANA
VINLDGKKLS ATGAFQGGVH GDILQANQDN VIPTPDGLDN VRRAVDLTAG AHKVEITTSD
DTSKAPVQMR LNWYTPQQRQ ADHDAAIAAA KKAKTAVVFV WTRLEPVFGL PGDQDKLVEE
IAAVNPNTVV VLNTSQPVAL PWVDKVKAVL EMWWPGDEGG WATANILLGK TSPAGRLPVT
WAKKLTDYAA TNPRFPERSK KGVGHKTTYS EGVHLGYRWF DKENVEPLFA FGHGLSYTTF
EYSGLKIAKA ADGGLDVSLT IKNTGGVDSD EVPQVYLGAP GSGPQDAQFP VRKLVAFDRV
QIGAGKSQTV SLHVPERQLQ YWSTKDQKWV TLTASRTLSV GGSSRALPLK QPVE