Gene Acid345_4564 details


Gene Information

Locus tag: Acid345_4564
Symbol:
ID: 4071509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5410711
End bp: 5412585
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 61%
IMG OID: 637986604
Product: glycoside hydrolase family protein
Protein accession: YP_593638
Protein GI: 94971590
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
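
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5410711
END_BP = 5412585
GENE_LENGTH_BP = 1875
PROTEIN_LENGTH_AA = 624


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1875, matches Gene Length
    print(coding_codons(GENE_LENGTH_BP))   # 624, matches Protein Length
```

Under those assumptions the record is internally consistent: 5412585 - 5410711 + 1 = 1875 bp, and 1875 / 3 - 1 = 624 aa.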


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0741611
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0567328
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCGCA GAATCCTGGC CCTCTTACTC ACGCTCACTA CCCTCTTTCC ATCGGCGTTT 
GCCAAAGACA GATTTCAGCA GCCGGGGCCC GTGCATCTGG ACAAGGACGG CGAGAAGTGG
GCCGATAAGA CCTTGAAGAG CATGTCGCTC GAAGAGAAGG TTGGGCAGAT GTTCATGATC
TGGTCGAAGG CCCAGTTTGT GAACGTGCAG AGCCCGGATT TTCTCAAGCT GCGCGACACC
ATGGCGCGGT ATCACCTGGG CGGGTTCGGC GTGACCGTGA ATTTCGAAGA TGGGTTCCTG
TTCAAGACCG AGCCGTACGA GGCCGCGATG ATGATCAACG AGTTGCAGCA GGGCTCGGAG
ATACCGCTGA TCATTGCCGC CGATTTCGAG CGCGGGCTCT CGATGCGTCT GAAGGAAGCC
ACTGATTTCC CGCACGCGAT GGCGTTTGGG GCGACTAACA ATCCGGCGTA TGCGAAGGAG
TTTGGGCGGA TCACGGCGCT GGAGTCGCGC GCCATTGGCG TGGAGTGGAA CTGGTTTCCT
GATGCCGACG TGAATTCGAA CCCGGCAAAC CCGATCATCA ATACGCGCTC GTTTGGCGAG
GACCCGCAGG CGGTGGCGTC GATGGTGAAG GCCTACATCG AGGGCGCTCA TGCGGAAGGC
CTGCTCACGA CGGTGAAGCA CTTCCCCGGT CACGGCGACA CCGATACCGA CACACACCTG
GCTACAGCGC GAATCAATCA GCCGCTGGAG CATATCCAGA ACGTTGAGTT GGTGCCGTTC
AAGGCCGCGA TTGATGCGGG CGTGGACTCG GTGATGATCG GACATCTCGT GGTGCCGGCA
CTCGATCCCG ATACCAATCG GGTTGCGACT ATCTCGCCGA AGATCGTGAA CGGCACTCTC
AAGAAAGACC TCGGGTTCCA GGGGCTCGTC GTCACCGATG CAATGGAGAT GAACGGGCTG
GCGAAGCTGT TCGGCTTCGG GCCGGAAGGC TCGGCGCGAG CGGCCGTAGC GGCAGTGAAG
GCCGGTGATG ACATGCTTCT GCTGCCGTCG GACCTCGACG GCGCGTACGA AGGGCTGATC
AAAGCGGTGA AACGCGGCGA GATTCCGGAG TCGCGGATTG ACGAGTCGGT GCGGAAGATC
CTGCGGATGA AGGCTTCGGT GGGGCTGAAC AAGGCCAAGC TGGTAGACGT GGAGCAGATG
AAGAACCTCA TCGCGCGTCC GGATAGCCTG TCAGTGGCGC AGGAAATCGC GGATTCCGCG
GTGACACTGG TGCGCAGCAA CGACAAAACC CTGCCCCTGC GGGCGAAAAC AGTGGGAACC
AGCGGGCCTC ATGCAACGTA TGAGAAACCA GAGGGAGTCC GGGGAAGGCA GCTTGCGGTC
ATCATAACGG ATGATTCGCG GAGCGAGTCG GGCCGGATCT TCGATCAGCA GATTCGGCGG
CGATCGCCGG AGATGCGGAC CATCTGGGTG GATGATCGCA ACGCCGTGGG CATGAGCGAC
ACCGTATTGC AGGCGGTACG CGAGGCGGAG AAAGTTGTGG TCGCGATTTA CGCGATTCCC
AGCGCCGGAC GCGTGAAGGT GGAGAACGGC CAGTTCAAGG CCTCGAGCGA CATGAGCGAT
GCGCCGGCGG CGCTGGTGAA GAACATTTTG CGCGTAGCTG GGAGCCGCAC GGTGGTGGTC
GCAATGGGGA ATCCGTATCT GGCGCAGGAT TTCCCGGAAG TGCAGAACTA TATGTGCACG
TATTCCAATG CGCAGGTTTC AGACGTAGCG GCGGTGAAAG CGCTGTTCGG CGACATCGCT
ATTCGTGGTC ACCTGCCGGT GACGATTCCG CAGTTCGCCG AGCGCGGCGC GGGGATCCAG
CTACCGGCAA AGTAG
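
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs cleaning before any calculation. The following is a minimal sketch of how a GC fraction like the one in the GC content field could be computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its 61% figure was derived or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a small demonstration.
first_line = "ATGATTCGCA GAATCCTGGC CCTCTTACTC ACGCTCACTA CCCTCTTTCC ATCGGCGTTT"
seq = clean_sequence(first_line)
print(len(seq))  # 60 bases: six blocks of ten
print(f"{gc_fraction(seq):.1%}")
```

To compare against the GC content field, pass the full 1875 bp gene sequence through `clean_sequence` rather than the first line alone.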
 
Protein sequence
MIRRILALLL TLTTLFPSAF AKDRFQQPGP VHLDKDGEKW ADKTLKSMSL EEKVGQMFMI 
WSKAQFVNVQ SPDFLKLRDT MARYHLGGFG VTVNFEDGFL FKTEPYEAAM MINELQQGSE
IPLIIAADFE RGLSMRLKEA TDFPHAMAFG ATNNPAYAKE FGRITALESR AIGVEWNWFP
DADVNSNPAN PIINTRSFGE DPQAVASMVK AYIEGAHAEG LLTTVKHFPG HGDTDTDTHL
ATARINQPLE HIQNVELVPF KAAIDAGVDS VMIGHLVVPA LDPDTNRVAT ISPKIVNGTL
KKDLGFQGLV VTDAMEMNGL AKLFGFGPEG SARAAVAAVK AGDDMLLLPS DLDGAYEGLI
KAVKRGEIPE SRIDESVRKI LRMKASVGLN KAKLVDVEQM KNLIARPDSL SVAQEIADSA
VTLVRSNDKT LPLRAKTVGT SGPHATYEKP EGVRGRQLAV IITDDSRSES GRIFDQQIRR
RSPEMRTIWV DDRNAVGMSD TVLQAVREAE KVVVAIYAIP SAGRVKVENG QFKASSDMSD
APAALVKNIL RVAGSRTVVV AMGNPYLAQD FPEVQNYMCT YSNAQVSDVA AVKALFGDIA
IRGHLPVTIP QFAERGAGIQ LPAK
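
The protein sequence can be related back to the gene sequence by codon translation. The sketch below is a hedged illustration, not the pipeline that produced this record: it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons, and this gene starts with ATG). `CODON_TABLE` and `translate` are hypothetical names.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(text: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon.

    Whitespace (block spacing, line breaks) is removed first. Alternative
    start codons are not handled; that is an assumption of this sketch.
    """
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases and last 15 bases of the gene sequence above.
print(translate("ATGATTCGCA GAATCCTG"))  # MIRRIL
print(translate("CTACCGGCAA AGTAG"))     # LPAK
```

Both fragments agree with the start (MIRRIL...) and end (...LPAK) of the protein sequence listed above, and the final TAG codon is the stop that is not counted in the 624 aa protein length.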