Gene Acid345_4152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4152 
Symbol 
ID4072343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4911948 
End bp4914443 
Gene Length2496 bp 
Protein Length831 aa 
Translation table11 
GC content62% 
IMG OID637986183 
Productglycoside hydrolase family protein 
Protein accessionYP_593226 
Protein GI94971178 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.198899 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.318604 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTCA CATTCCTCCG CAAGTTCTCG GCAACGCTTC TCCTCAGCGT TGCCGTTGTC 
GCTTCAGCGC AGAAGCTTCC GACGAAGCAG GAAGCCGCCG CCCGCGCCGA AAAAATCCTC
ACGCAAATGA CGCTCGAAGA AAAAGTCGCC TACATCGGCG GTGACCGCGA CTTCTACATC
CGCGCCATCC CGCGCCTCAA CGTTCCCGAA ATCAAAATGT CCGACGGCCC GCTCGGCACC
CGTAACGATG GCAACTCCAC CGCCTATCCT GCCGGTATCG CCCTCGCTGC CTCGTGGGAC
ATCAAGCTCG CTCACGAGAT GGGAGCCGCT CTCGGTTCGG ACTCCCGCGC CCGTGGCGTG
AACATCCTCC TCGGCCCCGG ACTCAACATC TATCGTGCGC CGATGTGCGG CCGCAACTTC
GAGTACTTTG GCGAAGATCC CTACCTCGCC TCGCGCATGG CCGTCGCCGA CGTCCAGGGC
ATCCAGAGCA TGGGCGTCAT CGCCACCGCC AAGCACTACG CCGCCAACAA CCAGGAGTGG
GACCGCAACC GCGTCTCCTC CGATGTCGAC GAGCGCACAT TGCGCGAAAT CTACCTCCCC
TCCTTCGAGT ACGCCGTGAA GGAAGGCCAC GCCGGCGCCA TCATGGACTC CTACAACCTC
GTCAACGGCG TCCACTCCAC GCAGAACACC TTCCTCAACA TTGACGTTGC CCGCAAAGAC
TGGAACTTCA CCGGCATCAT CATGTCCGAC TGGGAAGCTA CCTACGACGG CGTCGCCGCC
GCCAACGGTG GCCTCGATCT CGAAATGCCC AGCGGCAAAT TCATGAGCCC CACCACGCTC
CTGGCCGCCG TCAAAGATGG CTCCGTCAAA GAATCCGTCA TCGACGAAAA AGTTCGCCGC
ATCCTGCGCA CTTCGATCGA GTTCGGCTTC TTCGATCGTC CGCAGAAAAC CGCCACCCCA
TGGAACGATC CAGCCTCGCG TGCCGTCGCC CTCAAAGTCG CGCAGGAAGG CTTCGTCCTC
CTCAAAAATC AAGGCGGTGT GCTTCCGCTC GATCGCACGA AATTCAAGAA CATCGCTCTC
ATCGGCCCCA ACGCCGGCAT TCCCGCCACC GGCGGTGGTG GCAGCTCCAA GATCGATCCT
TTCTCCGCTG TCTCTCCGGT TGACGCCGTG AAGAACCTCG TCGGCGATTC CGCTAAGATC
GCTTACTATC CCGGCCTCCA ACTCATCTCC GACGTTTTCA AGACCACCAG CTTCACCACC
ACCGCCGACG GCGATACCCA CGGCTTAGTT ACAGAGTTCT TTAACAACAA AGACCTCACC
GGTCCGCCCG CGCTCACCCG TACCGACGAG CACATTGCCT TCAACTGGAG CGGCGGCCCC
TACGCGCCCA ACGGCCAGCA GGAAAACTTC TCCGCGCGAT TCACCGGCTA CTACACTCCC
GCCGCCGACG GCACCTACAC CTTCGCCGTC TCCGGCGATG ACGGCTTCCG CCTCTTCGTC
GACGACAAAC CCGTCATCGA ACAATGGGTC TATCAAGGCG AGACCATCGT CACCAAGGCG
CTCGATCTCA AAGCTGGCCA GCACTACAAG CTCCGCCTCG AGTACTTCCA GGGCGGCGGC
GGTGCCGCTC TCGGCTTCGG CGTCACTGAC GGCAAGTCTT CCGCTCTCAC CGATGCCGTC
AACGCCGCGA CAAACGCCGA CCTCGTCATC CTCTGCGTCG GCTTCGACGA CAAGTCCGAA
GGCGAAGGCG CCGACCGTAC TTTCGCGCTC CCGCAGCCCC AATACGAACT CATCAAGCAA
GTTGAGGCCG CCAACAAGAA CACCGTCATG GTCCTCACCG CCGGCGGCAA CGTGGACATG
GTGCCGTTCA TCGACAACAC GCCTGCGCTC CTGCACGTCT GGTATCCCGG ACAGGAAGGC
GCCACCGCCA TGGCCCAGGT CCTCTTCGGC GACATCAACC CGAGCGGCAA ACTCCCCGCC
TCGTTCGAGC GCCGTTGGGA AGACAACGCC ACCTACAACA GCTACTACGA CCCCGATAAG
ACGCTCCACG TGAAGTACAC CGAAGGCATC TTCGTCGGCT ACCGCCACTT CGACAAAGAC
AACGTCAAGC CGATGTTCCC CTTCGGCTAC GGCCTCAGCT ACACCACCTT CCAATACGGC
GGCCTCAAGA TCGGCGCACC TTCCGCCGAC AGCACCGTCC CCGTCACCTT TACCGTGAAG
AACACCGGCA AGCGCGCCGG CGCCGAGATC GCCGAAGTCT ACGTCGGCGA GAAAAATCCC
AAAGTTCCGC GCCCCGTGAA AGAACTCAAA GGCTTCGCCC GCGTCGAACT CAAACCCGGC
GAATCCCGCA GCATCACCGT CAACCTCGAC CGCCGCGCCT TCTCCTGGTA CGACGCCAAC
TCGCACCAGT GGACCGCCGA TACCGGCAAC TACGACATCC TCATAGGCAG CAGCAGCGCC
AAGATCGAAC TAACCGGCAA CGTCGCCCTG CGATAA
 
Protein sequence
MNLTFLRKFS ATLLLSVAVV ASAQKLPTKQ EAAARAEKIL TQMTLEEKVA YIGGDRDFYI 
RAIPRLNVPE IKMSDGPLGT RNDGNSTAYP AGIALAASWD IKLAHEMGAA LGSDSRARGV
NILLGPGLNI YRAPMCGRNF EYFGEDPYLA SRMAVADVQG IQSMGVIATA KHYAANNQEW
DRNRVSSDVD ERTLREIYLP SFEYAVKEGH AGAIMDSYNL VNGVHSTQNT FLNIDVARKD
WNFTGIIMSD WEATYDGVAA ANGGLDLEMP SGKFMSPTTL LAAVKDGSVK ESVIDEKVRR
ILRTSIEFGF FDRPQKTATP WNDPASRAVA LKVAQEGFVL LKNQGGVLPL DRTKFKNIAL
IGPNAGIPAT GGGGSSKIDP FSAVSPVDAV KNLVGDSAKI AYYPGLQLIS DVFKTTSFTT
TADGDTHGLV TEFFNNKDLT GPPALTRTDE HIAFNWSGGP YAPNGQQENF SARFTGYYTP
AADGTYTFAV SGDDGFRLFV DDKPVIEQWV YQGETIVTKA LDLKAGQHYK LRLEYFQGGG
GAALGFGVTD GKSSALTDAV NAATNADLVI LCVGFDDKSE GEGADRTFAL PQPQYELIKQ
VEAANKNTVM VLTAGGNVDM VPFIDNTPAL LHVWYPGQEG ATAMAQVLFG DINPSGKLPA
SFERRWEDNA TYNSYYDPDK TLHVKYTEGI FVGYRHFDKD NVKPMFPFGY GLSYTTFQYG
GLKIGAPSAD STVPVTFTVK NTGKRAGAEI AEVYVGEKNP KVPRPVKELK GFARVELKPG
ESRSITVNLD RRAFSWYDAN SHQWTADTGN YDILIGSSSA KIELTGNVAL R