Gene Acid345_4484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4484 
Symbol 
ID4070968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5323842 
End bp5326991 
Gene Length3150 bp 
Protein Length1049 aa 
Translation table11 
GC content59% 
IMG OID637986523 
Productglycoside hydrolase family protein 
Protein accessionYP_593558 
Protein GI94971510 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.980065 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGAC TCGCTGCCTT CTTCTTCCTT TTCGCGCTTT TGGGTGTGCT CGGTTTCGCG 
CAAACCCCCG ACTGGGAGAA CCCTCGCGTT TTCGGCATCA ATAGAGAAGC GCCGCGCGCG
ACCTTCACCC CATTTCCAGA CGAGGCCTCG GCGCTTAAGC GTCGCGAGCA ACCGTCTGTC
TTCATGCAAT CGCTGAACGG GATGTGGAAG TTCCACTGGG TAAAAAGCCC TGAAGAGCGG
CCTCAGGATT TCTATCAGCC GAACTACGAC GTGAGTGCAT GGAAAGAGAT TCGCGTGCCC
GCGAACTGGG AGATGGAAGG CTACGGCACT CCGATCTATA CCAACATCAT TTATCCCTTT
GAGCGCGATG CCCCGCGTGT GACGACCGCG CCCGCCGATC ACTCCTGGAC CGCATACCTG
CAACGCGATC CCGTCGGATC GTACCGCCGC GACTTCACGC TTCCGGATTC ATGGAATGGC
CGCGAAACGT TTCTCGTCTT CGACGGCGTC AACTCCGCCT ACTACCTGTG GATCAACGGC
CAGAAGGTCG GCTACAGCCA GGACACCCGG ATGATGGCGG AATTCAATAT CACCAAGTAC
CTGAAGCCGG GCACCAACAC CATCGCCGTC GAAGTCTATC GCTGGTGCGA CGGCAGCTAC
ATCGAGGACC AGGACTTCTG GCGGATGAGC GGCATCTACC GCAACGTGAC GCTGGTCTCG
CGCGCGCCGC TGCACATCCG CGACTTCCAG GTGCAAACGC CATTCGACGC GCAGTATCGC
GACGCGATCT TGAAGGTCAG AGTCGATGTT CGCAATCTTG GCGCAAGCAA CTCGGCTGCA
ACGCTTGAAG CGCAATTACT CGACGACAAT AGTAAGCCGG TCTTTGCAAT CCTTGCGAAG
CGCGTGCAGC TGGAGCAGAA CAAAGAAACT TCCATCACGC TCGAGCAGCT CGTGAAGGCA
CCGAAGCAAT GGTCCGCCGA AATTCCAAAC CTTTACCAGC TTCTGCTCAC GCTCAAAGAC
GCAGATGGCA AGACACTCGA AGTGATCCCG TGGAAGATCG GTTTCCGCCA GTCCGAAATC
AAAGGCGACC AGATTCTCTT CAACGGCAAG AAGCTGATGA TCAAAGGCGT GAACCGCCAC
GAGTTCGATC CCGACCTCGG ACAGGTGGTC ACCCGCGAGC GCATGATCCA GGACATCCGG
ATTATGAAGC AGAACAACAT CAACGCTGTC CGCACCTCGC ATTACCCCAA CGTTCCAGAG
TGGTACGAGC TCGCCGACGA ATATGGCCTC TATATCCTCG ATGAAGCCAA CGTCGAATCG
CACGGCTACG ACAGCGAAGC CCAGCAGCGT ATCTCCACGG GCGAAGACTA CACCGACGCC
ATTGTCGACC GCATCCATCG CACTATTGAG CGCGACAAGA ACCATCCCTC AATCATCGGC
TTCTCACTCG GTAACGAAGC CGGTTGGGGC CGCAATATGG CCGCCGAGCG CGATTGGGCG
AAAGCGCACC ACCCCGAGTT CTTCATCATC TACCAGCCGC ACGACAGCGT TCACGGCGAC
GCCCTCTCGC CGATGTACGT GAAGCCCCAG GAAATTGTGG GCTACTACAA AGAGCACGGC
CAAGGTCGCC CATTCTTCGA GATTGAGTAC GCGCACGCCA TGGGCAACGG CACCGGAAAC
TTCCAGCAGT ATTGGGACGT CTTCGATTCC GAACGCTGGG CCCACGGCGG CTTCATCTGG
GATTGGGTCG ACCAGGGCAT CCGCCGCAAG AACGCGCAAG GCCGCGAAAT CTGGGCGTAC
GGTGGCGACT TCGGCGACAA GCCCAACGAC GACAACTTCG TCACCAACGG CCTCGTGCTC
CCTGATCGAA CTCCGCACCC AGGTCTCACC GAAGTAAAGC ACTCGTACGC CAACATCAAG
GTCGAGGCAG TTGATCTCGC CGCCGGCGAG TTCCGCATTC GCAACAAATA CAACTTTCGC
GATTTGAGTT TTGTCCGCGG AACCTGGGTA CTGGAAGAGA ACGGGAACGC GATCAAGGCT
GGCGAGATAC CCGCAAGCAG TGTTGCTCCG CTCGCTACTC AGGAGGTCAC AATCGATCTC
AGCCGCCCAG CGATTCGCCC CACCGCTGAC TATCTCGTGA CGATTCGATA TGAATTGAGA
GAATCTACAC CATGGGCGCC GAAGGGACAT GTGATCGCGT GGGATCAGTT TGCGCTTGAG
AGTGGAAGAG AACTCTCATC CGCGGTGCGC CGGGAGCGCG CGCCGACGCT CAAGATCGAA
GACATGGAGC ACCAGTTCGC CGTCTACAAT GATCGCTTCT CGATCACCAT CGGCAAAGAG
AGCGGGTCCA TCGAGTCCTT CACGCTCGAC GGCAAGAACT TGATCACCGC GCCTCTCTCG
CCGAACTTCT GGCGCGCGCC CACTGACAAC GACCGCGGCA ACGGGATGCC GCAGAGGCAG
GCGATCTGGC GGCTTGCCGG CCAGAACCGT GAAGTGCAGA GCGTGAAGGC GGAACAGCCG
CAGCCGAATC TCGTAAGAAT CGCAACCGAG ATGAAGCTAC CCGCCGGAAA CTCCACACAG
AAATACACGT ACACCATCCA CGGCGATGGC ACCGTGGAAA TAGCCAGCAC CCTCCACGCC
GATCCCTCGC TCCCCGACCT TCCCCGCGTC GGCATGCAAA TGCGCGTCCT AGGCTCTCTG
CGCAACGTCG AGTGGTTCGG CCGCGGACCC GACGAAAACT ACTGGGACCG CAACCTAGCC
TCCAACGTCG GCCTCTACAA GAACACCATG GACAAAATGT GGTTCCCCTA CATCGAGCCG
CAGGAAACCG GCAACCGCAC CGACGTTCGC TGGGTGACCT TCACCGATGA CCAAGGCTTC
GGTTTCAAAG CCACTGGCGA GCCACTCCTC AACTTCAGCG CCTGGCCTTT CCGCATGTCG
GAGATCGAGC ACGAAAAGTC TCCGGTCAAC ATCGGACGCA AGCACGCCGG CGACATCGAA
ATGTCCGACG ACATCACCGT CAACCTCGAC TACAAACAAA TGGGCGTAGC CGGCGACGAC
AGTTGGGGTG CACCAGTCCA CAAAGAATTC ACGCTGCCTG CGAGCGACTA CACGTATCGC
TTCCGATTGG AGCCGGTCGG AGTCAAATAG
 
Protein sequence
MRRLAAFFFL FALLGVLGFA QTPDWENPRV FGINREAPRA TFTPFPDEAS ALKRREQPSV 
FMQSLNGMWK FHWVKSPEER PQDFYQPNYD VSAWKEIRVP ANWEMEGYGT PIYTNIIYPF
ERDAPRVTTA PADHSWTAYL QRDPVGSYRR DFTLPDSWNG RETFLVFDGV NSAYYLWING
QKVGYSQDTR MMAEFNITKY LKPGTNTIAV EVYRWCDGSY IEDQDFWRMS GIYRNVTLVS
RAPLHIRDFQ VQTPFDAQYR DAILKVRVDV RNLGASNSAA TLEAQLLDDN SKPVFAILAK
RVQLEQNKET SITLEQLVKA PKQWSAEIPN LYQLLLTLKD ADGKTLEVIP WKIGFRQSEI
KGDQILFNGK KLMIKGVNRH EFDPDLGQVV TRERMIQDIR IMKQNNINAV RTSHYPNVPE
WYELADEYGL YILDEANVES HGYDSEAQQR ISTGEDYTDA IVDRIHRTIE RDKNHPSIIG
FSLGNEAGWG RNMAAERDWA KAHHPEFFII YQPHDSVHGD ALSPMYVKPQ EIVGYYKEHG
QGRPFFEIEY AHAMGNGTGN FQQYWDVFDS ERWAHGGFIW DWVDQGIRRK NAQGREIWAY
GGDFGDKPND DNFVTNGLVL PDRTPHPGLT EVKHSYANIK VEAVDLAAGE FRIRNKYNFR
DLSFVRGTWV LEENGNAIKA GEIPASSVAP LATQEVTIDL SRPAIRPTAD YLVTIRYELR
ESTPWAPKGH VIAWDQFALE SGRELSSAVR RERAPTLKIE DMEHQFAVYN DRFSITIGKE
SGSIESFTLD GKNLITAPLS PNFWRAPTDN DRGNGMPQRQ AIWRLAGQNR EVQSVKAEQP
QPNLVRIATE MKLPAGNSTQ KYTYTIHGDG TVEIASTLHA DPSLPDLPRV GMQMRVLGSL
RNVEWFGRGP DENYWDRNLA SNVGLYKNTM DKMWFPYIEP QETGNRTDVR WVTFTDDQGF
GFKATGEPLL NFSAWPFRMS EIEHEKSPVN IGRKHAGDIE MSDDITVNLD YKQMGVAGDD
SWGAPVHKEF TLPASDYTYR FRLEPVGVK