Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4484 |
Symbol | |
ID | 4070968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5323842 |
End bp | 5326991 |
Gene Length | 3150 bp |
Protein Length | 1049 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986523 |
Product | glycoside hydrolase family protein |
Protein accession | YP_593558 |
Protein GI | 94971510 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.137937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.980065 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGAC TCGCTGCCTT CTTCTTCCTT TTCGCGCTTT TGGGTGTGCT CGGTTTCGCG CAAACCCCCG ACTGGGAGAA CCCTCGCGTT TTCGGCATCA ATAGAGAAGC GCCGCGCGCG ACCTTCACCC CATTTCCAGA CGAGGCCTCG GCGCTTAAGC GTCGCGAGCA ACCGTCTGTC TTCATGCAAT CGCTGAACGG GATGTGGAAG TTCCACTGGG TAAAAAGCCC TGAAGAGCGG CCTCAGGATT TCTATCAGCC GAACTACGAC GTGAGTGCAT GGAAAGAGAT TCGCGTGCCC GCGAACTGGG AGATGGAAGG CTACGGCACT CCGATCTATA CCAACATCAT TTATCCCTTT GAGCGCGATG CCCCGCGTGT GACGACCGCG CCCGCCGATC ACTCCTGGAC CGCATACCTG CAACGCGATC CCGTCGGATC GTACCGCCGC GACTTCACGC TTCCGGATTC ATGGAATGGC CGCGAAACGT TTCTCGTCTT CGACGGCGTC AACTCCGCCT ACTACCTGTG GATCAACGGC CAGAAGGTCG GCTACAGCCA GGACACCCGG ATGATGGCGG AATTCAATAT CACCAAGTAC CTGAAGCCGG GCACCAACAC CATCGCCGTC GAAGTCTATC GCTGGTGCGA CGGCAGCTAC ATCGAGGACC AGGACTTCTG GCGGATGAGC GGCATCTACC GCAACGTGAC GCTGGTCTCG CGCGCGCCGC TGCACATCCG CGACTTCCAG GTGCAAACGC CATTCGACGC GCAGTATCGC GACGCGATCT TGAAGGTCAG AGTCGATGTT CGCAATCTTG GCGCAAGCAA CTCGGCTGCA ACGCTTGAAG CGCAATTACT CGACGACAAT AGTAAGCCGG TCTTTGCAAT CCTTGCGAAG CGCGTGCAGC TGGAGCAGAA CAAAGAAACT TCCATCACGC TCGAGCAGCT CGTGAAGGCA CCGAAGCAAT GGTCCGCCGA AATTCCAAAC CTTTACCAGC TTCTGCTCAC GCTCAAAGAC GCAGATGGCA AGACACTCGA AGTGATCCCG TGGAAGATCG GTTTCCGCCA GTCCGAAATC AAAGGCGACC AGATTCTCTT CAACGGCAAG AAGCTGATGA TCAAAGGCGT GAACCGCCAC GAGTTCGATC CCGACCTCGG ACAGGTGGTC ACCCGCGAGC GCATGATCCA GGACATCCGG ATTATGAAGC AGAACAACAT CAACGCTGTC CGCACCTCGC ATTACCCCAA CGTTCCAGAG TGGTACGAGC TCGCCGACGA ATATGGCCTC TATATCCTCG ATGAAGCCAA CGTCGAATCG CACGGCTACG ACAGCGAAGC CCAGCAGCGT ATCTCCACGG GCGAAGACTA CACCGACGCC ATTGTCGACC GCATCCATCG CACTATTGAG CGCGACAAGA ACCATCCCTC AATCATCGGC TTCTCACTCG GTAACGAAGC CGGTTGGGGC CGCAATATGG CCGCCGAGCG CGATTGGGCG AAAGCGCACC ACCCCGAGTT CTTCATCATC TACCAGCCGC ACGACAGCGT TCACGGCGAC GCCCTCTCGC CGATGTACGT GAAGCCCCAG GAAATTGTGG GCTACTACAA AGAGCACGGC CAAGGTCGCC CATTCTTCGA GATTGAGTAC GCGCACGCCA TGGGCAACGG CACCGGAAAC TTCCAGCAGT ATTGGGACGT CTTCGATTCC GAACGCTGGG CCCACGGCGG CTTCATCTGG GATTGGGTCG ACCAGGGCAT CCGCCGCAAG AACGCGCAAG GCCGCGAAAT CTGGGCGTAC GGTGGCGACT TCGGCGACAA GCCCAACGAC GACAACTTCG TCACCAACGG CCTCGTGCTC CCTGATCGAA CTCCGCACCC AGGTCTCACC GAAGTAAAGC ACTCGTACGC CAACATCAAG GTCGAGGCAG TTGATCTCGC CGCCGGCGAG TTCCGCATTC GCAACAAATA CAACTTTCGC GATTTGAGTT TTGTCCGCGG AACCTGGGTA CTGGAAGAGA ACGGGAACGC GATCAAGGCT GGCGAGATAC CCGCAAGCAG TGTTGCTCCG CTCGCTACTC AGGAGGTCAC AATCGATCTC AGCCGCCCAG CGATTCGCCC CACCGCTGAC TATCTCGTGA CGATTCGATA TGAATTGAGA GAATCTACAC CATGGGCGCC GAAGGGACAT GTGATCGCGT GGGATCAGTT TGCGCTTGAG AGTGGAAGAG AACTCTCATC CGCGGTGCGC CGGGAGCGCG CGCCGACGCT CAAGATCGAA GACATGGAGC ACCAGTTCGC CGTCTACAAT GATCGCTTCT CGATCACCAT CGGCAAAGAG AGCGGGTCCA TCGAGTCCTT CACGCTCGAC GGCAAGAACT TGATCACCGC GCCTCTCTCG CCGAACTTCT GGCGCGCGCC CACTGACAAC GACCGCGGCA ACGGGATGCC GCAGAGGCAG GCGATCTGGC GGCTTGCCGG CCAGAACCGT GAAGTGCAGA GCGTGAAGGC GGAACAGCCG CAGCCGAATC TCGTAAGAAT CGCAACCGAG ATGAAGCTAC CCGCCGGAAA CTCCACACAG AAATACACGT ACACCATCCA CGGCGATGGC ACCGTGGAAA TAGCCAGCAC CCTCCACGCC GATCCCTCGC TCCCCGACCT TCCCCGCGTC GGCATGCAAA TGCGCGTCCT AGGCTCTCTG CGCAACGTCG AGTGGTTCGG CCGCGGACCC GACGAAAACT ACTGGGACCG CAACCTAGCC TCCAACGTCG GCCTCTACAA GAACACCATG GACAAAATGT GGTTCCCCTA CATCGAGCCG CAGGAAACCG GCAACCGCAC CGACGTTCGC TGGGTGACCT TCACCGATGA CCAAGGCTTC GGTTTCAAAG CCACTGGCGA GCCACTCCTC AACTTCAGCG CCTGGCCTTT CCGCATGTCG GAGATCGAGC ACGAAAAGTC TCCGGTCAAC ATCGGACGCA AGCACGCCGG CGACATCGAA ATGTCCGACG ACATCACCGT CAACCTCGAC TACAAACAAA TGGGCGTAGC CGGCGACGAC AGTTGGGGTG CACCAGTCCA CAAAGAATTC ACGCTGCCTG CGAGCGACTA CACGTATCGC TTCCGATTGG AGCCGGTCGG AGTCAAATAG
|
Protein sequence | MRRLAAFFFL FALLGVLGFA QTPDWENPRV FGINREAPRA TFTPFPDEAS ALKRREQPSV FMQSLNGMWK FHWVKSPEER PQDFYQPNYD VSAWKEIRVP ANWEMEGYGT PIYTNIIYPF ERDAPRVTTA PADHSWTAYL QRDPVGSYRR DFTLPDSWNG RETFLVFDGV NSAYYLWING QKVGYSQDTR MMAEFNITKY LKPGTNTIAV EVYRWCDGSY IEDQDFWRMS GIYRNVTLVS RAPLHIRDFQ VQTPFDAQYR DAILKVRVDV RNLGASNSAA TLEAQLLDDN SKPVFAILAK RVQLEQNKET SITLEQLVKA PKQWSAEIPN LYQLLLTLKD ADGKTLEVIP WKIGFRQSEI KGDQILFNGK KLMIKGVNRH EFDPDLGQVV TRERMIQDIR IMKQNNINAV RTSHYPNVPE WYELADEYGL YILDEANVES HGYDSEAQQR ISTGEDYTDA IVDRIHRTIE RDKNHPSIIG FSLGNEAGWG RNMAAERDWA KAHHPEFFII YQPHDSVHGD ALSPMYVKPQ EIVGYYKEHG QGRPFFEIEY AHAMGNGTGN FQQYWDVFDS ERWAHGGFIW DWVDQGIRRK NAQGREIWAY GGDFGDKPND DNFVTNGLVL PDRTPHPGLT EVKHSYANIK VEAVDLAAGE FRIRNKYNFR DLSFVRGTWV LEENGNAIKA GEIPASSVAP LATQEVTIDL SRPAIRPTAD YLVTIRYELR ESTPWAPKGH VIAWDQFALE SGRELSSAVR RERAPTLKIE DMEHQFAVYN DRFSITIGKE SGSIESFTLD GKNLITAPLS PNFWRAPTDN DRGNGMPQRQ AIWRLAGQNR EVQSVKAEQP QPNLVRIATE MKLPAGNSTQ KYTYTIHGDG TVEIASTLHA DPSLPDLPRV GMQMRVLGSL RNVEWFGRGP DENYWDRNLA SNVGLYKNTM DKMWFPYIEP QETGNRTDVR WVTFTDDQGF GFKATGEPLL NFSAWPFRMS EIEHEKSPVN IGRKHAGDIE MSDDITVNLD YKQMGVAGDD SWGAPVHKEF TLPASDYTYR FRLEPVGVK
|
| |