Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1689 |
Symbol | |
ID | 4069357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2048956 |
End bp | 2050053 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637983697 |
Product | glycoside hydrolase family protein |
Protein accession | YP_590764 |
Protein GI | 94968716 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0715346 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCGCC GTCTTGCAGT TGTCTTTGCA TTGGTATCCA TTGTTGCGTT TGCGCAGGAC TCACCGGCAT TCCGCCGCGC GCAACATCTG CGCCACGGGA TCAACGTCAG TGAATGGTTT GCCCAATCGA ACGACTACAG CCCGCAGCGG CTGCGCACGT TCACAAAACT CGAAGACATT GATCGCATTA AGAAGATGGG CTTCGATCAC ATCCGCATCA GCATTGATCC GCAGGTCTTC GACTGTTTCC GCTGGAACTC GAACTGTGAG CGCGTGCAGG TGCTGGATGA CGTGGTGAAC CGGGCGGCCA ATCAGGACCT CGCGGTGATG ATTGATCTTC ATCCGAACAG CGAGTACAAG AAACAAGTCG CAACGAACGG TGACGAAGCA CAACGTGCTG CACTGCTTTG GGAGCGCATC GCCGCCCACT ATGCAAAGTC GAATCCGGAT CTACTCTTCT TCGAGTTGAT GAACGAGCCG GAACTCGGCG ATGCGTACAA ATGGAACGGA ATCCAGGAGG CCCTCGCGGC ATCCGTGCGG CGGGCGGCGC CGCAGCACAC GATCATCGTT GCGGGAGCTC GCTACTCCGA TATTGAAGAC CTGGTCAGGA TGCCCGACTT TGCCGATCAC AATCTGATCT TCAACTTCCA CTACTACGAG CCCCACACCT TCACGCATCA AGGCGCTAGC TGGGGATCTG CCTATTGGCT ACGCACGCAG GATTTGCCGT TCCCGGCAAC ACCAGAGAAC ATGGCACCAA TTCTCGCGCA GCAACCTGAT GACTTTACGC GCTGGCAACT GGAGGAATTT GCGCTCTCCA ACTGGAATGC CGCCCGCATT GAAGGAGAGA TGGCCTTTGC CGCCGATTGG GCTAAGCGCC ACAACGTTCC GCTGATTTGC AATGAGTTCG GAGCTTATCG CGAACACACT CGTCCCGAGG ATCGTATGCG CTGGATTTCT GCGGTACGCG GCGCGTTGGA GAAAAATCAT ATCGGCTGGA CGATGTGGGA TTACCAAGGC GGGTTCGGTG TGGTTCACGT GAAGGATGGT CAAGTCACCG AAGACGCTGC GGTACTGCAA GCGTTGGGGC TGAAGTAG
|
Protein sequence | MLRRLAVVFA LVSIVAFAQD SPAFRRAQHL RHGINVSEWF AQSNDYSPQR LRTFTKLEDI DRIKKMGFDH IRISIDPQVF DCFRWNSNCE RVQVLDDVVN RAANQDLAVM IDLHPNSEYK KQVATNGDEA QRAALLWERI AAHYAKSNPD LLFFELMNEP ELGDAYKWNG IQEALAASVR RAAPQHTIIV AGARYSDIED LVRMPDFADH NLIFNFHYYE PHTFTHQGAS WGSAYWLRTQ DLPFPATPEN MAPILAQQPD DFTRWQLEEF ALSNWNAARI EGEMAFAADW AKRHNVPLIC NEFGAYREHT RPEDRMRWIS AVRGALEKNH IGWTMWDYQG GFGVVHVKDG QVTEDAAVLQ ALGLK
|
| |