Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4564 |
Symbol | |
ID | 4071509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5410711 |
End bp | 5412585 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637986604 |
Product | glycoside hydrolase family protein |
Protein accession | YP_593638 |
Protein GI | 94971590 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0741611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0567328 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCGCA GAATCCTGGC CCTCTTACTC ACGCTCACTA CCCTCTTTCC ATCGGCGTTT GCCAAAGACA GATTTCAGCA GCCGGGGCCC GTGCATCTGG ACAAGGACGG CGAGAAGTGG GCCGATAAGA CCTTGAAGAG CATGTCGCTC GAAGAGAAGG TTGGGCAGAT GTTCATGATC TGGTCGAAGG CCCAGTTTGT GAACGTGCAG AGCCCGGATT TTCTCAAGCT GCGCGACACC ATGGCGCGGT ATCACCTGGG CGGGTTCGGC GTGACCGTGA ATTTCGAAGA TGGGTTCCTG TTCAAGACCG AGCCGTACGA GGCCGCGATG ATGATCAACG AGTTGCAGCA GGGCTCGGAG ATACCGCTGA TCATTGCCGC CGATTTCGAG CGCGGGCTCT CGATGCGTCT GAAGGAAGCC ACTGATTTCC CGCACGCGAT GGCGTTTGGG GCGACTAACA ATCCGGCGTA TGCGAAGGAG TTTGGGCGGA TCACGGCGCT GGAGTCGCGC GCCATTGGCG TGGAGTGGAA CTGGTTTCCT GATGCCGACG TGAATTCGAA CCCGGCAAAC CCGATCATCA ATACGCGCTC GTTTGGCGAG GACCCGCAGG CGGTGGCGTC GATGGTGAAG GCCTACATCG AGGGCGCTCA TGCGGAAGGC CTGCTCACGA CGGTGAAGCA CTTCCCCGGT CACGGCGACA CCGATACCGA CACACACCTG GCTACAGCGC GAATCAATCA GCCGCTGGAG CATATCCAGA ACGTTGAGTT GGTGCCGTTC AAGGCCGCGA TTGATGCGGG CGTGGACTCG GTGATGATCG GACATCTCGT GGTGCCGGCA CTCGATCCCG ATACCAATCG GGTTGCGACT ATCTCGCCGA AGATCGTGAA CGGCACTCTC AAGAAAGACC TCGGGTTCCA GGGGCTCGTC GTCACCGATG CAATGGAGAT GAACGGGCTG GCGAAGCTGT TCGGCTTCGG GCCGGAAGGC TCGGCGCGAG CGGCCGTAGC GGCAGTGAAG GCCGGTGATG ACATGCTTCT GCTGCCGTCG GACCTCGACG GCGCGTACGA AGGGCTGATC AAAGCGGTGA AACGCGGCGA GATTCCGGAG TCGCGGATTG ACGAGTCGGT GCGGAAGATC CTGCGGATGA AGGCTTCGGT GGGGCTGAAC AAGGCCAAGC TGGTAGACGT GGAGCAGATG AAGAACCTCA TCGCGCGTCC GGATAGCCTG TCAGTGGCGC AGGAAATCGC GGATTCCGCG GTGACACTGG TGCGCAGCAA CGACAAAACC CTGCCCCTGC GGGCGAAAAC AGTGGGAACC AGCGGGCCTC ATGCAACGTA TGAGAAACCA GAGGGAGTCC GGGGAAGGCA GCTTGCGGTC ATCATAACGG ATGATTCGCG GAGCGAGTCG GGCCGGATCT TCGATCAGCA GATTCGGCGG CGATCGCCGG AGATGCGGAC CATCTGGGTG GATGATCGCA ACGCCGTGGG CATGAGCGAC ACCGTATTGC AGGCGGTACG CGAGGCGGAG AAAGTTGTGG TCGCGATTTA CGCGATTCCC AGCGCCGGAC GCGTGAAGGT GGAGAACGGC CAGTTCAAGG CCTCGAGCGA CATGAGCGAT GCGCCGGCGG CGCTGGTGAA GAACATTTTG CGCGTAGCTG GGAGCCGCAC GGTGGTGGTC GCAATGGGGA ATCCGTATCT GGCGCAGGAT TTCCCGGAAG TGCAGAACTA TATGTGCACG TATTCCAATG CGCAGGTTTC AGACGTAGCG GCGGTGAAAG CGCTGTTCGG CGACATCGCT ATTCGTGGTC ACCTGCCGGT GACGATTCCG CAGTTCGCCG AGCGCGGCGC GGGGATCCAG CTACCGGCAA AGTAG
|
Protein sequence | MIRRILALLL TLTTLFPSAF AKDRFQQPGP VHLDKDGEKW ADKTLKSMSL EEKVGQMFMI WSKAQFVNVQ SPDFLKLRDT MARYHLGGFG VTVNFEDGFL FKTEPYEAAM MINELQQGSE IPLIIAADFE RGLSMRLKEA TDFPHAMAFG ATNNPAYAKE FGRITALESR AIGVEWNWFP DADVNSNPAN PIINTRSFGE DPQAVASMVK AYIEGAHAEG LLTTVKHFPG HGDTDTDTHL ATARINQPLE HIQNVELVPF KAAIDAGVDS VMIGHLVVPA LDPDTNRVAT ISPKIVNGTL KKDLGFQGLV VTDAMEMNGL AKLFGFGPEG SARAAVAAVK AGDDMLLLPS DLDGAYEGLI KAVKRGEIPE SRIDESVRKI LRMKASVGLN KAKLVDVEQM KNLIARPDSL SVAQEIADSA VTLVRSNDKT LPLRAKTVGT SGPHATYEKP EGVRGRQLAV IITDDSRSES GRIFDQQIRR RSPEMRTIWV DDRNAVGMSD TVLQAVREAE KVVVAIYAIP SAGRVKVENG QFKASSDMSD APAALVKNIL RVAGSRTVVV AMGNPYLAQD FPEVQNYMCT YSNAQVSDVA AVKALFGDIA IRGHLPVTIP QFAERGAGIQ LPAK
|
| |