Gene Information |
Locus tag | Acid345_3021 |
Symbol | |
ID | 4071576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3584684 |
End bp | 3587809 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985040 |
Product | BNR repeat-containing glycosyl hydrolase |
Protein accession | YP_592096 |
Protein GI | 94970048 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
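The coordinate and length fields above are consistent with one another, and the relationship can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length; both assumptions agree with the values listed but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 3584684, 3587809
length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # prints "3126 1041"
```

Both results match the Gene Length (3126 bp) and Protein Length (1041 aa) fields.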
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.195512 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAGCC TCAATAGGCG TTTCATCGTG TTGCCCGCTG TACTACTTTC CCTAGTTGCT GCCGGGTGGT CGCAGGCCGC CCCTGCTGCA ACAACTCAAA CTCAAATTGA CAACGACACC TTCGCAGGCT ACACCGCGCG CTCCATCGGC CCGGCCGTCA TGGGCGGCCG TGTCTCAGCG TTGGCCGCGA TTCCCGGCAA ACGCCTGACG ATCTTTGTCG GCGGTGCCGC TGGTGGAATC TTCAAATCGG AAGATGGTGG CGTCACGTTC AAGCCGATCT TCGACAAAAT GAACAGTCCG TCCATCGGCG CGATTGCGAT TGATCCACAA AATTCGAAAG TGATGTGGGT CGGCACCGGC GAAAGCTGGA TGCGCAACAG CGTCTCGGTC GGCGATGGCG TGTACAAGTC CACCGATGGC GGAGAAAACT GGACCAACGT TGGCCTGAAA GACAGCGAGC ACATCTCGCG CGTACTCATT CACCCCAAGG ACGGCAACAC CGTTTATGTC TGCGCTCTTG GTCATGCGTG GAACGACAAC ACCGAACGTG GTTTGTACAA GACCGCTGAT GGTGGCAAGA CCTGGATGAA TATCCTGAAG GCCGATCAGC GGACTGGATG CGGCGATGTC GCGTTCGATG CCACCGACCC CAACACGCTT TACGCTTCGC TCTGGCCGTA CCGGCGGTAT CCCTACAGCT TTAATTCCGG TGGTTCGACC GGCGGCATCT TTAAGAGCAC CGACGGCGGC GCGAACTGGA AGAAACTGAC CAACGGATTG CCGGAAGGCG ATCTCGGACG TATCGCCATC GCAACCACTC CAGCAAAACC GGGTCGCGTT TGGGCAGTCG TCGAAGCGAA AAAGACGGCC CTCTATCGCT CTGATGATGG CGGAGCAACG TGGACGTACC AGAACGACAG CTTCAACATC GTGGGACGCC CGTTCTATTT CTCATTGCTC GTCTCCGATC CGAACGATGG CGATCGCATC TATAAGCCGG GTTTCGGACT GACGGTGAGC GATGATGGCG GTCGCAGCTT CTCCGGCATC GGTAGCGAAG GGGCGGGAGG CGGTGTTCAC GGCGACTATC ACGCGCTGTG GGTGAACCCA AACAACTCCG ACCATCTCAT CACCTGCTCA GACGGAGGCT GCTATGAGAG CCTCGACCGC GGTGCGCACT GGCGCTTCCT GAACTCCTTC CCGATCGGCC AGTACTACCA CGTGAGCGCC GATATGGCTG AACCATACAA CGTGTACGGC GGCCTGCAGG ACAACGGAAC GTGGATGGGC CCGAACACCG ATTCCGACGG CGTCTTCAAC CGTCATTGGA AGAACATCGG TTATGGCGAC GGCTTCTGGT CGTTTGCCGA TCCAACCGAC AACGACCTGA TCTACAGCGA GTACCAGGGC GGACGCATGT TACGCGTGCG CCGTACTACC GGCGAAATCA AGGAAGTTTA TCCGCTGCCA AAGGCCGGTG ATCCCGACTA TCGTTGCAAC TGGAACACGC CGATCCACGT GGGTGCTGCT TCGAAGGCGC TTTACATCGG CTGCCAGTTC CTCTTCCGCT CGCGCGATCA TGGCGATTCG TGGGAGAAGA TCTCGCCTGA TCTCACAACC AACAATCCGG AATGGCTGAA GCAGTCGGAG TCCGGCGGCC TGACCGTGGA CAACTCCGAC GCTGAAAAGT ACGAAACCAT CTTCACGATC TCCGAGTCGC CGAAGAACCC CCAGATTGTG TGGGCGGGAA CCGACGACGG AAACGTGCAG GTCACTCAGA ACGGCGGCAA GAGTTGGACC 
AACGTCGCCA AGAACATTCC TGGACTACCG CCGAACACAT GGGTCTCGAC GATCGAAGCC GGTCACTTCG ACCCCGGCAC CGCCTACGCA ACCTTTGACG GTCACGCCAA GGGCGACATG AAGACCTACG TCTATAAGAC GACGGACTTC GGCAAGACGT GGACGCAGCT CAACAGTCCC GAGTTCAAGC TCTACGCGCA CGTGGTCCGT GAAGACCTCG TGAATCCGAA GCTGTTGTGG GTGGGCACGG AGAACGGACT GTACATCAGC ATTGACGGCG GCGCGAATTG GGCCGAGTTC AACGGCAAAA TTCCCCGCGT GCCGGTGCGC GATGTGTTCA TCCACCCGCG CAACAACGAT GTGGTCATCG CCACCCACGG TCGTTCGTTG TACGTGATTG ACGATGTCAC ACCGATCCGC GCGCTGACGA CCGACATCCT CAACAAAGAC ATTGCGATTT TGCCGTCACG CCCTTCGGTG CTGCCGCTTC CCTCGGAAGA ACAGCGCGCG GAAGGCGATG CCGACTATCG CGGCGTTCCA GTCACGAGTT CGGCCATCGT CACCTATTAC CAAAAGAAGC GCCACATCTT CGGCGAACTG AAGGTGGAGC TGTTCGATTC CACCGGCAAG CTCGTCGGAA CTTCGGCGGG TGACAAACGT CGCGGTGTGG TTCGCGTCGA ACTGCCGCTG CGCCTGCCAC CAGCGAAAGT GCCACCTGCA GCAACGCTGG TGGAACAACC GTTCGCTTTC TTCGGGCCGG CGTATCCGGA AGGCACTTAC AAGGTGCAGG TCACCAAGGG TAAGGAAGTA CTGACCTCGA CGATCAAGGT GGTCACCGAT CCGCGAGCCA AGAGCACACC TCAGGACCGA GCCCTTCAGC GCCAGACTGC TCTGAAGCTC TACGGCATGA GGGAACGGCT GGCGTACCTG GTGGCTGCGA TGACCAACGT CCGCGATCAA GCAAAAGACC GCGCTTCGAA GGCCTCGGAT GCTGCTCTCA AACAGCAACT TAGCGACCTG CAAAAGAAAG TGGAGGACTT CCGCAGTTCG TTGCTCGCCG TGAAAGAAGG CGGCGCAATC ACCGGTGAAC GCAAGCTTAA CGAGTACATC GGTGAACTTT ACGGCGGCGT CAATGGCTAC GAAGGCAAGC CGACACAGCA GCAGATCGAC CGCATGAACG CGTTGAATAC TGAGTTGGAG ACCGTCGCGA AGAAGTTCGA CGCGATGAAC TCGAGCGACG TAAACACGGT GAATTCGGCC CTGCAGAAGG CGAGTTTGCA ATCGCTGACA ACACTGTCAG AAGCCGACTG GCGTAAGCAG CAGTAG
|
Protein sequence | MFSLNRRFIV LPAVLLSLVA AGWSQAAPAA TTQTQIDNDT FAGYTARSIG PAVMGGRVSA LAAIPGKRLT IFVGGAAGGI FKSEDGGVTF KPIFDKMNSP SIGAIAIDPQ NSKVMWVGTG ESWMRNSVSV GDGVYKSTDG GENWTNVGLK DSEHISRVLI HPKDGNTVYV CALGHAWNDN TERGLYKTAD GGKTWMNILK ADQRTGCGDV AFDATDPNTL YASLWPYRRY PYSFNSGGST GGIFKSTDGG ANWKKLTNGL PEGDLGRIAI ATTPAKPGRV WAVVEAKKTA LYRSDDGGAT WTYQNDSFNI VGRPFYFSLL VSDPNDGDRI YKPGFGLTVS DDGGRSFSGI GSEGAGGGVH GDYHALWVNP NNSDHLITCS DGGCYESLDR GAHWRFLNSF PIGQYYHVSA DMAEPYNVYG GLQDNGTWMG PNTDSDGVFN RHWKNIGYGD GFWSFADPTD NDLIYSEYQG GRMLRVRRTT GEIKEVYPLP KAGDPDYRCN WNTPIHVGAA SKALYIGCQF LFRSRDHGDS WEKISPDLTT NNPEWLKQSE SGGLTVDNSD AEKYETIFTI SESPKNPQIV WAGTDDGNVQ VTQNGGKSWT NVAKNIPGLP PNTWVSTIEA GHFDPGTAYA TFDGHAKGDM KTYVYKTTDF GKTWTQLNSP EFKLYAHVVR EDLVNPKLLW VGTENGLYIS IDGGANWAEF NGKIPRVPVR DVFIHPRNND VVIATHGRSL YVIDDVTPIR ALTTDILNKD IAILPSRPSV LPLPSEEQRA EGDADYRGVP VTSSAIVTYY QKKRHIFGEL KVELFDSTGK LVGTSAGDKR RGVVRVELPL RLPPAKVPPA ATLVEQPFAF FGPAYPEGTY KVQVTKGKEV LTSTIKVVTD PRAKSTPQDR ALQRQTALKL YGMRERLAYL VAAMTNVRDQ AKDRASKASD AALKQQLSDL QKKVEDFRSS LLAVKEGGAI TGERKLNEYI GELYGGVNGY EGKPTQQQID RMNALNTELE TVAKKFDAMN SSDVNTVNSA LQKASLQSLT TLSEADWRKQ Q
|
| |
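The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count. The following is a minimal sketch of both calculations, not the method used by the source database. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and it assumes the sequence is supplied as plain A/C/G/T text, with the 10-base spacing used in this record stripped out. `CODON_TABLE`, `translate` and `gc_content` are hypothetical names.

```python
from itertools import product

# Codon-to-amino-acid map in TCAG order. Table 11 uses the same
# assignments as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGTTTAGCC TCAATAGGCG TTTCATCGTG"
print(translate(first_blocks))  # prints "MFSLNRRFIV"
```

The printed peptide matches the first block of the protein sequence in this record. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.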