Gene Acid345_3021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3021 
Symbol 
ID4071576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3584684 
End bp3587809 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content58% 
IMG OID637985040 
ProductBNR repeat-containing glycosyl hydrolase 
Protein accessionYP_592096 
Protein GI94970048 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.195512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAGCC TCAATAGGCG TTTCATCGTG TTGCCCGCTG TACTACTTTC CCTAGTTGCT 
GCCGGGTGGT CGCAGGCCGC CCCTGCTGCA ACAACTCAAA CTCAAATTGA CAACGACACC
TTCGCAGGCT ACACCGCGCG CTCCATCGGC CCGGCCGTCA TGGGCGGCCG TGTCTCAGCG
TTGGCCGCGA TTCCCGGCAA ACGCCTGACG ATCTTTGTCG GCGGTGCCGC TGGTGGAATC
TTCAAATCGG AAGATGGTGG CGTCACGTTC AAGCCGATCT TCGACAAAAT GAACAGTCCG
TCCATCGGCG CGATTGCGAT TGATCCACAA AATTCGAAAG TGATGTGGGT CGGCACCGGC
GAAAGCTGGA TGCGCAACAG CGTCTCGGTC GGCGATGGCG TGTACAAGTC CACCGATGGC
GGAGAAAACT GGACCAACGT TGGCCTGAAA GACAGCGAGC ACATCTCGCG CGTACTCATT
CACCCCAAGG ACGGCAACAC CGTTTATGTC TGCGCTCTTG GTCATGCGTG GAACGACAAC
ACCGAACGTG GTTTGTACAA GACCGCTGAT GGTGGCAAGA CCTGGATGAA TATCCTGAAG
GCCGATCAGC GGACTGGATG CGGCGATGTC GCGTTCGATG CCACCGACCC CAACACGCTT
TACGCTTCGC TCTGGCCGTA CCGGCGGTAT CCCTACAGCT TTAATTCCGG TGGTTCGACC
GGCGGCATCT TTAAGAGCAC CGACGGCGGC GCGAACTGGA AGAAACTGAC CAACGGATTG
CCGGAAGGCG ATCTCGGACG TATCGCCATC GCAACCACTC CAGCAAAACC GGGTCGCGTT
TGGGCAGTCG TCGAAGCGAA AAAGACGGCC CTCTATCGCT CTGATGATGG CGGAGCAACG
TGGACGTACC AGAACGACAG CTTCAACATC GTGGGACGCC CGTTCTATTT CTCATTGCTC
GTCTCCGATC CGAACGATGG CGATCGCATC TATAAGCCGG GTTTCGGACT GACGGTGAGC
GATGATGGCG GTCGCAGCTT CTCCGGCATC GGTAGCGAAG GGGCGGGAGG CGGTGTTCAC
GGCGACTATC ACGCGCTGTG GGTGAACCCA AACAACTCCG ACCATCTCAT CACCTGCTCA
GACGGAGGCT GCTATGAGAG CCTCGACCGC GGTGCGCACT GGCGCTTCCT GAACTCCTTC
CCGATCGGCC AGTACTACCA CGTGAGCGCC GATATGGCTG AACCATACAA CGTGTACGGC
GGCCTGCAGG ACAACGGAAC GTGGATGGGC CCGAACACCG ATTCCGACGG CGTCTTCAAC
CGTCATTGGA AGAACATCGG TTATGGCGAC GGCTTCTGGT CGTTTGCCGA TCCAACCGAC
AACGACCTGA TCTACAGCGA GTACCAGGGC GGACGCATGT TACGCGTGCG CCGTACTACC
GGCGAAATCA AGGAAGTTTA TCCGCTGCCA AAGGCCGGTG ATCCCGACTA TCGTTGCAAC
TGGAACACGC CGATCCACGT GGGTGCTGCT TCGAAGGCGC TTTACATCGG CTGCCAGTTC
CTCTTCCGCT CGCGCGATCA TGGCGATTCG TGGGAGAAGA TCTCGCCTGA TCTCACAACC
AACAATCCGG AATGGCTGAA GCAGTCGGAG TCCGGCGGCC TGACCGTGGA CAACTCCGAC
GCTGAAAAGT ACGAAACCAT CTTCACGATC TCCGAGTCGC CGAAGAACCC CCAGATTGTG
TGGGCGGGAA CCGACGACGG AAACGTGCAG GTCACTCAGA ACGGCGGCAA GAGTTGGACC
AACGTCGCCA AGAACATTCC TGGACTACCG CCGAACACAT GGGTCTCGAC GATCGAAGCC
GGTCACTTCG ACCCCGGCAC CGCCTACGCA ACCTTTGACG GTCACGCCAA GGGCGACATG
AAGACCTACG TCTATAAGAC GACGGACTTC GGCAAGACGT GGACGCAGCT CAACAGTCCC
GAGTTCAAGC TCTACGCGCA CGTGGTCCGT GAAGACCTCG TGAATCCGAA GCTGTTGTGG
GTGGGCACGG AGAACGGACT GTACATCAGC ATTGACGGCG GCGCGAATTG GGCCGAGTTC
AACGGCAAAA TTCCCCGCGT GCCGGTGCGC GATGTGTTCA TCCACCCGCG CAACAACGAT
GTGGTCATCG CCACCCACGG TCGTTCGTTG TACGTGATTG ACGATGTCAC ACCGATCCGC
GCGCTGACGA CCGACATCCT CAACAAAGAC ATTGCGATTT TGCCGTCACG CCCTTCGGTG
CTGCCGCTTC CCTCGGAAGA ACAGCGCGCG GAAGGCGATG CCGACTATCG CGGCGTTCCA
GTCACGAGTT CGGCCATCGT CACCTATTAC CAAAAGAAGC GCCACATCTT CGGCGAACTG
AAGGTGGAGC TGTTCGATTC CACCGGCAAG CTCGTCGGAA CTTCGGCGGG TGACAAACGT
CGCGGTGTGG TTCGCGTCGA ACTGCCGCTG CGCCTGCCAC CAGCGAAAGT GCCACCTGCA
GCAACGCTGG TGGAACAACC GTTCGCTTTC TTCGGGCCGG CGTATCCGGA AGGCACTTAC
AAGGTGCAGG TCACCAAGGG TAAGGAAGTA CTGACCTCGA CGATCAAGGT GGTCACCGAT
CCGCGAGCCA AGAGCACACC TCAGGACCGA GCCCTTCAGC GCCAGACTGC TCTGAAGCTC
TACGGCATGA GGGAACGGCT GGCGTACCTG GTGGCTGCGA TGACCAACGT CCGCGATCAA
GCAAAAGACC GCGCTTCGAA GGCCTCGGAT GCTGCTCTCA AACAGCAACT TAGCGACCTG
CAAAAGAAAG TGGAGGACTT CCGCAGTTCG TTGCTCGCCG TGAAAGAAGG CGGCGCAATC
ACCGGTGAAC GCAAGCTTAA CGAGTACATC GGTGAACTTT ACGGCGGCGT CAATGGCTAC
GAAGGCAAGC CGACACAGCA GCAGATCGAC CGCATGAACG CGTTGAATAC TGAGTTGGAG
ACCGTCGCGA AGAAGTTCGA CGCGATGAAC TCGAGCGACG TAAACACGGT GAATTCGGCC
CTGCAGAAGG CGAGTTTGCA ATCGCTGACA ACACTGTCAG AAGCCGACTG GCGTAAGCAG
CAGTAG
 
Protein sequence
MFSLNRRFIV LPAVLLSLVA AGWSQAAPAA TTQTQIDNDT FAGYTARSIG PAVMGGRVSA 
LAAIPGKRLT IFVGGAAGGI FKSEDGGVTF KPIFDKMNSP SIGAIAIDPQ NSKVMWVGTG
ESWMRNSVSV GDGVYKSTDG GENWTNVGLK DSEHISRVLI HPKDGNTVYV CALGHAWNDN
TERGLYKTAD GGKTWMNILK ADQRTGCGDV AFDATDPNTL YASLWPYRRY PYSFNSGGST
GGIFKSTDGG ANWKKLTNGL PEGDLGRIAI ATTPAKPGRV WAVVEAKKTA LYRSDDGGAT
WTYQNDSFNI VGRPFYFSLL VSDPNDGDRI YKPGFGLTVS DDGGRSFSGI GSEGAGGGVH
GDYHALWVNP NNSDHLITCS DGGCYESLDR GAHWRFLNSF PIGQYYHVSA DMAEPYNVYG
GLQDNGTWMG PNTDSDGVFN RHWKNIGYGD GFWSFADPTD NDLIYSEYQG GRMLRVRRTT
GEIKEVYPLP KAGDPDYRCN WNTPIHVGAA SKALYIGCQF LFRSRDHGDS WEKISPDLTT
NNPEWLKQSE SGGLTVDNSD AEKYETIFTI SESPKNPQIV WAGTDDGNVQ VTQNGGKSWT
NVAKNIPGLP PNTWVSTIEA GHFDPGTAYA TFDGHAKGDM KTYVYKTTDF GKTWTQLNSP
EFKLYAHVVR EDLVNPKLLW VGTENGLYIS IDGGANWAEF NGKIPRVPVR DVFIHPRNND
VVIATHGRSL YVIDDVTPIR ALTTDILNKD IAILPSRPSV LPLPSEEQRA EGDADYRGVP
VTSSAIVTYY QKKRHIFGEL KVELFDSTGK LVGTSAGDKR RGVVRVELPL RLPPAKVPPA
ATLVEQPFAF FGPAYPEGTY KVQVTKGKEV LTSTIKVVTD PRAKSTPQDR ALQRQTALKL
YGMRERLAYL VAAMTNVRDQ AKDRASKASD AALKQQLSDL QKKVEDFRSS LLAVKEGGAI
TGERKLNEYI GELYGGVNGY EGKPTQQQID RMNALNTELE TVAKKFDAMN SSDVNTVNSA
LQKASLQSLT TLSEADWRKQ Q