Gene Acid345_3541 details

Gene Information

Locus tag: Acid345_3541
Symbol:
ID: 4069273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4187646
End bp: 4189580
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 61%
IMG OID: 637985564
Product: ATP-dependent DNA helicase RecQ
Protein accession: YP_592616
Protein GI: 94970568
COG category: [L] Replication, recombination and repair
COG ID: [COG0514] Superfamily II DNA helicase
TIGRFAM ID: [TIGR00614] ATP-dependent DNA helicase, RecQ family
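
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4187646
end_bp = 4189580

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1935 bp) and Protein Length (644 aa).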


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.133366
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGATT CCCTCAGCGC AGACATCAAC TCGGCCCTCA AGCACTACTT CGGCTACGAC 
CGCTTCCGCC CGCTGCAGGA ACGGATCATC CGCAGCATCG TCGCCAATAA AGACGTCTGC
GTCATTATGC CGACGGGCGG CGGAAAGTCG CTCTGCTACC AGCTTCCGGC GGCGATCTCG
CAAAAGACTA CGGTCGTCAT CTCCCCGCTC ATCGCGCTTA TGAACGATCA GGTCGTCCAA
CTCACGCAGA TGGGCATTCC CGCCGCGTTG TTGAACAGCA GCTTGCCGTA TGACGAACAG
AAGAAGGTGA TGCGCGCCGC CCGCGAGGGC AAGTACCGCT TGTTGTATCT CTCGCCCGAG
CGCCTGGTGC GCGAAGACAC TGTCGGATGG CTGCGCACCG TGCCGCTCGG CGTGTTTGCT
ATCGACGAGG CGCACTGCAT CTCCGAGTGG GGACACGAGT TTCGACCCGA GTATCGGCAA
CTCAAGCTAC TGCGCAACAG TTTTCCCGAC GTCCCCATCG CGGCCTTCAC CGCCAGCGCC
ACGCAGCGCG TGCGTCACGA CATCGTCCAT CAACTTGCGC TGCGCGAGCC CGACAAATAC
ATCGCCAGCT TCCATCGCCC CAACCTGCGC TACATCATTC GCCAGACCGA CCCCTACGGC
CAGCGCGACA TGCTGCTCCG CGCGCTGCGC AGCTACGCCG GCCACAACGT CATCGTCTAT
GCGCCGACGA TCAAAATGGT GGAAGAGGTC GCCGACTTTC TAATCGACAA GAAGATCCCC
GCCGTCCCCT ACCACGGACA GATGGATTCG GCGCTGCGCA CCAGGAACCA GGAGAAGTGG
ATGACCGACG AGGTGCGCGT GCTCGTCGGA ACCATTGCGT TCGGCTTGGG AATCAACAAG
CCTGCGGTGC GCGCGGTCAT TCACCTGGCA GTTCCGAAGT CGCTGGAGAA CTATTACCAG
GAAGCGGGCC GCGCCGGACG CGACGGCCTC CCCGCCGATT GCGTGATGCT CTGGCAGCCC
AAAGACCTCG GCTTGCTCGT GTACTTCATC CAGCAAATGC AGGACACCAG CGAGAAGAAG
CGAGCCTGGG AGCGCTACCA GGTCATCAGC GAATTCGTGA AGTCGGACGA GTGCCGCCAC
AAGCAGATCT GCGAACACTT CGGCGAGAAA AAGAGTTTCG ACGACTGCGT GGCATGCGAC
ATCTGCGGCG CAACCGTCGG CTGGATGACC GCTCCGGTCC CCGAGCCCAT GCCCGGCGAT
CCGCCAATCA AGCTGGGACT GCCGGAAGGG AAAAAGCCGA AGAAGAAACT TCGCGCGCCG
CAACCCTCGG AAATCGAGCA CGCCATGCCG CTCGACGATG CGCTGCTCGA CTTCTTCCGC
CTGTGGCGAC GCGACGAAGC CAAGCGCCGC GGTGTTCCGG CCTACGTGGT CATGCACGAT
ACATCGCTTG AACACCTCTG CCGGGTAAAA CCGAAGACGC TGGAACAAGT GCGAAGCATC
TCCGGCTTCG GCGACTTAAA GACCGCCGAT TACGGCCCGG GCATCCTGAA AGCCCTCGCC
GAATTCGACG CCGGCAAGCG CGCCCTCCAG GATTTGGCTC CGCGCCTCGA ACCGACCGGC
CCCAGCCCTT CACACGAAAC GCTGGACCTC CTCCGCAAAG GCCATTCCTT CGCCGAGATC
GCCAACATCC GCGGCCGCCA GTTGCAAACG GTCATGGCCG CCGTAGCGAA CCTAGTCGAG
ACCGGCGATA TCGAGTTCGA TCCCAAGTGG GTGAAAGAAG ATCGTCGGCT CGCGATTGAA
AACGCATGTG AGAAAGTCGG GACGGCGCGG CTGAAACCGG TGAAGGATTT AGTGGCGGCC
GAGGTGACGC TGGGGGAAGT GCATTTGGTG GTGGCTAAGA AGAGGTGGGA AGAGAAGAGA
AGAGACAAGG AGTAG
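
The GC content field reported for this gene can be recomputed from the sequence itself. A minimal sketch follows, assuming the sequence is pasted as plain text in the 10-base blocks shown above; the function name gc_content is hypothetical, and only the first line of the gene is used here as sample input.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Sample input: the first line of the gene sequence only.
# Paste the full sequence to reproduce the gene-level figure.
first_line = (
    "GTGACCGATT CCCTCAGCGC AGACATCAAC "
    "TCGGCCCTCA AGCACTACTT CGGCTACGAC"
)
print(f"{gc_content(first_line):.1%}")
```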
 
Protein sequence
MTDSLSADIN SALKHYFGYD RFRPLQERII RSIVANKDVC VIMPTGGGKS LCYQLPAAIS 
QKTTVVISPL IALMNDQVVQ LTQMGIPAAL LNSSLPYDEQ KKVMRAAREG KYRLLYLSPE
RLVREDTVGW LRTVPLGVFA IDEAHCISEW GHEFRPEYRQ LKLLRNSFPD VPIAAFTASA
TQRVRHDIVH QLALREPDKY IASFHRPNLR YIIRQTDPYG QRDMLLRALR SYAGHNVIVY
APTIKMVEEV ADFLIDKKIP AVPYHGQMDS ALRTRNQEKW MTDEVRVLVG TIAFGLGINK
PAVRAVIHLA VPKSLENYYQ EAGRAGRDGL PADCVMLWQP KDLGLLVYFI QQMQDTSEKK
RAWERYQVIS EFVKSDECRH KQICEHFGEK KSFDDCVACD ICGATVGWMT APVPEPMPGD
PPIKLGLPEG KKPKKKLRAP QPSEIEHAMP LDDALLDFFR LWRRDEAKRR GVPAYVVMHD
TSLEHLCRVK PKTLEQVRSI SGFGDLKTAD YGPGILKALA EFDAGKRALQ DLAPRLEPTG
PSPSHETLDL LRKGHSFAEI ANIRGRQLQT VMAAVANLVE TGDIEFDPKW VKEDRRLAIE
NACEKVGTAR LKPVKDLVAA EVTLGEVHLV VAKKRWEEKR RDKE