Gene Acid345_1782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1782 
Symbol 
ID4072842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2160215 
End bp2162203 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content58% 
IMG OID637983790 
Productexcinuclease ABC subunit B 
Protein accessionYP_590857 
Protein GI94968809 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.753339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTCA AGGTAAGCTC CCCGTATAAA CCTCAGGGCG ACCAAGCCCG TGCGATTGAA 
CAACTGACCG GCGGGATTCG CGACGGCGAA AAGCACCAGG TGCTGCTCGG CGTAACCGGC
TCCGGTAAGA CCTTCACGAT GGCGAAGATC ATCGAGCAGC TGAACCGCCC GGCGCTCATC
CTGGCGCACA ACAAGACGCT GGCGGCGCAG CTTTATCACG AGTTCAAGAA CTTCTTCCCG
AACAATGCTG TCGAGTACTT CGTCTCGTAT TACGACTACT ACCAGCCGGA GGCCTACATC
CCTGCTGGCG ACGTTTATAT CGAGAAAGAA GCGACGGTTA ACGACGAGCT AGACAAGCTA
CGGCTCGCAG CGACCCGGTC CTTGTTCGAG CGGCGCGACG TGATCATCGT GGCGAGCGTG
AGCTGCATCT ACGGCCTTGG TTCGCCGGAA GCGTACTACG GCATGTTGCT CTTCCTCGAG
AAGGGCCAGC GCATCAAGCG CGACGACATC CTGAAGAAGC TGGTCGAGAT CCTTTATGAG
CGCACCAACG AAGATTTCCG GCGCGGAACC TTTCGAGTGC GCGGCGACGT AATCGAGATC
TTTCCGACTT ACGAAGACAC CGCCTATCGC ATTGAGATGT TCGGGGACGA AGTCGAGTCG
CTCTCGCAGA TTGATCCGCT GTTCGGCACG GTAAAACAGA AGTACCAGCG GCTGCCGATT
TATCCGAAAA CGCACTACGT GATGAAGCCG GAGCGCAAGA ATTCGGCGGT TACCACGATT
CTTGAAGAAC TCGGCTGGTG GGAGAACGAA CTGCAGAAGC AGGGACGCCT GGTGGAATCG
CAACGCATTC ACCAGCGCAC GCGCTTCGAT CTCGAAATGA TCAAGGAGAT GGGCTACTGC
CACGGCATCG AGAACTACTC GCGGCACTTT ACCGGCCGAC TACCAGGCGA GCCGCCGCCG
ACGTTGCTCG ACTACATGCC GCGGGAGTTC TTGCTCTTCA TTGACGAGTC ACACCAGACC
GTCCCGCAGC TACATGGCAT GTGGCACGGC GACCGTTCAC GCAAAGAGAA CCTGATCGAG
TACGGCTTCC GGCTGCCGAG CGCGTTGGAC AATCGTCCGC TGACGTTTGA AGAGTTTGAG
AACCGCGTGA ACCAGGCGGT GTACGTTTCG GCGACGCCGG GACCGTATGA GCTGACGAAA
GCCGCAGGCG TGGTCGTGGA GCAGATTATT CGCCCGACGG GATTGATCGA CCCGGAAGTC
GAAGTCCGTC CGGTAAAAGG ACAGATTGAC GACCTGCTGC ACGAGATCCG GAAGCGCGCG
GAAAAGAGAG AACGCGTGCT GGTGACGACT CTGACCAAGC GCATGGCCGA GGACCTCAGC
GAGTACTACA CCGAGGTCGG CGTGCGCTGC CGCTACATGC ACTCCGAGAT TGAAACGCTG
GAGCGCATCA AGATCCTGCG TGATCTTCGC AAGGGTGAGT TCGATGTATT GATCGGCATC
AATCTGTTGC GCGAAGGGCT CGACTTACCT GAGGTTTCGC TGGTGGCGAT TCTGGACGCC
GACAAAGAAG GCTTCCTGCG CTCGCAGGGC TCGCTCATCC AGACCATGGG CCGTTGCGCC
CGTAATCTCG AAGGGCGCGC GATCCTTTAT GCGGACCGCA TGACTGACTC GATGAAGAAG
GCGATGGACG AGACCTATCG TCGCCGCGCG ATTCAGGAGG CTTACAACGT GGAGCACGGC
ATCACGCCGG AGTCGATCGT TCGCCCAGTA GATATGGCCC TGGCTGCGAT CGTGGGCGCG
GACTACGTGG ATCTCACCGC GCAGCCGGAT GAGATACCGG AGTTCAAATC GCAGGAGCAG
TTGGATAAGT TCGTGGAGAA ACTCGAGGGC GAGATGCGCG AAGCGGCCAA GCGATTTGAG
TTCGAGAAGG CGGCGAAGCT GCGCGATCAG ATCAAGGAAC TACGGACCAA AGAGTTCATG
TTCACTTAG
 
Protein sequence
MDLKVSSPYK PQGDQARAIE QLTGGIRDGE KHQVLLGVTG SGKTFTMAKI IEQLNRPALI 
LAHNKTLAAQ LYHEFKNFFP NNAVEYFVSY YDYYQPEAYI PAGDVYIEKE ATVNDELDKL
RLAATRSLFE RRDVIIVASV SCIYGLGSPE AYYGMLLFLE KGQRIKRDDI LKKLVEILYE
RTNEDFRRGT FRVRGDVIEI FPTYEDTAYR IEMFGDEVES LSQIDPLFGT VKQKYQRLPI
YPKTHYVMKP ERKNSAVTTI LEELGWWENE LQKQGRLVES QRIHQRTRFD LEMIKEMGYC
HGIENYSRHF TGRLPGEPPP TLLDYMPREF LLFIDESHQT VPQLHGMWHG DRSRKENLIE
YGFRLPSALD NRPLTFEEFE NRVNQAVYVS ATPGPYELTK AAGVVVEQII RPTGLIDPEV
EVRPVKGQID DLLHEIRKRA EKRERVLVTT LTKRMAEDLS EYYTEVGVRC RYMHSEIETL
ERIKILRDLR KGEFDVLIGI NLLREGLDLP EVSLVAILDA DKEGFLRSQG SLIQTMGRCA
RNLEGRAILY ADRMTDSMKK AMDETYRRRA IQEAYNVEHG ITPESIVRPV DMALAAIVGA
DYVDLTAQPD EIPEFKSQEQ LDKFVEKLEG EMREAAKRFE FEKAAKLRDQ IKELRTKEFM
FT