Gene Information |
Locus tag | Acid345_3621 |
Symbol | |
ID | 4070141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4282869 |
End bp | 4285589 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637985644 |
Product | DNA helicase/exodeoxyribonuclease V, subunit B |
Protein accession | YP_592696 |
Protein GI | 94970648 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3857] ATP-dependent nuclease, subunit B |
TIGRFAM ID | [TIGR03623] probable DNA repair protein |
|
|
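The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 4282869
end_bp = 4285589
gene_length_bp = 2721
protein_length_aa = 906


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record.
print(computed_length == gene_length_bp)      # True
print(computed_protein == protein_length_aa)  # True
```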
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.221768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.652305 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTCGC AACAATCCCG TATCGACCGG GCCGTGGCGC TCGCACGGGC CGGAGCCACC GTTATCGCCG ACAATACGCG CGCTGCACGA CGGTTGCGCC TGGAAGCCGA GTGGCAGGTG CTGCAGGAAA AACGCGTCTG CGCAACCCCC GACGTACTTC CCTTTGAAGC GTGGATCCAA CGCACCTGGA CTGACGCCCT GCTGGCCGGC GTGGTGGACC GAGCGCTGCT CAAGGCAAAC GTGGTCGCGG CACTATGGCG CGAAATCGTG GCGAATTCCA CTCCGGGACG CGACTTGCTC AGCCACAATG CAGCGGCGGA ACAGGCACAG CAGGCGTGGA AGCTCATCTT CGACTACAAG CTGCCGCGCA GCCGCGCGCT CTATTCCGAG ACCGCTGAGA GCAAGGCTTT CCACGACTGG GCCGAGGCGT TCGAAGAACG CTGCGAGCGC GAAGGATGGA TTGATTCTTC CGCGGCGACG GAACAGATCG CTGCCCGCGC CGAGCAGTTG CCGAATCTGC CGAAACAAAT CGTGGCATTC GGATTCGATC GCTTTACTCC GGCGCAGGAA GCTTTGTGGC GCGCGCTACG CAACGCTGGG TGCGAGGTAA CAGTGCTGGC GCCGGAATCG GAGAGCAACA AAGACCACGC TCGAGGATTG GCGTGCGCGG ATCCGAGCGA CGAGATCCGA ACCTCCGCGC TGTGGGCGCG GAAGAAACTG GAAGAAAACC CGTCGGCGCG TGTGGGCGTA ATCGTTCCGC GGTTGGAAGG CCTGCGCGAA ACCTTCGCGA CCATCTTTGA AGATGTGCTG CATCCGGAGA ACCGGCTATT GACGCGGACC GCGACGGCGC GGGCTTTCGA GATCTCGTTG GGCAGGCCGC TCTCCGAGCA CCCCATGGTG CGGGCGGCGC TACGGATTTT GCGGCTGGCG ACCTCAAGCT TGAACGCCGA GGAATTCAGC GCGCTGTTGC GGTCGCGTTA CATCGCGGGC GGGACGAACG AGGCGTCGGC GCGGGCGCTG GTTGATTTCG AGTTGCGAAG GAAGTTGCGC GCGACTGTGA CCTTGGCGCA GGTGCTCAGC GGCAAGGCAG AAGAAAAGGT CGTGGGCGCC CCGAAGATGG CGAAGATGCT GCGCGCGTTT TTCGCGAAAC CGGCAAAGAC CGGCAAACTC ACGCACACGG AGTGGGCCGC CGAGGCGCAG CGCATTCTGC GGATAATGGG ATGGCCGGGC GATGAAGGCG AGTTCTCGCT GAACAGTGAG GAGTTCCAGG TCAGCAAGAA GTGGGAAGAG TTGCTGAAGG ATTTCTGCGC CCTCGACCAG GCTCTGCCCT TGAAAACAGC GAGCGAAATG CTGCGCGAGT TGGAGCGCGC CGCGGCCGAG GCGACCTTTG CGGCGGAGAA CGAGGCAGCG CCGGTGCAGA TTGTTGGCCC GCTCGCGGCC TCGGGCGAAA CTTTCGATGC ACTGTGGTTC TGCGGGTTGA GCGACGAGAC CTGGCCGCAG AAGGGCCATC CGAATCCGTT CATTCCGTTT GCCCTGCAGA AATCAGCAGG GGCCCCGAAC ACGTCGGCGG AGTGGAACCT GCGCGATGGC GAACGCAGAA CCGCACGCCT CTTGCAGAGT GCGAACGAGT GTGTGCTCAG TTGGCCGCAG AGAAACGACG ACGGAGAGTT GCGGCCATCG CCGCTGCTGG CTGGGGTCCC TCCGGCGACT GACTTAGAAA TCGCCGATGT ACGCGACTGG AATTCCTTGC AGAAGGGCGC GTTGCTGGAG CGTTATGACG ATGAGAAAGC ACCGGCGATC 
GCGAACGAGG AACTAAAGCG ACGCGGAACA ACCGTATTGC AGTGGCAGTC GGGATGCCCG TTTCGCGCCT TCGCACAAAC GCGGCTGGCG GCGGAGAAAT TGGAAGAGAG TCCGCTGGGG GCGAATGCCA TTGAGCGCGG CAACATAGTG GACACCGCGC TGCGGTTCGT GTGGGAGGAA CTGCGGGAGT CGAGCAACCT CAACGGCGGA ATCCCCACCA TGACACTGGA ATTCGTGATT GCCGGTGCGA TCGAACGCGC GCTGCTGGAA GAATTTCCGT CCGGTGAAGA GAGTTGGCTG GTACGGCATC GCGAGATCGA ACGCGAGCGG CTGAGCAAAC TGGTGCACGA TTGGCTCGAT GTCGAGAGAA AGCGGCATCC GTTCCATTCG GTGCGCTCAC AGGTCAACGT CACGGTTAAG CTTGGCGAAC TGGAGTTGAA AGGTCGCATC GATCGCCTGG ACCAAACGGT TGATGGCGAA TATGTCGTCA TTGACTACAA GACGACTCGC AAGGACCTGG CCACAAGCTT GTGGGAGATG CCGCGTCCGC AGGAGCCGCA GCTACCGATC TACGCAGTGG GGCAGCAATT GGAAAGCCAC GAAGTCGCGG GAGTCGCCTT CGCGCAGGTG CGGGCCGGTA AACCGAAGTA CAGCGGACTG GCAACGCGAA AAGAAATTTT CGGCGACAGC AAAAACCTTG CGAAGTTTGG CGAATTTGCA GAAACCCTGG CCACGTGGCA GCCGGAGTTG GAGAAGCTTG CGGGAGAACT GTTAGCCGGG CACGCGGAGG TGGATCCAAA AGTTCCGCCG GGGGTGACCA ACACCACGTG CAGACACTGT CACCTGGGAT CGCTGTGTCG CATCGCCGAG TTGCCGCTCA TTGCCGATGA AGGCGGCGAG GAGGACAGCG ATGAAGAGTA A
|
Protein sequence | MGSQQSRIDR AVALARAGAT VIADNTRAAR RLRLEAEWQV LQEKRVCATP DVLPFEAWIQ RTWTDALLAG VVDRALLKAN VVAALWREIV ANSTPGRDLL SHNAAAEQAQ QAWKLIFDYK LPRSRALYSE TAESKAFHDW AEAFEERCER EGWIDSSAAT EQIAARAEQL PNLPKQIVAF GFDRFTPAQE ALWRALRNAG CEVTVLAPES ESNKDHARGL ACADPSDEIR TSALWARKKL EENPSARVGV IVPRLEGLRE TFATIFEDVL HPENRLLTRT ATARAFEISL GRPLSEHPMV RAALRILRLA TSSLNAEEFS ALLRSRYIAG GTNEASARAL VDFELRRKLR ATVTLAQVLS GKAEEKVVGA PKMAKMLRAF FAKPAKTGKL THTEWAAEAQ RILRIMGWPG DEGEFSLNSE EFQVSKKWEE LLKDFCALDQ ALPLKTASEM LRELERAAAE ATFAAENEAA PVQIVGPLAA SGETFDALWF CGLSDETWPQ KGHPNPFIPF ALQKSAGAPN TSAEWNLRDG ERRTARLLQS ANECVLSWPQ RNDDGELRPS PLLAGVPPAT DLEIADVRDW NSLQKGALLE RYDDEKAPAI ANEELKRRGT TVLQWQSGCP FRAFAQTRLA AEKLEESPLG ANAIERGNIV DTALRFVWEE LRESSNLNGG IPTMTLEFVI AGAIERALLE EFPSGEESWL VRHREIERER LSKLVHDWLD VERKRHPFHS VRSQVNVTVK LGELELKGRI DRLDQTVDGE YVVIDYKTTR KDLATSLWEM PRPQEPQLPI YAVGQQLESH EVAGVAFAQV RAGKPKYSGL ATRKEIFGDS KNLAKFGEFA ETLATWQPEL EKLAGELLAG HAEVDPKVPP GVTNTTCRHC HLGSLCRIAE LPLIADEGGE EDSDEE
|
| |
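The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a minimal sketch, not the database's own method: it assumes the sequence is pasted in the 10-base blocks shown above, it uses the amino acid assignments shared by the standard code and translation table 11, and it does not handle alternative start codons. The names `translate` and `gc_percent` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order; these assignments are the same in the
# standard code and in translation table 11.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(seq: str) -> str:
    """Remove the spaces and line breaks between sequence blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    clean = clean_sequence(seq)
    residues = []
    for pos in range(0, len(clean) - 2, 3):
        residue = CODON_TABLE.get(clean[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    clean = clean_sequence(seq)
    if not clean:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in clean) / len(clean)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGGGCTCGC AACAATCCCG TATCGACCGG"
print(translate(first_blocks))  # MGSQQSRIDR
```

Passing the full gene sequence to `translate` and `gc_percent` should allow a comparison against the Protein sequence and GC content fields of the record.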