Gene EcSMS35_0802 details

Gene Information

Locus tag: EcSMS35_0802
Symbol: uvrB
ID: 6145228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 805374
End bp: 807395
Gene Length: 2022 bp
Protein Length: 673 aa
Translation table: 11
GC content: 53%
IMG OID: 641615690
Product: excinuclease ABC subunit B
Protein accession: YP_001742882
Protein GI: 170682296
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
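
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 805374
END_BP = 807395
GENE_LENGTH_BP = 2022
PROTEIN_LENGTH_AA = 673


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus the stop codon, assuming an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```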


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000050554
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAAC CGTTCAAACT GAATTCCGCT TTTAAACCTT CTGGCGATCA GCCAGAGGCG 
ATTCGACGTC TCGAAGAGGG GCTGGAAGAT GGCCTGGCGC ACCAGACGTT ACTTGGCGTG
ACTGGCTCCG GGAAAACCTT CACCATTGCC AATGTCATTG CTGACCTTCA GCGCCCTACC
ATGGTACTTG CGCCCAACAA AACGCTGGCG GCCCAGTTGT ATGGCGAAAT GAAAGAATTC
TTCCCGGAAA ACGCGGTGGA GTATTTCGTC TCCTACTACG ACTATTATCA GCCGGAAGCC
TATGTACCGA GTTCCGACAC CTTCATTGAG AAAGATGCCT CGGTAAACGA ACATATCGAG
CAGATGCGTT TGTCCGCCAC CAAAGCGATG CTGGAGCGGC GTGATGTGGT TGTGGTGGCA
TCGGTTTCCG CGATTTATGG TCTGGGCGAT CCTGATTTAT ATCTCAAGAT GATGCTCCAT
CTTACGGTCG GTATGATTAT CGATCAGCGC GCGATCCTGC GTCGACTGGC GGAGCTGCAA
TACGCTCGTA ATGATCAAGC CTTCCAGCGC GGTACTTTCC GCGTTCGTGG GGAGGTGATT
GACATCTTCC CGGCAGAATC GGATGACATT GCACTTCGCG TGGAACTGTT TGACGAGGAA
GTGGAACGAT TGTCGTTATT TGATCCGCTC ACCGGGCAGA TTGTTTCCAC TATTCCACGT
TTTACCATCT ACCCGAAAAC GCACTACGTC ACACCGCGCG AGCGCATTGT GCAGGCGATG
GAAGAAATTA AAGAAGAACT GGCTGCCAGA CGCAAGGTGC TGTTGGAAAA CAACAAACTG
CTGGAAGAGC AGCGGCTGAC CCAGCGTACC CAGTTTGATC TGGAGATGAT GAACGAGCTG
GGCTACTGTT CGGGGATTGA AAACTACTCG CGCTTCCTCT CCGGTCGTGG ACCGGGTGAG
CCACCGCCGA CGCTGTTTGA TTACCTGCCT GCCGATGGGC TGCTGGTCGT CGATGAATCT
CACGTTACCA TTCCGCAAAT TGGCGGCATG TATCGCGGTG ACCGGGCGCG TAAAGAGACG
CTGGTGGAGT ACGGCTTCCG CCTGCCATCA GCGCTGGATA ACCGTCCGCT GAAATTTGAA
GAGTTCGAAG CATTAGCGCC GCAAACCATC TATGTTTCGG CGACGCCGGG TAATTACGAG
CTGGAAAAAT CCGGCGGCGA TGTGGTGGAT CAGGTGGTGC GTCCAACAGG CTTACTTGAC
CCGATTATTG AAGTGCGTCC GGTAGCGACA CAGGTCGATG ATCTTCTATC GGAGATTCGT
CAGCGAGCGG CAATTAACGA ACGCGTACTG GTTACAACTC TGACCAAGCG GATGGCGGAA
GATCTCACTG AATATCTCGA AGAACACGGT GAGCGCGTGC GTTATCTTCA CTCAGATATC
GACACCGTCG AACGTATGGA GATTATCCGC GACTTGCGTC TGGGTGAGTT CGACGTATTG
GTAGGGATCA ACTTACTGCG CGAAGGTCTG GATATGCCGG AAGTGTCGCT GGTGGCGATC
CTCGACGCTG ACAAAGAAGG CTTCCTGCGT TCCGAACGTT CGTTGATCCA GACCATTGGT
CGTGCGGCAC GTAACGTTAA CGGTAAAGCG ATTCTCTACG GCGATAAGAT CACCCCATCA
ATGGCGAAAG CGATTGGCGA AACCGAACGT CGCCGCGAGA AACAGCAGAA GTACAACGAG
GAACACGGCA TTACGCCGCA AGGCTTGAAC AAGAAAGTGG TCGATATCCT GGCGCTGGGG
CAGAACATTG CCAAAACCAA AGCGAAGGGC AGAGGAAAAT CGCGCCCGAT TGTTGAGCCA
GATAATGTGC CGATGGATAT GTCGCCTAAA GCGTTGCAGC AGAAGATCCA TGAACTGGAA
GGGTTGATGA TGCAACACGC GCAGAATCTG GAGTTCGAAG AAGCGGCGCA AATTCGTGAC
CAGTTGCATC AGCTGCGTGA GCTGTTTATT GCCGCGTCGT GA
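
The listing above groups bases in blocks of ten, so the whitespace has to be removed before any computation such as GC content. A minimal sketch follows, using only the first line of the listing as input; the helper names are hypothetical. The 53% GC content reported in the record refers to the full 2022 bp gene, and this example does not attempt to reproduce that figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used as a small worked input.
first_line = "ATGAGTAAAC CGTTCAAACT GAATTCCGCT TTTAAACCTT CTGGCGATCA GCCAGAGGCG"
seq = clean_sequence(first_line)

if __name__ == "__main__":
    print(len(seq), round(gc_fraction(seq), 3))
```

Passing the whole listing through `clean_sequence` in the same way gives the input needed to compute the gene-wide value.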
 
Protein sequence
MSKPFKLNSA FKPSGDQPEA IRRLEEGLED GLAHQTLLGV TGSGKTFTIA NVIADLQRPT 
MVLAPNKTLA AQLYGEMKEF FPENAVEYFV SYYDYYQPEA YVPSSDTFIE KDASVNEHIE
QMRLSATKAM LERRDVVVVA SVSAIYGLGD PDLYLKMMLH LTVGMIIDQR AILRRLAELQ
YARNDQAFQR GTFRVRGEVI DIFPAESDDI ALRVELFDEE VERLSLFDPL TGQIVSTIPR
FTIYPKTHYV TPRERIVQAM EEIKEELAAR RKVLLENNKL LEEQRLTQRT QFDLEMMNEL
GYCSGIENYS RFLSGRGPGE PPPTLFDYLP ADGLLVVDES HVTIPQIGGM YRGDRARKET
LVEYGFRLPS ALDNRPLKFE EFEALAPQTI YVSATPGNYE LEKSGGDVVD QVVRPTGLLD
PIIEVRPVAT QVDDLLSEIR QRAAINERVL VTTLTKRMAE DLTEYLEEHG ERVRYLHSDI
DTVERMEIIR DLRLGEFDVL VGINLLREGL DMPEVSLVAI LDADKEGFLR SERSLIQTIG
RAARNVNGKA ILYGDKITPS MAKAIGETER RREKQQKYNE EHGITPQGLN KKVVDILALG
QNIAKTKAKG RGKSRPIVEP DNVPMDMSPK ALQQKIHELE GLMMQHAQNL EFEEAAQIRD
QLHQLRELFI AAS
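
The protein sequence can be related back to the gene sequence by codon translation. The sketch below is a hand-rolled illustration, not the method used by the database: it builds the standard codon table, whose sense-codon and stop-codon assignments are shared by translation table 11 (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG). Only the first 30 and last 12 bases of the gene are used; `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


gene_start = "ATGAGTAAAC CGTTCAAACT GAATTCCGCT"  # first 30 bases of the gene
gene_end = "GCCGCGTCGT GA"                        # last 12 bases of the gene

if __name__ == "__main__":
    print(translate(gene_start))  # MSKPFKLNSA
    print(translate(gene_end))    # AAS
```

The two outputs match the first ten and last three residues of the protein sequence listed above.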