Gene Shewmr4_2061 details

Gene Information

Locus tag: Shewmr4_2061
Symbol:
ID: 4252634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2451855
End bp: 2453876
Gene Length: 2022 bp
Protein Length: 673 aa
Translation table: 11
GC content: 52%
IMG OID: 638118680
Product: excinuclease ABC subunit B
Protein accession: YP_734191
Protein GI: 113970398
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
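
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the record above.
start_bp = 2451855
end_bp = 2453876
gene_length_bp = 2022
protein_length_aa = 673


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds(length_bp: int) -> int:
    """Amino-acid count of a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = residues_from_cds(computed_length)

print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```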


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000836148
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCAGAAT CTGTTTTTCA GCTTGAATCT CAATTTGCTC CCGCAGGGGA TCAGCCCACG 
GCCATTGCCA AGTTGGTCGA TGGCTTAGAA TCGGGCCTAG CTTGCCAAAC CCTATTGGGG
GTAACAGGCT CGGGCAAGAC ATTCACTATC GCCAATGTGA TCGCCCAACT GGGGCGCCCA
ACCATTATTA TGGCGCCAAA CAAGACGCTG GCGGCGCAGC TTTATGGCGA GATGAAAGAG
TTTTTCCCCA ATAATGCGGT GGAATACTTT GTCTCCTATT ACGATTATTA CCAGCCAGAA
GCCTATGTGC CCGCATCAAA CACCTTTATT GAAAAGGATG CGTCGGTTAA CGCCCATATC
GAGCAAATGC GACTCTCGGC GACTAAAGCC TTGTTGGAGC GTAAGGATGT CGTCTTGATT
GCCTCTGTAT CGGCAATTTA CGGTCTGGGC GATCCCGATT CCTACATGAA GATGCTTTTG
CACCTACGCC AGGGCGATAC CATGGGGCAG CGGGATATTC TTAAGCGCTT GAGTGAGCTG
CAATATACTC GTAACGATCT CGAGTTGCAG CGCGGTACTT TCCGCGCCCG TGGTGAAGTT
ATCGATATTT TCCCCGCCGA TTCTGACCGC TACGGGATTC GGGTAGAACT CTTTGACGAT
GAAATTGAGC GCCTAAGCGA ATTTGACCCG TTAACGGGGC AGATAGTTAA GCGTATCGCG
CGCACCACTG TGTATCCCAA AACCCACTAT GTGACGCCAC GGGAAAAAAT CCTTGAAGCG
ACTGAGTCAA TTAAGCAAGA GCTGCGCGAG CGTAAGCAGT ATCTGCTCGA CAACAATAAG
CTCATCGAAG CGCAGCGGAT CCATGAGCGG GTGCAATACG ATATCGAGAT GATGGTTGAG
TTGGGTTATT GCTCCGGCAT TGAGAACTAC TCCCGCTATT TGTCGGGACG GGCGCCGGGA
GAAGGGCCAC CAACCTTGCT GGATTATTTA CCCGCCGATG GTTTGTTGAT CATCGACGAG
TCCCACGTCA CTGTGCCGCA AATTGGTGCC ATGTATAAGG GTGACCGCTC CCGTAAGACC
ACGCTTGTGG AATATGGCTT CCGTTTACCC TCGGCGCTGG ATAACCGGCC ATTGAAGTTC
GAAGAGTTTG AGCAATTGAT GCCGCAGACC ATTTATGTGT CGGCAACGCC TAATCCTTAC
GAACTGGAGA AAAGCGACGG CGAGATTGTT GAGCAAGTCG TGCGGCCAAC GGGATTGCTC
GATCCCGAGT TAGAAGTGCG CCCGGTTAGC ATTCAAGTGG ATGATTTACT CTCCGAGGTC
GCTAAACGCG TCGCCGTCAA TGAGCGGGTG CTTGTTACCA CCTTAACCAA GCGCATGTCG
GAGGATTTAA CCGAATACCT CGATGAACAT GGCGTCAAAG TCCGTTATTT GCACTCGGAT
ATCGATACCG TGGAGCGGGT GGAGATCATT CGCGATCTGC GCCTTGGTAA GTTTGATGTG
CTGGTCGGTA TCAACTTGTT ACGCGAAGGC TTAGATATGC CGGAAGTCTC CTTGGTCTGT
ATTCTCGATG CGGATAAGGA AGGCTTTTTA CGTTCGGAGC GTTCACTGAT TCAGACCATT
GGTCGCGCCG CTCGTAACGT CAATGGCAAG GTTATCCTCT ATGCGGATAG GATCACTCAG
TCGATGGCCA AGGCGATGGG AGAAACTGAG CGCCGCCGTG AGAAACAGCG CGCCTACAAT
CTTGAGCACG GCATTGTGCC TAAAGGGGTG GTGAAACGCA TTACCGACGT AATGGATGTC
GATGATGGTA GAGAGTCTGA AAAAGGTTAT CGTCAGTCAT CACTGAATAA AGTGGCTGAA
CCTAAAGCCA AACGTTATCA AGCCGATGCG GCGCAGCTGA GCCATGATAT CGACAAGCTC
GAGAAGCAAA TGCATGAACA TGCGCGTAAC TTGGAGTTTG AACAGGCAGC GGCGCTACGC
GATGAGGTGA AACGGTTACG GGAGTTGCTG ATCACCGCTT AA
 
Protein sequence
MSESVFQLES QFAPAGDQPT AIAKLVDGLE SGLACQTLLG VTGSGKTFTI ANVIAQLGRP 
TIIMAPNKTL AAQLYGEMKE FFPNNAVEYF VSYYDYYQPE AYVPASNTFI EKDASVNAHI
EQMRLSATKA LLERKDVVLI ASVSAIYGLG DPDSYMKMLL HLRQGDTMGQ RDILKRLSEL
QYTRNDLELQ RGTFRARGEV IDIFPADSDR YGIRVELFDD EIERLSEFDP LTGQIVKRIA
RTTVYPKTHY VTPREKILEA TESIKQELRE RKQYLLDNNK LIEAQRIHER VQYDIEMMVE
LGYCSGIENY SRYLSGRAPG EGPPTLLDYL PADGLLIIDE SHVTVPQIGA MYKGDRSRKT
TLVEYGFRLP SALDNRPLKF EEFEQLMPQT IYVSATPNPY ELEKSDGEIV EQVVRPTGLL
DPELEVRPVS IQVDDLLSEV AKRVAVNERV LVTTLTKRMS EDLTEYLDEH GVKVRYLHSD
IDTVERVEII RDLRLGKFDV LVGINLLREG LDMPEVSLVC ILDADKEGFL RSERSLIQTI
GRAARNVNGK VILYADRITQ SMAKAMGETE RRREKQRAYN LEHGIVPKGV VKRITDVMDV
DDGRESEKGY RQSSLNKVAE PKAKRYQADA AQLSHDIDKL EKQMHEHARN LEFEQAAALR
DEVKRLRELL ITA
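
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The following is a minimal sketch of that translation, not the database's own method. It builds the codon table from the standard NCBI amino-acid string, which table 11 shares with table 1 for amino-acid assignments. The start-codon set used here (ATG, GTG, TTG) is a simplifying assumption covering only the most common initiators, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: only the most common table 11 initiator codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS (spaces and newlines allowed) into a protein string."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # An initiator codon is read as methionine regardless of its usual meaning.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop a single terminal stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence above.
head = "GTGTCAGAAT CTGTTTTTCA GCTTGAATCT"
print(translate_cds(head))  # MSESVFQLES
```

The printed residues match the first ten residues of the protein sequence listed above.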