Gene Smed_5440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5440 
Symbol 
ID5319742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp400856 
End bp404011 
Gene Length3156 bp 
Protein Length1051 aa 
Translation table11 
GC content61% 
IMG OID640777203 
ProductUvrD/REP helicase 
Protein accessionYP_001314135 
Protein GI150377540 
COG category[L] Replication, recombination and repair 
COG ID[COG1074] ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.202358 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGACC ACATCACGCT CGTTCCGGCC GGCGCAGGCT CAGGCAAGAC ATACCGCATC 
GAGACCACGC TCGCAGATCT TGTTGTCTCG GGCGAGGTGT CGGCAGATCG TATCCTCGCG
GTCACATTCA CCGAAGCCGC GGCCTCCGAT CTGCGTACAC GTATCAGGGG TGCCCTGTTG
GCCAGGGGGC GCATCGACGA GGCTCTGGCA ATTGACGGCG CCTATGTCGG CACGATCCAC
GCCCTTGGTC TCCGGCTGCT GACGGAGCAT ACTTTTGCCG CGAACCGCTC TCCTGCAAGC
CGGCTGTTAT CGGACTCAGA GCGAGATCTG CTCATCCGCC AGCAGATCGG TCGCGCTCCT
GCCCTCGCTC TAATCCTAAA CGCGCTTCCG CGCTTTGGCT ATGTGTGGGA TCGCAATACG
GACGCCACTG GCGAAGAACA GCTGCGCAAA AAGGTCATGC AAACCATTGA TCTGCTGCGC
TCGCTGGGTC AGCGCGGAGC CGCCTCAGAG ATCATTGACC CGGCCATCGC CGACCTGCGT
GCGACCTACG GCACCACGAA GGACTCAGCA CGGCTCGAAA TGGCCTTGTC TCGTGCGGTC
AATGCCCTGC TCACTTCGTT TCCCGAAAGC ATCGCCGATA CCGTTTCCGC CAAGACGGCG
CGAGAAGCCT TCGACAAGGA TTTCGAGGCG CTCCGCCGCG CCTCGCGAAC GGACGCCCTC
GGCAGCGATT GGGCGTTGTG GCAAAGACTT CGCAAGCTGC GGCTGTCGAA CAGCAAAACC
AAAGCGCCCG AGGGCTACGA CCCGCTCGCC AGTGCCGTCG TCGCAGCCGC CGACGAGCTC
CGCTACCATC CGGGGCCGCT CATGGATGCC GAGAGGCATT TGACGGCTTT GGTCCACGGC
GCACAGGAGA TCATGAGCGC TTATGAGAAC GAAAAGCGCG CCAGGGGATT GATCGACTAC
GCCGACATGA TCGTCGAAAC TGAAGTCCTG CTACGCACGC GTTCGGACAT TCTCGATGCA
GCGTTGGCCG AGATCGACTG CGTCGTCATC GATGAATTCC AGGATACAAA TCCCGTGCAG
TTCGCGATGC TGTGGCGACT TGCCGCCCGC GCCCCACGCG CGTTGATCGT CGGCGACCGC
AAACAGGCGA TCATGGGCTT CCAGGGCGCC GACCCGCGTC TGTCTGAAGC ACTCGACGCT
GCACACCCAG CGAGTGTCGA CCCGCTGACC CAGAATTGGC GCTCCGACCC GAGGATCATG
ACGTTCGTCA ACGCGATCGG GCCGGCGCTC TTCCCGAGCG GCTACGACGC GCTCAAGTCG
ATGCGGCCCA AGAGCGGCGA GACAGCACTC GAATTTATCG AATTCGATGG GAGCAAGGCG
AAAGTTTTCG AAGGGATTGC CGCTCATGCC CGCGATCTCA CCGACCAAAA GATCCTCATC
ATTGATAAGA GCACGCAGGA AACTCGTCAG GCTCGCCCAG CGGACATCGC GATTTTGGTC
TATTCGGGTA TCGATGCCCG AAGGGTCGCA ACATCGCTGC GTGACTTCGG GCTTCCGGTT
CGTCTTGCCG AGGACGGCTG GTTCCTGTCG CAGATCATCC AGGTTGCGCG CGCCGCACTG
GCTCTCGTGG CCGACCCAAC CGACCGGTTG GCAGCGCTGC TCCTTTTGTC ACTGGGGCCG
GACCGCATGT CTCTGGATGC TTCGCTGGCT GCGGCGATCG ACGACAACCT CATTCAATTC
GACCGTGTCC GCGCCCTCGC GGAATTCGCA GAGGAGGCTC GCCATCTAAC CGCCTCCGCC
GCGCTCCCCC GTGTGTTGCA GATAGCAGGT ATTGAAGAGT GGATTTCGAC GTTACCCGAC
AGAGCGGCTG CGGAAGCTGA TCTCGGTCGC CTTTTTGCGG AGGCAAGTGC TTTCGACAGC
GCTGAGACAA GCTTGCGTGA AGCCGTGGGT TTCCATGGAA ACTCTCTTCA ATCCTTCCTC
GGCTGGCTTG CCGACCAGGC CGAACGGGGC CTTGACTATC GGCCTGACCG TGAGGGTTGG
GAGGTCGAAG GCATCGAAGT CTCCACCTGG CATGCCTCCA AGGGGCGTGA GTGGCCAATC
ACCATTGTCG GCGGCATGGA TTTCTCGTTC CCGGAACGCG GCAATACGAT GCGCGCGGAG
TTCACATCGT TCGGCAACTT CGACAACGTC TTGGATTCTG CCGGGCTGAG TTGGTTTCCA
TCCTTCGACT GTCCCGAGGC GCAAATCGTC TTTGCCGATA GACGCGTCGA ACAGGATGAA
CGTGAGGCCG CCCGCGCACT TTACGTTGCG CTGACGCGCG CCCGTGATCG TCTGGTTCTG
GCTTTGCCGC TAAAGGAGGG AAGCCCCGAC AAACGTCCCG AGACCATGGC AGACCTTCTC
CGCCGCCGGA CGCAACTCGG CCTCGGATCA GGCACGGTTA CGGCATGCGG GCAACAAGTT
CCGGCGCTCT GCCGTACGAT CCCCAAGGAT ACTGAGTACG TCTATCCCGA TGCATCGGCG
GATTCTGGCG TTCACAAGGT ATGGGGCGTT GACGAGACTT CCGAATTGGT TGCGCGAACG
CCATGGCGCA GTAGCCCCTC AAGTCTCGCT GCCCCGGATG GTCGGCTTGG GCCAGCACTC
ACCCATATCG ACCTCGGTGT GCCGGTCCCG AAGCAAAAAT TCGGTTCGGC CACGGAACGC
GGTACCGCCT ATCACCTCGC CTTTCGGGTT CTTGCCGAGC GGCCCGAATT TTCTGCACGG
CTCCCGGCTG CGACTGGTCT TTCCGACGAT ACGATCAAAG CCATAGAGGG CCAGGCAAGC
GCTTTGCGGG CCTGGCTCGG CGGTCTGGGT TTCCATCAAT TGAGCTTCGA GGTACCGCTT
CAGCTCCGCG CCGACGATGG CTCGGAAACC AACGCCATCA TTGACCTGCT GGCGGAGAGC
GACGACGCGC TCATAATCGT GGATCACAAG ACTGGTCCCT GCCCCGATCC CAATTTGCGG
TTTGGAGGGT ATCTGCCTCA ACTCACGGCA TACGCCGACT TACTCACGGA ACGATACCCG
GAAAAGCCTG TCCGCTTCCT CGTCGTCAAT TGGATGGACG AAGGCCACAT CAGCATTGCG
GATGTCGCCG CGCTAGCAGC GGAGGAAGCT GCCTGA
 
Protein sequence
MSDHITLVPA GAGSGKTYRI ETTLADLVVS GEVSADRILA VTFTEAAASD LRTRIRGALL 
ARGRIDEALA IDGAYVGTIH ALGLRLLTEH TFAANRSPAS RLLSDSERDL LIRQQIGRAP
ALALILNALP RFGYVWDRNT DATGEEQLRK KVMQTIDLLR SLGQRGAASE IIDPAIADLR
ATYGTTKDSA RLEMALSRAV NALLTSFPES IADTVSAKTA REAFDKDFEA LRRASRTDAL
GSDWALWQRL RKLRLSNSKT KAPEGYDPLA SAVVAAADEL RYHPGPLMDA ERHLTALVHG
AQEIMSAYEN EKRARGLIDY ADMIVETEVL LRTRSDILDA ALAEIDCVVI DEFQDTNPVQ
FAMLWRLAAR APRALIVGDR KQAIMGFQGA DPRLSEALDA AHPASVDPLT QNWRSDPRIM
TFVNAIGPAL FPSGYDALKS MRPKSGETAL EFIEFDGSKA KVFEGIAAHA RDLTDQKILI
IDKSTQETRQ ARPADIAILV YSGIDARRVA TSLRDFGLPV RLAEDGWFLS QIIQVARAAL
ALVADPTDRL AALLLLSLGP DRMSLDASLA AAIDDNLIQF DRVRALAEFA EEARHLTASA
ALPRVLQIAG IEEWISTLPD RAAAEADLGR LFAEASAFDS AETSLREAVG FHGNSLQSFL
GWLADQAERG LDYRPDREGW EVEGIEVSTW HASKGREWPI TIVGGMDFSF PERGNTMRAE
FTSFGNFDNV LDSAGLSWFP SFDCPEAQIV FADRRVEQDE REAARALYVA LTRARDRLVL
ALPLKEGSPD KRPETMADLL RRRTQLGLGS GTVTACGQQV PALCRTIPKD TEYVYPDASA
DSGVHKVWGV DETSELVART PWRSSPSSLA APDGRLGPAL THIDLGVPVP KQKFGSATER
GTAYHLAFRV LAERPEFSAR LPAATGLSDD TIKAIEGQAS ALRAWLGGLG FHQLSFEVPL
QLRADDGSET NAIIDLLAES DDALIIVDHK TGPCPDPNLR FGGYLPQLTA YADLLTERYP
EKPVRFLVVN WMDEGHISIA DVAALAAEEA A