Gene Smed_3115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3115 
Symbol 
ID5323994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3261877 
End bp3264957 
Gene Length3081 bp 
Protein Length1026 aa 
Translation table11 
GC content65% 
IMG OID640792065 
Producthelicase domain-containing protein 
Protein accessionYP_001328776 
Protein GI150398309 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.186884 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCTTTC AATCCATGAT CCTGAGCGGC CGCGGTGTAA CCGCGGTGCT CGGACCTACC 
AATACCGGCA AGACCCATTA TGCTATCGAG CGCATGGTCG CGCATGACAG CGGCGTCATC
GGGCTGCCGC TGAGATTGCT CGCGCGTGAG GTCTATACGC GCCTGGTGGA GAAGGTCGGC
CATCATAATG TCGCGCTGAT AACGGGCGAG GAAAAGATCG CGCCCCACCG GGCCCGGTAC
TCGGTCTGCA CGGTGGAGGC GATGCCGCGC GAGACGACGG CCTCTTTCGT GGCGATCGAC
GAGGTCCAGC TCGCAGGCGA TCTCGAACGC GGTCACATCT TCACGGACAG GGTCCTGCAC
CTTCGCGGTC GCGGCGAGAC GCTGTTCCTC GGGGCTGCGA CGATGCGGCC CATTCTCGAA
TATCTTCTGC CCGGCATCAC CGTGGTCGAA CGGCCGCGTA TGTCGCAGCT CCTCTATGCC
GGATCCAAAA AGATCACGCG CCTGCCGAAC CGCTCCGCAA TCGTCGCCTT TTCGGCCGAC
GAGGTCTATG CGATAGCAGA GCTCATCCGG CGGCAGCGCG GCGGGGCGGC CGTCGTTCTG
GGCGCTCTTT CGCCGCGGAC CCGCAATGCC CAGGTGGCCC TCTACCAGGA GGGAGACGTC
GATTATCTCG TGGCAACCGA TGCGATCGGC ATGGGGCTGA ACCTCGACGT CGACCACGTG
GCCTTCGCCC AGGACCGGAA GTTCGACGGT TATCAATATC GCAACCTCAA TCCGGCGGAG
CTCGCGCAGG TGGCGGGGCG CGCGGGCAGG CACGTTCGGG ACGGGACGTT TGGCGTCACG
GGCCGCGTCG ACCCTTTCGA GGAGGAGCTT GTTCACCGGA TCGAATCGCA TGAGTTCGAC
CCCGTCCGGG TGCTGCAATG GCGCTCCAAG GCGCTGGATT ATTCTTCAGT GAAGGCGCTG
AAGAAAAGCC TGGAGGCGGC ACCGGCGGTC TCCGGGTTGG CGCGGGCGCT CCCGGCCGTC
GATCAGCAGG CTCTCGAGCA TCTGACGCGC TACCCGGAGA TCGTCGACGT AGCGACGACG
TCCGAGCGCG TGGAGAAGCT GTGGGAAGCC TGCGCACTCC CGGACTACCG CCGCATTACG
CCCGCCCAGC ATGCGGATCT GATCTCGAGT ATCTATGCCG ACCTCGTTCG CCATGGGACG
GTCAATGAGG ATTTCATGGC CGAACAGGTC CGGCGTGCCG ATCACACGGA CGGGGAAATC
GACACACTTT CGGCGCGAAT TGCGCAGATC AGGACCTGGA CCTATGTGTC CAATCGGCCC
GGGTGGCTTG CCGATCCGAC ACACTGGCAA GAAAAGACGC GGGAAATCGA AGATCGATTG
TCCGACGCGC TACATGAACG GTTGACGAAA CGCTTTGTTG ATCGCAGGAC ATCTGTGCTC
ATGAAGCGCC TGAGAGAGAA TGCGATGCTG GAAGCAGAAA TCAGTGTGAA TGGTGATGTC
TTCGTCGAGG GACACCATGT GGGTCAGCTA GCCGGTTTCC GGTTCACGCT CGTTGCGGGC
AGCGAGGGAA CGGACGCGAA GGCCGTGCAG GGCGCCGCCC AGAAGGCGCT CGCGCTGGAA
TTCGAAGCGC GCGCCGCCCG TCTCCACGCG GCCGGTAATG GCGATCTGGC CCTGTCGTCG
GACGGTCTCG TCCGTTGGCT CGGGGATCCG GTCGCGCGGC TCACGGCGAG CGACCATGTC
ATGCGTCCCC GGGTGATCCT GCTTGCCGAC GAGCAGCTCC AGGCCAATGC GCGCGAGCAT
GTACTCGCTC GGATCGAGCG CTTCGTAAAT CATCACATCA GCACGGTGCT GAAGCCGCTG
GACGATATCT CGCGCGCCGA GGACCTGGAG GGGCTTGCCA AAGGGCTTGC CTTCCAGCTC
GTCGAAAACC TCGGCGTGCT CTTCCGTCGC GACGTTGCAG AAGAGGTGAA GTCGCTCGAT
CAGGAGAGCC GTGCATCTAT CCGCAGATAC GGTGTTCGCT TCGGAGCCTA TCACATCTTC
CTCCCCGCGC TTTTGAAACC CGCTCCGGCG GAGCTGATCA CGCTGCTCTG GGCGCTGAAA
AACGATGGGC TCGACAAGCC GGGCTACGGC GACCTCATCC CGATGCTCGC TGCTGGGCGG
ACCTCCGTCG TCACCGACCC TTCCTTCGAG CGGACCTTCT ACAGGCTCGC GGGCTTCCGA
TTCCTCGGCA AGCGCGCGGT GCGCATCGAT ATACTCGAGC GGCTTGCCGA TCTCATCCGT
CCGCTTCTCC AGTGGAAGCC GGGGACGTCG CCGCGTCCCG AGGGCGCCTA TGACGGCCGC
CGTTTCGTGG CGACGACATC GATGCTGTCC ATACTCGGGG CAACGCCTGA CGACATGGAG
GAGATCCTCA AGGGGCTCGG CTATCGTGCC GATGCCGTGA CCGCCGAAGA GGCGGCCGCA
TTCCTGGCAA GTCAGAACGG CGCTACCCCG GCGGACACAC CTGCCGGAGA AGCCGCCGTG
AGCGAAACGG ATGCCGGGGC TCAACAGACC GGTGAACCGG CTGCAGAGGC CGAAGCGGCC
CGCATACCGC CAGCAGAGGC TGAGACGCGC GATGCACCGG CCGCGGAAGC GGAAGCGGCC
GATGCTGCGA CCAGCGGGCA GGAAGTCGCG CCGACCAGCG ACGAAACACC GGCGCCGGCC
GAGCCTGAGA CGCCGGCTGA ATCCAAGCCG GTGCTCCTGT GGCGGCCCGG CACGCGTCAG
GACAATCAGC GGCAAGGCGG TCGCCAAGGC GAACAGCGCC GCGGTGGACA GCGGCATGCC
CAGACCGAGG GACGCGAGGG CGGCCGCAGG CAGGGCGCCC AGGGCAAGCC GCCAGGCAAG
CCGAGGGAAG GCAAGCCTCA GGAGGGCAAG GGTTCGGAAG GAAGGCCGGG CGGACAGCGC
AAGGAGCGGG GCGACCGGTA CGATCGCAGC AAGCCGTCTC CCGCCAAGTT CGAGGGGCGC
CCGCCACGCA AGGAGAAGCC GATCGATCCG GATTCACCTT TCGCCAAGCT GGCTGCTCTG
AAGGAGCAGA TGAAGAAATA G
 
Protein sequence
MSFQSMILSG RGVTAVLGPT NTGKTHYAIE RMVAHDSGVI GLPLRLLARE VYTRLVEKVG 
HHNVALITGE EKIAPHRARY SVCTVEAMPR ETTASFVAID EVQLAGDLER GHIFTDRVLH
LRGRGETLFL GAATMRPILE YLLPGITVVE RPRMSQLLYA GSKKITRLPN RSAIVAFSAD
EVYAIAELIR RQRGGAAVVL GALSPRTRNA QVALYQEGDV DYLVATDAIG MGLNLDVDHV
AFAQDRKFDG YQYRNLNPAE LAQVAGRAGR HVRDGTFGVT GRVDPFEEEL VHRIESHEFD
PVRVLQWRSK ALDYSSVKAL KKSLEAAPAV SGLARALPAV DQQALEHLTR YPEIVDVATT
SERVEKLWEA CALPDYRRIT PAQHADLISS IYADLVRHGT VNEDFMAEQV RRADHTDGEI
DTLSARIAQI RTWTYVSNRP GWLADPTHWQ EKTREIEDRL SDALHERLTK RFVDRRTSVL
MKRLRENAML EAEISVNGDV FVEGHHVGQL AGFRFTLVAG SEGTDAKAVQ GAAQKALALE
FEARAARLHA AGNGDLALSS DGLVRWLGDP VARLTASDHV MRPRVILLAD EQLQANAREH
VLARIERFVN HHISTVLKPL DDISRAEDLE GLAKGLAFQL VENLGVLFRR DVAEEVKSLD
QESRASIRRY GVRFGAYHIF LPALLKPAPA ELITLLWALK NDGLDKPGYG DLIPMLAAGR
TSVVTDPSFE RTFYRLAGFR FLGKRAVRID ILERLADLIR PLLQWKPGTS PRPEGAYDGR
RFVATTSMLS ILGATPDDME EILKGLGYRA DAVTAEEAAA FLASQNGATP ADTPAGEAAV
SETDAGAQQT GEPAAEAEAA RIPPAEAETR DAPAAEAEAA DAATSGQEVA PTSDETPAPA
EPETPAESKP VLLWRPGTRQ DNQRQGGRQG EQRRGGQRHA QTEGREGGRR QGAQGKPPGK
PREGKPQEGK GSEGRPGGQR KERGDRYDRS KPSPAKFEGR PPRKEKPIDP DSPFAKLAAL
KEQMKK