Gene Smed_0686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0686 
Symbol 
ID5321523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp738473 
End bp740779 
Gene Length2307 bp 
Protein Length768 aa 
Translation table11 
GC content64% 
IMG OID640789623 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001326377 
Protein GI150395910 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.194488 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACGA CCGCGAGACC CATCGCCGCC ATCATTGCCA GCGAAATCAA GGCGAGCGCC 
GCACAGGTTG CAGCCGCCGT CGAATTGTTG GACGGTGGTG CGACCGTGCC TTTCATCGCC
CGCTACCGCA AGGAAGCCAC GGGCGGGCTC GACGACACGC AGCTGCGCGT CCTGTCGGAG
CGCCTCACCT ATCTGCGCGA GCTCGAGGCG CGGCGTGCAT CGATCCTCGA TTCGATCCGC
AGCCAGGACA AGATGACCGA TGAACTCGAA CGCAAGATCG AGGGCGCGGT TACCAAAGCG
GAGCTCGAAG ACATCTACCT GCCCTTTAAG CCGAAGCGCC GCACCAAAGC GGAGATCGCG
CGCGAGCGGG GCCTTGGCGC GCTGGCGGAC GCCATCCTTT CCGACCGTTC GACAGCGCCT
GCCGAACGTG CTGCCGCCTT TCTCACCGCC GATGTGCCGG ACGCCAAGGC CGCACTCGAT
GGTGCCCGCG ACATCATCGC CGAGGGCATG ACCGAGAATG CCGAGCTTCT TGGGCAGCTG
CGGAGCTACA TGCGGGGTGC AGCCTTCCTG CGCGCCCGCG TCGTGGGTGG CAAACAGGAG
TCCGGCGCAA AGTTCTCCGA CTATTTCGAT CACTCCGAGC GATGGGCGAC GGTGCCCGGC
CACCGCGCGC TTGCGATGCT TCGCGGCTGG AACGAGGAGG TCCTGTCCGT GGACATCGTT
GTCGACGAGG ATGCGGCGCC TTCGCATAGA CCGATCGAGC GTATGATCGC GGCTGCCTAC
AGCGTTGGCG ACCGGCTGCC CGGTGACAAA TGGCTGCTGG AGGTCATCGG CTGGACCTGG
CGCGTGAAGC TTTCGCTTTC GCTATCACTG GACCTGATGC GCGAGCTTCG AGAGCGCGCC
GAGGAGGAGG CCATCCGCGT CTTCGCGCGC AATCTCAAGG ACCTGCTGCT TGCAGCGCCT
GCGGGCTCGC GTCCGACCAT GGGCCTCGAC CCTGGCATCC GGACGGGCGT CAAGGTCGCG
ATCGTCGACG GTACCGGCAA GCTCCTCGAC ACGAGCACGG TCTATCCATT TCCGCCGAAG
AACGACATTC GCGGGACTCA GGCAGAGCTT GCGGCGCTTG TGCGCAAACA CAAGGTGGAA
CTTATCGCGA TCGGCAACGG CACCGGGAGC CGTGAAACCG AAAAGCTCGT CGCGGATATG
CTCGCTGCCA TGCCGGCGCC GAAGCCGACA AAGGTGATCG TCTCCGAGGC GGGAGCATCG
GTCTATTCCG CTTCGGAGAC GGCGGCTGCC GAGTTCCCCG GCCTCGACGT GTCCTTGCGG
GGTGCCGTTT CCATCGCGCG CCGGCTCCAG GATCCCTTGG CCGAGCTCGT GAAAATCGAA
CCGAAGTCGA TCGGGGTCGG TCAATATCAG CACGACGTTG ATCAGTCGAA GCTCAGCCGT
TCGCTGGATG CCGTCGTCGA GGATGCCGTG AACGCGGTCG GGGTCGATCT CAACACCGCC
TCGGCGCCGC TGCTGGCGCG CGTTTCGGGG TTGGGTAAGT CCTCGGCGGA GGCGATCGTT
GCGCATCGCG ATGCGCTGGG CGCCTTCGCC AGCCGCAAGG ACCTTTTGAA AGTGCCTCGC
CTCGGTGCCC GCACTTTCGA ACAATGCGCC GGCTTCCTCA GGATCACGAA CGGCTCCGAG
CCGCTCGATG CGTCTGCGGT GCATCCGGAG GCTTATGGAG TCGCAAAGAA GATCGTCGCC
GCCTGCGGTC GTGATGTGCG TTCTCTGATG GGCGACAGCG CCGAACTGAA GAAGCTCGAT
CCCCGCATTT TCGTCGATGA ACGCTTCGGC CTGCCAACTG TCAAGGACAT TCTCGCTGAA
CTGGAGAAAC CGGGTCGCGA TCCACGCCCG AGCTTCAAAA CGGCCACCTT CGCGGAAGGG
ATCGACGACA TCAAGGATCT GAAGGTCGGC ATGCGCCTTG AGGGCACGGT GACCAATGTC
GCCGCCTTCG GCGCTTTCGT CGATATCGGC GTGCATCAGG ACGGTCTCGT CCATGTGTCG
CAATTGGCCG ACCGCTTCGT CAAGGATCCG CATGAGGTCG TCAAAGCCGG GGACGTCGTG
CACGTCCGTG TCACGGAGGT CGATGTGGCG CGAAAGCGCA TTGGACTGTC GATGCGCAAG
GAAGGTGGCG CCGAAACGCC CCGAGAGGCG AGGGGGACGG CGCCGAACGG CGGCAATCGG
GGTGCGCCCG CTCGACCGCA GCCGCCGCAG CAGCCGGCGC AAGGCGCCTT CGGTGAAGCG
CTGATCGCGG CGATGAAGAA AAAATGA
 
Protein sequence
MVTTARPIAA IIASEIKASA AQVAAAVELL DGGATVPFIA RYRKEATGGL DDTQLRVLSE 
RLTYLRELEA RRASILDSIR SQDKMTDELE RKIEGAVTKA ELEDIYLPFK PKRRTKAEIA
RERGLGALAD AILSDRSTAP AERAAAFLTA DVPDAKAALD GARDIIAEGM TENAELLGQL
RSYMRGAAFL RARVVGGKQE SGAKFSDYFD HSERWATVPG HRALAMLRGW NEEVLSVDIV
VDEDAAPSHR PIERMIAAAY SVGDRLPGDK WLLEVIGWTW RVKLSLSLSL DLMRELRERA
EEEAIRVFAR NLKDLLLAAP AGSRPTMGLD PGIRTGVKVA IVDGTGKLLD TSTVYPFPPK
NDIRGTQAEL AALVRKHKVE LIAIGNGTGS RETEKLVADM LAAMPAPKPT KVIVSEAGAS
VYSASETAAA EFPGLDVSLR GAVSIARRLQ DPLAELVKIE PKSIGVGQYQ HDVDQSKLSR
SLDAVVEDAV NAVGVDLNTA SAPLLARVSG LGKSSAEAIV AHRDALGAFA SRKDLLKVPR
LGARTFEQCA GFLRITNGSE PLDASAVHPE AYGVAKKIVA ACGRDVRSLM GDSAELKKLD
PRIFVDERFG LPTVKDILAE LEKPGRDPRP SFKTATFAEG IDDIKDLKVG MRLEGTVTNV
AAFGAFVDIG VHQDGLVHVS QLADRFVKDP HEVVKAGDVV HVRVTEVDVA RKRIGLSMRK
EGGAETPREA RGTAPNGGNR GAPARPQPPQ QPAQGAFGEA LIAAMKKK