Gene Smed_4083 details


Gene Information

Locus tag: Smed_4083
Symbol:
ID: 5317902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 544377
End bp: 545402
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 63%
IMG OID: 640775890
Product: AraC family transcriptional regulator
Protein accession: YP_001312823
Protein GI: 150376227
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
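
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in this record.
length = gene_length(544377, 545402)
print(length, protein_length(length))  # prints: 1026 341
```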


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0474473
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAAGG CAGCGCTGCG CTCGAAAGCT GAAATCGGTC TCATGCTTTA TCCGGGCTGC 
CAGATGGCGA TGGTTCACGG CATGACGGAC CTCATCAACA TAGCCCGTCA ATTCTCCGCC
GAGCGGGGCG GGGCGGTGGC ACGGATCAGT CACTGGCGGC TTCGGGACGA CGGCTGTCTT
GCGCGAAGTT TCGACACCCA TCCCGAACTC GGATCACGGG AGATGCCGGA GATACTGCTT
GTCCCCGGCA GGTTGACCGG GCCGATGGAA GCGGAGGAGG CCGCGCCCTA TGCTCGCTGG
CTGCTCGACG GGCATGCCCA AGGCGCGACC CTCGCATCGA CTTGCGGCGG GACCTTCGTG
CTTGCGGCGA CGGGCCTGCT CAAGGGCCGG CCAGCGACGA CGCACTGGCT CTTCGCCAAT
GCTTTCCGTG ACCGGTTTCC GGATGTCCGG CTCGACACGG ACAAGATCGT GATCGAGGAC
GGCGACATCG TAACTGCCGG CGGCCTCATG GCATGGACAG ACCTCGGCAT GCGCCTCGTG
GATCGTCTGT TCGGTCCGAC GGTAACGGTC GAGACCGGGC GTTTCCTTCT TATCGACCCC
GCCGGACGCG AGCAGCGGCA CTATTCCAGC TTTTCGCCGC GCCTCAATCA CGGCGACGAA
GTGATCCTGA AGGTGCAGCA TTGGCTGCAA GCGCGTGAGG CTCGCGCCGT CAGCGTTGCG
CAGATGGCAA AAGTGGTGGC GATGGAGGAG CGTACTTTCC TGCGCCGTTT CAAGGCGGCG
ACCGGAATGA AGCCGATCGA ATATGCCCAG CATCTGCGCG TCGGTAAGGC GCGGGAACTC
CTGGAGTTCA CCAGACGTTC GGTTGAACAG GTCGCCTGGG CAGTCGGCTA CGAGGACGCG
GCCGCCTTTC GCAAGCTGTT CCATAGGATC GTCGGGCTGT CGCGAGGTGA TTACCGGCAT
CGATTCGCGG TAGCGGGCGA GGCCTTTTCG CGCCCCGTCA CCGGCTTGGA ACAAATTGAC
GTTTGA
 
Protein sequence
MPKAALRSKA EIGLMLYPGC QMAMVHGMTD LINIARQFSA ERGGAVARIS HWRLRDDGCL 
ARSFDTHPEL GSREMPEILL VPGRLTGPME AEEAAPYARW LLDGHAQGAT LASTCGGTFV
LAATGLLKGR PATTHWLFAN AFRDRFPDVR LDTDKIVIED GDIVTAGGLM AWTDLGMRLV
DRLFGPTVTV ETGRFLLIDP AGREQRHYSS FSPRLNHGDE VILKVQHWLQ AREARAVSVA
QMAKVVAMEE RTFLRRFKAA TGMKPIEYAQ HLRVGKAREL LEFTRRSVEQ VAWAVGYEDA
AAFRKLFHRI VGLSRGDYRH RFAVAGEAFS RPVTGLEQID V
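
The gene and protein sequences in this record can be related by translating the coding sequence codon by codon. Below is a minimal sketch in plain Python, assuming the standard codon assignments (NCBI translation table 11 uses the same codon-to-amino-acid mapping as the standard code and differs in the permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCCAAAGG CAGCGCTGCG CTCGAAAGCT"))  # prints: MPKAALRSKA
```

The same functions can be applied to the full gene sequence to compare the result against the listed protein sequence and GC content.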