Gene Smed_2172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2172 
Symbol 
ID5323032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2242627 
End bp2243616 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content63% 
IMG OID640791110 
Producthelix-turn-helix domain-containing protein 
Protein accessionYP_001327840 
Protein GI150397373 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0036608 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGGAA GTTTCGGCGT GGACAGCAGC ACTTCGCAAG CGCAGCATCT CGATCTGCTG 
ATATTGCCGG AGACCAATCT CATTCTTGTC GCTTCGGTGG TCGAGCCCTT ACGCGCCGCC
AATCGGATAG CCGGGCGCCC GCTTTACAGC TGGGCCCTGT TCAGCCCTGA CGGAAATGCG
ATCGAGACGA AAAGCGGCAT TCCCATTCCG GTGGCCGGGG CCTTCCGTCC GCAGCGCGAG
ACTGCGCCGC TCTTCGTGCT TTCCAGCTAC CACTGGCAGC GCAGCGCCAC CGTGCAGCTC
AAGATGTTCC TGTCGCAGAC GGCGCGGCAC AGGGAGACGA TGGCGGGAAT CGAATCCGGC
TCCTGGCTCC TTGCGGAGGC GAGCCTCCTC GACAATTTCT CGGCCACCAC CCATTGGGAG
GACTTCGAGG ACTTCTCGGC CGCCTATCCG CAGGTCACGA TGGTGCGCGA CCGGTTCGTC
ATCGACCGCA AGCGCATTAC CACCGGCGGC TCGCTGCCGA CGCTGGATCT GATGCTGGAA
CTGATCCGCC GCGCGCACGG CTACTCGCTG GCACTCGAAG TATCCCGCCT CTTCATTTAC
GAGCAGGAGC GCACGCGCGG GGACCTCCTG CAGGTGCCGG CCATCGGCAA TATGCGCATT
CTGGATGCGC GGGTCGGTGC AGCGGTAAAG CTTATGGAGG AGACGGTAGA GGCACCGCTG
ACACTCGCCC GGCTGGCGCG CCGGGCAGGC ATCAGTGCCC GGCATCTGCA GGATCTCTTC
AAGGAGACGA TGGGTGTCGC TCCGCACGAG CACTATCTGG CGCTCCGGCT CAACGCGGCG
CGTCGCAAGG TGATCGAGAC GCGGATGGCG TTCGCCGATA TCGCGGCGAT TTCCGGCTTC
AATTCCTCGT CTTCATTTTC CCGCAGCTAT AGGGCTCATT ATCGAGAAAG CCCAAGTGAG
ACACGCCGGC GGCTCAAGTT GAAGAACTGA
 
Protein sequence
MGGSFGVDSS TSQAQHLDLL ILPETNLILV ASVVEPLRAA NRIAGRPLYS WALFSPDGNA 
IETKSGIPIP VAGAFRPQRE TAPLFVLSSY HWQRSATVQL KMFLSQTARH RETMAGIESG
SWLLAEASLL DNFSATTHWE DFEDFSAAYP QVTMVRDRFV IDRKRITTGG SLPTLDLMLE
LIRRAHGYSL ALEVSRLFIY EQERTRGDLL QVPAIGNMRI LDARVGAAVK LMEETVEAPL
TLARLARRAG ISARHLQDLF KETMGVAPHE HYLALRLNAA RRKVIETRMA FADIAAISGF
NSSSSFSRSY RAHYRESPSE TRRRLKLKN