Gene Smed_4158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4158 
Symbol 
ID5319207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp631356 
End bp632606 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content63% 
IMG OID640775963 
Productalanine racemase 
Protein accessionYP_001312896 
Protein GI150376300 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.626127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGCGGC CGTACAATCG GCCTGCATTC TCCGCCGGAA GCGGAGAAAA ACACCCACTT 
CGAATTGTGG CGGGTCCCCT GCGGGGCCCG TCACTGCCGA GGGGCGGATT CGCAAAGATC
TGCTTTCGCC GAATAATCAT AGGTCTCTTA GGGAAAACCT TTTCCCATAT TGACTTTGAG
AGTGAGAAAA GGTTTTCTCA TCCCTGGTTC TCTCGGGAGG AGGCGAAACT GAGCTCTCAG
CGCGGCAAGG GGCATCGCGT CACCATTCAC GATCTGGCGA GGGTCGCCGG CGTGAGTGTT
TCGACCGTGT CGAAGGCACT CAACGACAAT GGCCGCATGG CCGCCGACAC GCGCGAGCGG
ATCAAGACTC TTGCCGCGGA GATCGGCTTT CGTCCGAATG CGCTGGCGAA AGGGTTGCTC
AGCAACCGGA GCTTCACCGT CGGTCTTCTG ACGAACGATA CCTATGGGCG CTTCACGCTT
CCGGTCATGG CCGGAATATC CGAGGCGCTT GTGGATCATG GCGTGTCGGT CTTTCTTTGC
GCCATCGAGG ATGACCCGGC CCTGGGAAAA ATCCATGTCG ACGCCATGCT GGACAAGCAG
GTGGACGGCA TCATCGCGAC GGGCAAGCGG GTCGACAGGT CTCTCCCGGT CGACCTCGCT
GGCCTGCCGG TGCCGGTCGT CTACGCCTTC ACCAAGGGCG AGCCGGGCAG CGTGACGCTG
ACGTCGGATG ACCGGCACGG AGCGAGGCTT GCCACCGAGT GGCTGAAGGA GCTTGGCCGC
CAGCGGCTTG TCCATATCAC CGGCCCGCGG GAATTCGTAT CCGCTGTGGA GCGCGCTGAG
GCGTTCCGTA CCGTGGCTGG CAACGGCGCG CCGGTGCTGC ACGGCGTCTG GTCGGAGGCC
TGGGGCCACG AAGCGATCGA CAGGATCTGG AAAGAGGGCG GCGAAAGGCC CGACGGCATC
TTTTGCGGCA ACGACCAGAT CGCCCGCGGC GTGGTCGATG CGCTTCGCGA GCGCGGCGCC
CGGGTGCCGG GGGATGTCTC GGTCATAGGT TTCGACAATT GGGAGATCAT GGCGGCACAG
ACACGGCCGC CGCTGACGAC CATCGACACG AACCTGAAGG AACTTGGGCG CGAAGCGGGC
CTGATGGTGC TTGCGCTTGC GGAGGGGCGG GCGATCGAAC CCGGTCTGCG CAGGTTGCCC
TGCAAACTGG TCATAAGGGA CTCCTGCGGA GGCGGGCGCC GGCAGAACTG A
 
Protein sequence
MWRPYNRPAF SAGSGEKHPL RIVAGPLRGP SLPRGGFAKI CFRRIIIGLL GKTFSHIDFE 
SEKRFSHPWF SREEAKLSSQ RGKGHRVTIH DLARVAGVSV STVSKALNDN GRMAADTRER
IKTLAAEIGF RPNALAKGLL SNRSFTVGLL TNDTYGRFTL PVMAGISEAL VDHGVSVFLC
AIEDDPALGK IHVDAMLDKQ VDGIIATGKR VDRSLPVDLA GLPVPVVYAF TKGEPGSVTL
TSDDRHGARL ATEWLKELGR QRLVHITGPR EFVSAVERAE AFRTVAGNGA PVLHGVWSEA
WGHEAIDRIW KEGGERPDGI FCGNDQIARG VVDALRERGA RVPGDVSVIG FDNWEIMAAQ
TRPPLTTIDT NLKELGREAG LMVLALAEGR AIEPGLRRLP CKLVIRDSCG GGRRQN