Gene Smed_5238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5238 
Symbol 
ID5319540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp199984 
End bp201030 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content57% 
IMG OID640777015 
Producthelix-turn-helix domain-containing protein 
Protein accessionYP_001313947 
Protein GI150377352 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACAA TGCCAGCGTC GAGTGTCGGG CGGGGTATTG ACGTCAAATT TGCGTGCTCG 
ATCGATGTAA GTACCCAGTC CTATCCTCAG CAAGATCAGT TTGAAATATT CAAGAACGCA
CATGCCGGCG TCGCGGATCT AACCTTCTGC AAGAGTGCGG ACGGGTCCTT TCCAGCCCGG
CAGATGGTGT GGATCCTGGG CTCAATGGTA ATTGTCTCCA GTATGCTGCC TGGAGCAGGT
TACGCACACG AGTGGCGGCA TTTAAAGAAG CCGGCCTTAG ACAACTGGTA CCTGTGGATT
CCACGACGGT CCGTCGATCA GGGCGTTGGC GCGCGGACCA TGCCTCATCT GCACTGTCTG
GCGAAGCCAT TTCACGCCAT CGTCGAGGAC GAGGGCGCAT GTGCGATCTA TTTTCCGTCA
GAGGGGTTCG TTCCTGCATC GATCCTTGAT TGCTTGCTCG ATAGATCTGT CCAGGGCGCG
TCGGGGCGTC TCCTCACTGA CTATCTCATG CTCCTGGTTC GATCACTGCC CGACATGACC
GTGGCAGAAG TTCCATACGT CGTCGAAGCA ACACGCAATC TCGTCGTTGC ATGTTTGGCG
CCTTCGCCTG ATCGTGTTGC AGACGCTCAA AGACCGATTG CGGCGGTCGT TCTGGAGCGG
GCCAAACGCA TGATAACCTC GAGACTTGTT GATCGCTCGC TCACCCCCGA GGCGATTTGC
TGCGAAATAG GCATCTCGCG CTCGAGGTTG TACAGGTTGT TCGAGCCGCT TGGTGGCGTC
GCGGCCTACA TCCGACACCA GCGCTTGGTT CGGACCCGCA GCGCTATTTC CAGTATTGAA
GACGTCCGGC CGATATCTCG CATCGCGGAG GAATGGGGGT TCGACGATCC TTCAGCATTC
AGCCGGGCCT TTAAGCACGA GTTTGGAATG ACCCCTAAGG AAGTAAGGGA GGTGGGATGG
AACGGGGCTG CTGCGCACGT GCGCAGGGAG AGGTTTCGCG AAGGAGCCCC GACTACACTC
CGCCAACTCC TTCAAGGCAT AGCGTGA
 
Protein sequence
MNTMPASSVG RGIDVKFACS IDVSTQSYPQ QDQFEIFKNA HAGVADLTFC KSADGSFPAR 
QMVWILGSMV IVSSMLPGAG YAHEWRHLKK PALDNWYLWI PRRSVDQGVG ARTMPHLHCL
AKPFHAIVED EGACAIYFPS EGFVPASILD CLLDRSVQGA SGRLLTDYLM LLVRSLPDMT
VAEVPYVVEA TRNLVVACLA PSPDRVADAQ RPIAAVVLER AKRMITSRLV DRSLTPEAIC
CEIGISRSRL YRLFEPLGGV AAYIRHQRLV RTRSAISSIE DVRPISRIAE EWGFDDPSAF
SRAFKHEFGM TPKEVREVGW NGAAAHVRRE RFREGAPTTL RQLLQGIA