Gene Smed_5885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5885 
Symbol 
ID5320187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp848915 
End bp849895 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content62% 
IMG OID640777580 
ProductDeoR family transcriptional regulator 
Protein accessionYP_001314512 
Protein GI150377917 
COG category[K] Transcription 
COG ID[COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.873951 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATCC GCGTCAACGA CCAGATCATT CACAAGGCCG CCTGGCTTTA CTATACGCAC 
GGCCTTCGCC AGGACGAAGT TGCCCGGAGG CTGGAAATCT CGCGTGCTTC CATCGCCATG
TATCTGCGGC GCGCCCGGGA GATGGGTATC GTCACGATCA CCACCTCATC GGAACTCTTC
TCCAGTGACG TTCTCGCCCG GGAACTCGAA GACGCCACGG GACTGACGAC CGCTTGGATC
GTCCCGGAAG ATCGGCAAGC GATGAACCCG GCTGCGGAGG TTCCGGAGGT CGCAGCCTCG
GTCTTTCTGG AGCTGATCAA CAAGGGCGAC CGAATTGGGG TGGCATGGGG CCGCACCGTA
TATCATATCG CCGACGTCAT GCCCTTCGCA GACCTCAAGG GCGTCACCGT CGTGCAGCTT
TGCGGCAATC TGGGCGCACC CTATTCCTAC CGCCCCGATC AGTGCACCAC CGAAATTGCG
CGTCGCCTCA ACGCCGAGGG CGTCAATATC TACGCACCCC TCGTTCTCTC TTCAGAGCGG
CTTGCTGAGG AACTGCGCGC CGAGCCGGTC ATTCGGGAGC AGCTCGCAAC CATTTCCGAC
TGCCGGCTTT CGCTCTACTC CGTCGGAGGA ATCGAGGACG ACAGCCATCT CGTCAAATGC
GGCGCCCTTT CGGCCGACGA GATGCATGCC ATGGGCGAGA GGGGCGCGGC CGGAGTGATC
GCCGGGCAGA TCATCGATCA CAACGGTCAA TGGATGGATT GCGCGCACAA TCGGCGCTGC
ATCTCCGCCG ATCTCAATTC CATCCGCGCG ATCAGGAAGC GCATGCTCGT CGTGCAGGAG
GAAAACAAGT TTGAACCCCT GTTGGCCGCT CTGAAGGGAG GCTTCGCCTC GCACCTCGTC
GTCACCGCTT CGATGGCGCG GCGGATCATG GATCGCTGGA GCCGAGACGG GCTTGGCAGG
AGTGCCCCTG CCAAGCCCTA G
 
Protein sequence
MPIRVNDQII HKAAWLYYTH GLRQDEVARR LEISRASIAM YLRRAREMGI VTITTSSELF 
SSDVLARELE DATGLTTAWI VPEDRQAMNP AAEVPEVAAS VFLELINKGD RIGVAWGRTV
YHIADVMPFA DLKGVTVVQL CGNLGAPYSY RPDQCTTEIA RRLNAEGVNI YAPLVLSSER
LAEELRAEPV IREQLATISD CRLSLYSVGG IEDDSHLVKC GALSADEMHA MGERGAAGVI
AGQIIDHNGQ WMDCAHNRRC ISADLNSIRA IRKRMLVVQE ENKFEPLLAA LKGGFASHLV
VTASMARRIM DRWSRDGLGR SAPAKP