Gene Smed_4604 details


Gene Information

Locus tag: Smed_4604
Symbol:
ID: 5318514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1105052
End bp: 1106107
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 64%
IMG OID: 640776404
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_001313336
Protein GI: 150376740
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
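
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and the function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 1105052
END_BP = 1106107
REPORTED_GENE_LENGTH = 1056    # bp
REPORTED_PROTEIN_LENGTH = 351  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1056 351
```

Under that assumption, both computed values agree with the reported Gene Length and Protein Length.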


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCACGTT CCGGACCCAA ATTGACGAGG ATAGCCAATA CACTCGGCGT CTCCGCGGCG 
ACCGTCTCCA ACGCGCTTTC CGGCAAGGGA CGGGTTTCGG CGGAACTTAC TGATAGAATT
CGGTCGATCG CTGCCGACCT CGGCTACGTG CCGAGCCAGG CGGGGAGGGC GCTGAGGACG
GGCAGGAGCG GCGTCATCGG GCTGGTTCTG CCGGATATCA GCAATCCTCT CTTTCCCCAG
ATCGCGCAGG CGATCGAGTT TGCGGCTTCT TCGGCAGGCT ATGGTGTTTT GATTGCCGAT
TCGCGCGGTG ACATCGCCAT CCAGACGAGG GCGATCGAAC GGCTGATAGA ACGCGGCGTG
GACGGTATGG TCATCGTACC AAGGCGCGGC ACCCGCATCG CCGATGTCGG CTGCCCGGTT
GCGGTCGTGG ACACCCCGTC CACGCCTGGC AATACGGTGG CCGCGGACCA TTGGGACGGC
GGAAGGCAGA TAGCCGATCA CCTCGTAGGC CTCGGCCACA GACATTTGCT GATCATCGGC
AACAATCCAG CCTCGAACGT GCAGAACGAC CGTGTCGGCG GGCTTCGCTC GGCCCTCCGG
GAAGACGTGT GCGCGGAGAC CCTATGGATC GAGCGGCTGG AAGAGGTGAA TGGCAAGGGC
TGTTCGCTCG GCCTTGCTGC GAAGGTAGCG GAGGGGGTAA CCGCCTTTGC CGCGATTTCG
GACCTGCATG CCCTGCGCGC GCTCACGGAA CTGCAGCGTG CCGGCATCCA TGTTCCGGAG
CAGGCAAGCG TCACCGGCTT CGACGACCTC ATCTGGTCGC CGGTGGTGAC GCCCGCCTTG
ACCACGATCC GTATGGACAT GGCGCGCATT GCCGCGATCG CGGTCGAGGC TCTGGTTCGG
GCGATCGGTG CCGAGGAGCC GGAACACCCG TCCGTAGGAG CGCCGGTTTG CGCGCCATTC
TCGAAAGTGC CCATGCAGCT CGTCGTCCGG CAATCCACCG CGACGCCGCC CACCACCGCG
ACGCCACCAT TTGTCACTCA AGGAGAACAG CCATGA
 
Protein sequence
MARSGPKLTR IANTLGVSAA TVSNALSGKG RVSAELTDRI RSIAADLGYV PSQAGRALRT 
GRSGVIGLVL PDISNPLFPQ IAQAIEFAAS SAGYGVLIAD SRGDIAIQTR AIERLIERGV
DGMVIVPRRG TRIADVGCPV AVVDTPSTPG NTVAADHWDG GRQIADHLVG LGHRHLLIIG
NNPASNVQND RVGGLRSALR EDVCAETLWI ERLEEVNGKG CSLGLAAKVA EGVTAFAAIS
DLHALRALTE LQRAGIHVPE QASVTGFDDL IWSPVVTPAL TTIRMDMARI AAIAVEALVR
AIGAEEPEHP SVGAPVCAPF SKVPMQLVVR QSTATPPTTA TPPFVTQGEQ P