Gene Smed_3640 details


Gene Information

Locus tag: Smed_3640
Symbol:
ID: 5318182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 75040
End bp: 76107
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 60%
IMG OID: 640775453
Product: regulatory protein LacI
Protein accession: YP_001312386
Protein GI: 150375790
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
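
The start, end, and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 75040
end_bp = 76107

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: exactly one terminal stop codon, excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```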


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACG GTGGCAGGAA AAAGGCGACA ATCTATGATC TCTCGGTGCT ATCCGGGAGT 
TCGCCTTCGA CGGTAAGTGC GGTACTGAAC GGCACGTGGC GCAAGCGGCG GATAAGGGAA
AGCACCGCAG AGCTCATCCG GAGCCTTGCC GAAACGCATC AGTACACTGC AAATCGTCAG
GCGCGGGGCT TGCGCAGCTC CCGTTCCGGC CTGGTAGGGC TGCTCCTGCC CGTTCACGAC
AACCGCTATT TTTCTTCGCT TGCCCAGACC TTCGAAGCGC ATGTGCGAAG CAAGGGTCAG
TGTCCAATTG TCGTCAGCGC CAGCCGCGAC CCGGAAGAGG AACGCAGGAC GGCCGAAACG
CTGATCTCCT ATTCCATCGA CGAATTGTTC ATATGCGGCG CGACGGATCC CGACGGCGTT
CACGAGGTCT GCGAAGCGGC AGGGCTGAAG CACATCAACA TCGATCTGCC GGGGACGAAG
GTCCCATCCG TCATCAGCGA CAATTTCGAA GGCGGCCGTA TTCTGACCGA AGCAATCATC
CGCCACTTCC CTGCCGACCG GCCGCTCGCG CCCGAGGATC TCTATTTGTT CGGTGGTCGT
GATGATCATG CCACCCGCGA GCGCATCCGC GGCTTTCGTG CCGCAAAGAA GGAGTTGCTC
GGGGGCGATC CGGATGAATG CGTATGGCCC ACCGGTTATG CGGCAGACAA TGCGCGGAAG
GCCTTCGATG CCTTTTACGA ACAGCGGGGG AAACTTCCGC GCGGGTTCTT CGTCAATTCC
TCGATCAATC TAGAGGGACT GCTGCGTTTC ATGGCCGAGC ATCCGCTCGA GAATTTCAAG
GATCTCGTCG TCGGCTGCTA CGACTACGAT CCATTCGCAT CCTTCCTCCC CTTCCCCGTC
ATCATGATAA GACAAGATGT CGAGGGAATG ATCGCCAGGG CCTTTGAGGT GATCGAGGAG
CCGCGGGCGT CGGTCCAGAT TCATTTGGTG AAACCGAGGC TCGTGCCCCC GAGAACGGCG
CTGACCGGCC CCCTCGACGC GCTAATCGAC AGCGACATGC CGCGGTAA
 
Protein sequence
MTNGGRKKAT IYDLSVLSGS SPSTVSAVLN GTWRKRRIRE STAELIRSLA ETHQYTANRQ 
ARGLRSSRSG LVGLLLPVHD NRYFSSLAQT FEAHVRSKGQ CPIVVSASRD PEEERRTAET
LISYSIDELF ICGATDPDGV HEVCEAAGLK HINIDLPGTK VPSVISDNFE GGRILTEAII
RHFPADRPLA PEDLYLFGGR DDHATRERIR GFRAAKKELL GGDPDECVWP TGYAADNARK
AFDAFYEQRG KLPRGFFVNS SINLEGLLRF MAEHPLENFK DLVVGCYDYD PFASFLPFPV
IMIRQDVEGM IARAFEVIEE PRASVQIHLV KPRLVPPRTA LTGPLDALID SDMPR
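
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates that on the first 30 bases and the last 9 bases of the gene sequence. It assumes the sequence is given 5' to 3' on the coding strand, and it uses the standard codon assignments, which translation table 11 shares for amino acids (the two tables differ in their permitted start codons). The function `translate` and the constant names are hypothetical helpers written for this example, not part of any library.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_30 = "ATGACGAACG GTGGCAGGAA AAAGGCGACA"
# Last 9 bases of the gene sequence, copied from the record.
last_9 = "CCGCGGTAA"

print(translate(first_30))  # first ten residues of the protein
print(translate(last_9))    # last two residues, then the stop codon
```

Passing the full 1068-base gene sequence to the same function would be the way to compare against the complete 355-residue protein sequence.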