Gene Smed_1433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1433 
Symbol 
ID5322285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1514091 
End bp1515116 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content62% 
IMG OID640790376 
ProductAraC family transcriptional regulator 
Protein accessionYP_001327114 
Protein GI150396647 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000646594 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCATGG TTGCGAACAG GATTGCACAA TCTACGATCG AGGTCGCGGT GATTATCCTG 
CCCGAGTCGT CGATCATGTC CCTTGCTTCC GTCCTCGATC CGATGCGCGC GGCAAACCGG
GTGACCGGGC ACGAGGTCTT CCGGTGGCGG TTGCTCTCGG CCGATGGTGA CGCGGTGATG
CTCACCTGCG GCTTATCTAT TCCGGTGGAT GCTCGGTTCG CCCTGCCGAT CGTCGGGGAT
CTCCTTCTCA TCATCGGCGG GTTCAATCTC GAGAGATATG CAGGCAAGCG CTTCCTCGCT
ACTCTGCAGG AATGCGCGCG GCATTTCGAT ATCGTCGCGG GCGTTGAGTC CGGGTGTTGG
CTGCTCGGGC GTTCCGGGCT TATCAAAGGC CGCAAGGCAA CCGCCCACTG GGAGGAGCTC
GAGGATTTCA GTCGGGCATT TCCCGAGCTG CAGGTGATTG GGGACCGTTT CGTGACCGAC
GGCAAGTACT GGACCTCCGG TGGCGCGTCG CCGACTTTCG ACATGATGCT GCACCTCATT
GCGGAGAGGC TGGGGCCGGC TATCGCGCTG GATGTAGCGA GCATTTTTGT CTACGATCAG
ATGCACAGTC CCACGGACGT ACAGCCCTTC GTCTCGCTCG GCCGCATGGA AGCACGAGAT
CCGGAGCTCG CCGCGGCCAT AAGGCTGATG GAGCGCACAC TCGAACGGCC GATGACGGTC
GCGGCGCTTG CGCGCCGGCT ATCCGTCTCA CAACGCAAGC TTGAAATGCT CTTCGCCAAA
GGCCTCTCGA CCAGCCCGGG CGCCTATTAC CTGCGCCTCA GGCTGCAGGT CTCCCACCGG
CTCGTTCGCG ATACCGGAAT TCCAATGCGG GATGTCGCCC TTCGCTGCGG GTTTGACAGT
CTCTCGGCCT TTTCGCGTGC CTACCGACGC GAATACGGGA CGAGCCCTAC GGGTATGCGC
AGCGCACGCA GCGCAAGCGT CGCTTCTGAG ATGTCAGATG AAGGCGGACG CCACGCGCGC
CCCTGA
 
Protein sequence
MRMVANRIAQ STIEVAVIIL PESSIMSLAS VLDPMRAANR VTGHEVFRWR LLSADGDAVM 
LTCGLSIPVD ARFALPIVGD LLLIIGGFNL ERYAGKRFLA TLQECARHFD IVAGVESGCW
LLGRSGLIKG RKATAHWEEL EDFSRAFPEL QVIGDRFVTD GKYWTSGGAS PTFDMMLHLI
AERLGPAIAL DVASIFVYDQ MHSPTDVQPF VSLGRMEARD PELAAAIRLM ERTLERPMTV
AALARRLSVS QRKLEMLFAK GLSTSPGAYY LRLRLQVSHR LVRDTGIPMR DVALRCGFDS
LSAFSRAYRR EYGTSPTGMR SARSASVASE MSDEGGRHAR P