Gene Smed_4035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4035 
Symbol 
ID5318335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp495427 
End bp497301 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content65% 
IMG OID640775843 
ProductFis family GAF modulated sigma54 specific transcriptional regulator 
Protein accessionYP_001312776 
Protein GI150376180 
COG category[K] Transcription
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3284] Transcriptional activator of acetoin/glycerol metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTTCAC ATTCCGAACA TATCCGCGAA ATTGAGCGCG TTGGCCGTGG CCTTCCGGTC 
AGCCGCGACG AAACGGTCGT GAAATCCTGG ATGCGCTGCC TCGAGCATCA CCGGCTCGAC
CCGGCCCAGA GTTGCGAAGC CTATATCGTG CCTGAAACCC GGCTCAAGGA ACACCGGCAG
CAATCGGAGG ACCTGATCGG CATCGCTCGC TCGGGGCTGG AGCACCTGTT CCGCCAGGTT
GCCGGGCAGA ACTACGTGCT GCTTCTCTCC GACCGGGAAG GCGTGACCGT GGAGTTCCTT
GGAGACCCTC TCTTCAACAA CAGCTTGCGA AAGGCCGGGC TTTATCTCGG CTCGGAGTGG
TCCGAATGGC GCGCCGGTAC CTGTGCGGTC GGCGCATGCC TGGAAACGGG AGAGGCGCTG
ACGATCCATC AGACCGACCA TTTCGACAAT ACCCACACCC CGCTCTCCTG CACGGCGGCA
CCGATTTATG ACAACAAGGG CGAACTCTCC GCGGTGCTGG ATATCTCGCT GCTGAGTTCG
CCGATCCTCA AGGCCAGCCA GAACCTCGCG CTGCATCTGG TGACCGCCAC CGTGAGACGC
ATAGAGCTTG CCAACCTCAT GGCGCAGGCG CGAAACCAGT GGGTGCTGCG GTTCTCGCGC
TCACCCGAAT TTCTCGATGT CGATCCGGAG GCGGCCATCT CGATCGACGG CCTCGGCCGG
ATCGCCGGAA TGACGCATGG GGGAGCCAAA ATACTCGCAC GCTCGACCGG GCTGGACTGG
CGCGATCCCC GGAAACTGAT CGGCGAGCCG GTGTCACGTT TCTTCGATAT CGAGGTCGAC
GATCTTTCCG ACCTCACCCG GCGCCGCCCG ACCCAGGAGC GCCTCGTCTT CGCCCGCGAC
GGAAATGCGC TCTTCGCCCA TGCGATCGAG CCGCATTCGA CCGTACGTGC GCCAGTCGTC
TCGCGCGAGC AGATTCCGCC GGCCTTGCGG CGGCTCGGCG GCGACGCGCC GGTGATCGCC
GCGCTGCAGG CAAGGGCCGC GAAGCTCGCG CGTACCGGGC TGCCGATCCT CGTTCAGGGA
GAGACGGGGA CCGGCAAAGA GCATCTTGCG CGCGCTATCC ACGAGGGCAG CGGTCTGAAG
GGGCAGTTCG TCGCGATCAA CTGCGCGGCG ATCCCCGAGC AATTGATCGA GAGCGAGCTT
TTCGGCTACC TGCCGGGCGC CTTTACCGGC GCTTCGGCGA AGGGGCGCAA GGGCCTGATC
GAGCAGGCGG ACGGAGGAAC GCTCTTCCTA GACGAGATAG GCGACATGCC GCTTGCCCTG
CAAAGCAGGC TGTTGCGCGT ACTTGCCGAG GGCGAGGTCC TGCCGGTCGG CGGCACCGTG
CCGCGAAAGG TCCGGATCCG GGTCGTGTCG GCATCGCACC GGCCACTGCA GACGCTTGTC
GCGCAAGGGG CGTTCCGCGA GGATCTCTAT TACCGGCTCA ATGCCGCGAC CCTCTCGATT
CCGGCGCTCA GGGACCGGCC GGACTTTGAC TGGATACTCG AACAGCTGCT GAAGCGGCAC
GGCGACGGTG AGTTGATATT GTCGGAGGCC GCGCTCGCGG CGCTCAAGGC GCATGATTGG
CCGGGAAACA TCCGCGAACT CGACAATGTC GTCGCCGTGG CTGCAGCGCT CGCCGAGAAC
GGCGTCGTGG AGATCGGCGA CCTGCCGGAC CACCTCCTGG TGAATGCGGA CACCGTCGGC
GGAAGCGAGG CGGGGGCGGC CTTGAGCCTT ATGCTCGCCA CTTGCGACTG GAATATTTCC
GAGACCGCCC GCCGGCTGGG GCTCGATCGC TCGACGGTGC ACCGGCAGAT CAAGCGCTAC
AGCCTCAAGC GCTGA
 
Protein sequence
MFSHSEHIRE IERVGRGLPV SRDETVVKSW MRCLEHHRLD PAQSCEAYIV PETRLKEHRQ 
QSEDLIGIAR SGLEHLFRQV AGQNYVLLLS DREGVTVEFL GDPLFNNSLR KAGLYLGSEW
SEWRAGTCAV GACLETGEAL TIHQTDHFDN THTPLSCTAA PIYDNKGELS AVLDISLLSS
PILKASQNLA LHLVTATVRR IELANLMAQA RNQWVLRFSR SPEFLDVDPE AAISIDGLGR
IAGMTHGGAK ILARSTGLDW RDPRKLIGEP VSRFFDIEVD DLSDLTRRRP TQERLVFARD
GNALFAHAIE PHSTVRAPVV SREQIPPALR RLGGDAPVIA ALQARAAKLA RTGLPILVQG
ETGTGKEHLA RAIHEGSGLK GQFVAINCAA IPEQLIESEL FGYLPGAFTG ASAKGRKGLI
EQADGGTLFL DEIGDMPLAL QSRLLRVLAE GEVLPVGGTV PRKVRIRVVS ASHRPLQTLV
AQGAFREDLY YRLNAATLSI PALRDRPDFD WILEQLLKRH GDGELILSEA ALAALKAHDW
PGNIRELDNV VAVAAALAEN GVVEIGDLPD HLLVNADTVG GSEAGAALSL MLATCDWNIS
ETARRLGLDR STVHRQIKRY SLKR