Gene Smed_2599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2599 
Symbol 
ID5323467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2695628 
End bp2696659 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content57% 
IMG OID640791542 
ProductAraC family transcriptional regulator 
Protein accessionYP_001328264 
Protein GI150397797 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTCG TGAGTGACAA ACTTCAGGGT GATATGGACA AAACGAAGGA TATTCACGAA 
GTCGGCATCG TTGGCTACAA GGGGGCTCAG GCTGCCGCGG TGCTTGGAAT GACCGATCTT
CTGACCGCCG CAGACGGCTT TGCACGTAAA ATGCATGCCA TTGATCACCC CCTACTTCGC
GTGAGTCACT GGACACGCGA AGATGGGCGG GCTGCGCCTG AGCGGCTGTT CGATTCCGAC
CCTGGCATAG GTGGCAGCAG GCCAACCGTC ATCGTCATCC CTCCAGGGCT TGGCGATCCG
CTCCCCGAGC ACGAAGCGAA ATTCTACGCC GACTGGCTTC TTTCAGAACA TTCGAGAGGA
GCAGCTTTGT GCTCGATCTG CAAAGGAGCC TTCCTGTTGG GAGAGACCGG GCTTCTTGCG
GGCCGGACAG TGACCACTCA CTGGACCTAT GAGGAGCAGC TTGCCTCTCG ATTTCCCGAC
ATCAAGGTGA ACACCGACCG TCTGATCATA GACGATGGCG ACATACTCAC GGCCGGCGGC
GTGATGGCAT GGATCGATCT CAGTCTGATT CTGATCGAGC GTTTTCTCGG CCCGAACATC
ATGGTGGAAA CAGCAGGAGC TTTCCTGGTT GATCCACCGG GACGCAAACA AAGTTACTAC
AGGGGCTTCT CGCCACGTCT CAATCATGGT GACGATTCGA TCTTGAAGGT TCAGCACTGG
CTTCAACTCA CCGGCGGGAA AGAGATGAGA CTTGCGGCCC TCGCGGAGCA GGCAGGCCTT
GAACCGCGTA CCTTTATGCG GCGATTCCAG AAAGCAACCG GCCATACGGC AGGCGAATAT
GTTCAACGTC TGCGTATCAA CCGGGCACGT GACCTGCTCC AGCTGACACG CGATCCCATC
GATTCAATCG CCTGGGATGT TCACTACAGC GATCCCAGCG CCTTTCGACG AATCTTCACG
CGGATCATCG GTCTGAGCCC AACTGAGTAT CGCCGAAGAT TTCGCGCAGG GCCGAACGGA
AATGGGACTT GA
 
Protein sequence
MSVVSDKLQG DMDKTKDIHE VGIVGYKGAQ AAAVLGMTDL LTAADGFARK MHAIDHPLLR 
VSHWTREDGR AAPERLFDSD PGIGGSRPTV IVIPPGLGDP LPEHEAKFYA DWLLSEHSRG
AALCSICKGA FLLGETGLLA GRTVTTHWTY EEQLASRFPD IKVNTDRLII DDGDILTAGG
VMAWIDLSLI LIERFLGPNI MVETAGAFLV DPPGRKQSYY RGFSPRLNHG DDSILKVQHW
LQLTGGKEMR LAALAEQAGL EPRTFMRRFQ KATGHTAGEY VQRLRINRAR DLLQLTRDPI
DSIAWDVHYS DPSAFRRIFT RIIGLSPTEY RRRFRAGPNG NGT