Gene Smed_3561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3561 
Symbol 
ID5324449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3765994 
End bp3767862 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content65% 
IMG OID640792510 
Productthiamine pyrophosphate protein central region 
Protein accessionYP_001329211 
Protein GI150398744 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3962] Acetolactate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.238424 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGA AAACCGTGCG CCAGGGTACA GTGCGCTTGA CCATGTCGCA GGCCGTGGCG 
CGGTTCCTGA CCCGGCAGAT GACGATCATC GAGGGCGAGC GCGTGCCGAT TTTTGGCGGC
GTCTTCGCGA TTTTCGGTCA CGGTAACGTC GCCGGGGTCG GGGAAGCGCT CTATGCCGTG
CGCGAAACCC TTCCGACCTA CCGGGCCCAG AACGAGCAGG GCATGGCGAA CGCGGCGATC
GCCTTCGCCA AAGCGAGCTT CCGCCGCCGC TTCATGGCAT GCACGACGTC CATCGGCCCG
GGCGCCCTGA ATATGGTGAC GTCGGCCGCG CTCGCACACG TCAACCGGCT GCCGGTGCTC
CTCCTGCCCG GCGACATCTT CGCCAATCGC CGGCCGGACC CGGTCTTGCA GCAGGTAGAG
AGTTTTGGCG ACGGGACGAT CTCGGCGAAC GACTGCTTCC GCCCGGTCTC GCGTTACTTC
GACCGCATCA CGCGACCGGA GCAGATCATT CCGGCGCTTC GCCGCGCCAT GCAGGTGCTT
ACAGACCCGG CCGATTGCGG CCCGGTGACG CTTTCGCTCT GCCAGGACGT TCAGGCGGAA
GCCTATGACT ACCCGGAATC GTTCTTCGAC GAAAAAGTCT GGGTACCGCG CCGCGTCGAG
CCCGATCTCG ACGAACTGGC CGCGGCGATC GAGACGCTGA AGGTTGCCAG GAAGCCGATC
ATCATCGCAG GCGGAGGCGT GCTCTATTCG GAAGCGAGCG CCGACCTTGC CGAGTTCGCC
GAAAAACACG GCATTCCGGT CCTCGAGACG CAGGCCGGCA AGTCCGCCCT GCCGCACGCG
CATCCGCTGA ACATGGGTTC GGTCGGCGTC ACCGGTACCT CTGCCTCCAA CGCCTTGGCG
GAAGAGGCGG ACGTGGTGCT CGCCGTCGGC TCGCGGTTGC AGGATTTCAC TACCGGCTCC
TGGGCGCTTT TCAAGAATGA AGCGGTGAAG ATCATCGGAC TCAACGTCCA GCCCTTCGAC
GCCGGCAAGC ACGATGGACA GCCTTTGATC GCCGATGCGC GGGCCGGCCT CAACCGCATC
TCGGGCGGGC TCGGCAGCTA CAGCGCCGAC AGCGCCTGGA CAGAGAAGGC GAGGGCCGGA
AAGGCCGAGT GGCTTGCTGC AGCCGACAGG GCGACGGCCA CCACCAATGC GGCGCTTCCC
TCCGATGCGC AGGTCATCGG CGCCGTGCAA CGCGCCCGTG GCGGGAGGCA AACGACACTG
GTCTGCGCCG CCGGCGGGCT GCCCGGCGAG CTGCACAAGC TCTGGCAGGC GGAGTCTCCA
GGCAGCTACC ATATGGAATA TGGCTTCTCG ACCATGGGCT ATGAGGTCGC TGGCGGCCTT
GGCGTGAAAC TTGCCAAGCC CGAAAGTGAC GTGATCGTCA TGGTCGGAGA CGGCAGCTAC
ATGATGCTGA ACTCCGAGAT CGCCTCTTCG GTCATGCTCG GTGCCAAGCT TACGATCGTG
CTGCTCGACA ATGCCGGCTA TGGCTGCATC AACAGGTTGC AGATGGGTAC GGGCGGCGCC
AACTTCAACA ACCTGTTGAA GGACACACAT CACGTGGCGC TGCCGCAGAT CGACTTCGCC
GCCCACGCCG CCGCCATGGG CGCGGTCACC CGAAAGGTGG GATCGATCCC CGAACTCGAA
GCGGCGCTTG CCGAAACGGC AGACGAGGCT CGCACGACCG TCATCGTCAT CGATACCGAT
CCGCTGATCA CGACGGAAGC CGGTGGGCAC TGGTGGGACG TCGCGGTCCC GGAGGTTTCG
GACCGCGACC AGGTAAAGGC CGCCCGCGAA GATTACGAAA ATGCCCTCCG GTCACAGCGG
TTTGGTTGA
 
Protein sequence
MTQKTVRQGT VRLTMSQAVA RFLTRQMTII EGERVPIFGG VFAIFGHGNV AGVGEALYAV 
RETLPTYRAQ NEQGMANAAI AFAKASFRRR FMACTTSIGP GALNMVTSAA LAHVNRLPVL
LLPGDIFANR RPDPVLQQVE SFGDGTISAN DCFRPVSRYF DRITRPEQII PALRRAMQVL
TDPADCGPVT LSLCQDVQAE AYDYPESFFD EKVWVPRRVE PDLDELAAAI ETLKVARKPI
IIAGGGVLYS EASADLAEFA EKHGIPVLET QAGKSALPHA HPLNMGSVGV TGTSASNALA
EEADVVLAVG SRLQDFTTGS WALFKNEAVK IIGLNVQPFD AGKHDGQPLI ADARAGLNRI
SGGLGSYSAD SAWTEKARAG KAEWLAAADR ATATTNAALP SDAQVIGAVQ RARGGRQTTL
VCAAGGLPGE LHKLWQAESP GSYHMEYGFS TMGYEVAGGL GVKLAKPESD VIVMVGDGSY
MMLNSEIASS VMLGAKLTIV LLDNAGYGCI NRLQMGTGGA NFNNLLKDTH HVALPQIDFA
AHAAAMGAVT RKVGSIPELE AALAETADEA RTTVIVIDTD PLITTEAGGH WWDVAVPEVS
DRDQVKAARE DYENALRSQR FG