Gene Smed_4014 details

Gene Information

Locus tag: Smed_4014
Symbol:
ID: 5318823
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 470282
End bp: 471484
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 60%
IMG OID: 640775822
Product: cystathionine gamma-synthase
Protein accession: YP_001312755
Protein GI: 150376159
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID:
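
The length fields follow from the coordinates. Assuming Start bp and End bp are 1-based and inclusive (which is consistent with the listed values), the gene spans End - Start + 1 bases, and an unspliced CDS ending in a single stop codon encodes one residue fewer than its codon count. A minimal Python sketch of that check; the function names are hypothetical and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(470282, 471484)
print(length_bp, protein_length(length_bp))  # prints: 1203 400
```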


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCCCA AAGAGCTCGA TTTTGCAAGC CAAGCCGTAT TCTACACGCC GGAAGAACAT 
TCCCAGTCGA TCAGCTATCC GATCTACATG TCGGCGAATT TCCAGTATGA CGGAGACATC
TACGATCAGA TCGTTGCCGG CGCGCGCAAG GAAGTGAACA TCTATTCACG CTGCGGCAAT
CCGACGGAAT ACAAGTTCGA AGAACACATC GCCAAGCTCA CCGGCGGCAC GGCCTGCCTG
GCGACCGCGT CCGGAATGGC GGCCATCTCG CATGCCTTGT TCGGCATCCT CAAGGCGGGC
GATCACATCG TCGCCGATCT GACGACCTAT TCGAGCACAC ACGAATTCTT CGATCACCGC
GCCCAGGATT TCGGACTGAA GGTGAGCCTG GTGGATTGCA CCGACGTGAG GGCGGTCGAG
AACGCGCTTA CGGAAGAAAC GAAGGTGCTC TACGTCGAGG CCATTGCCAA TCCGACGATG
AAGGTGCCGC CCCTGAAGGC GCTCGTGGAG CTGGCACATG CGCGCGGCAT CGTGGTGATC
TGCGACAATA CCTTCGCGTC GCCGGCAGTC TGCCGCCCGC ATGATTTCGG CGTCGACGTG
GTCGTAGAGA GCGCGACCAA GTTCATCGGC GGGCATAATG ACGCGGTGGG CGGGGTTATC
ACGTTGAAAT CCGATATCCT GCCGCCGGAC TGGCTCGAAG ACGTGCGTTG GAACACGCTC
AACAAACTGG GCGCCCCGCT TTCGCCCTTC AATGCATGGC TGCTTTTGCG CGGCGCGCAG
ACCCTGGCGC TGCGGCTGGA GAAGCAATGC GCAAATGCGC TGGCGCTCGC GAAGCATCTC
GAGGCCCATC CGAGGGTCAG GCGCGTGTTC TATCCGGGCC TGCCTTCGCA CCCGAACTAC
GCATCCGCCA AGGAGCAGTT GCGCGGCGGC GGCGCAATGC TCTCGTTCCA GGTTGATGAC
GAAGCGTCCG GTGTCCGGCT TCTAAAGCGT TTGCAGCTTT GTTCCTTCGC CGCGAGCCTC
GGGGGGCTCA GAACGACGAC GCAGGTGCCC GCCACCATGG CCTTCCTGGA TATTCCGAGC
CAGGAAAGGG AGGCGATGGG CGTCGTCGAC GGTCTCGTGC GCTTTTCGGT CGGCATCGAG
CATATCGACG ACATCATCGC CGATGTGGAT AGTGCGATAG ACCAGATGCA TATGGAGATC
TAA
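
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spacing has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied with its original spacing.
first_line = "ATGTCCCCCA AAGAGCTCGA TTTTGCAAGC CAAGCCGTAT TCTACACGCC GGAAGAACAT"
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence text through `clean_sequence` and then `gc_percent` gives a value that can be checked against the listed GC content.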
 
Protein sequence
MSPKELDFAS QAVFYTPEEH SQSISYPIYM SANFQYDGDI YDQIVAGARK EVNIYSRCGN 
PTEYKFEEHI AKLTGGTACL ATASGMAAIS HALFGILKAG DHIVADLTTY SSTHEFFDHR
AQDFGLKVSL VDCTDVRAVE NALTEETKVL YVEAIANPTM KVPPLKALVE LAHARGIVVI
CDNTFASPAV CRPHDFGVDV VVESATKFIG GHNDAVGGVI TLKSDILPPD WLEDVRWNTL
NKLGAPLSPF NAWLLLRGAQ TLALRLEKQC ANALALAKHL EAHPRVRRVF YPGLPSHPNY
ASAKEQLRGG GAMLSFQVDD EASGVRLLKR LQLCSFAASL GGLRTTTQVP ATMAFLDIPS
QEREAMGVVD GLVRFSVGIE HIDDIIADVD SAIDQMHMEI
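
The protein sequence can be reproduced from the gene sequence by codon translation. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may initiate translation; since this gene starts with ATG, the simplified sketch below ignores alternative start codons. The table layout and function name are illustrative assumptions, not the database's own method:

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering used by NCBI.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with spacing removed.
print(translate("ATGTCCCCCAAAGAGCTCGATTTTGCAAGC"))  # prints: MSPKELDFAS
```

The output matches the first block of the protein sequence above.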