Gene Smed_1373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1373 
Symbol 
ID5322223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1452527 
End bp1453774 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content60% 
IMG OID640790314 
Productthreonine dehydratase 
Protein accessionYP_001327054 
Protein GI150396587 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR02079] threonine dehydratase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.293609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGG ATGTTGAAAA AGCGGCAGCG GCGATGCGCG AGATCTTTCC GCCGACGCCG 
CTTCAACTCA ATGAGCATCT GAGCGCACGC TATGGTGCAA CCGTCTTCCT GAAGCGCGAG
GATCTTTCGC CGGTGCGGTC CTACAAGATC CGAGGCGCAT TCAATTTCTT CCGGAAATCA
CTCGGCTCCG GAGCGGCAGG GAAGACGTTC GTCTGCGCTT CCGCTGGCAA TCACGCGCAG
GGCTTTGCTT TCGTCTGCCG GCACTTCGGC GTTGCAGGCG TGGTCTTCAT GCCCGTGACG
ACGCCCCAGC AGAAGATCGA CAAGACGCGC ATTTTCGGTG GCGAGTTCAT CACGATCCGC
CTCGTCGGTG ACATCTTCGA TCAATGCTAC CACGCGGCGC GCGAGCACGT GGAGGCGATC
GGCGGCGTCA TGGTGCCACC GTTTGACCAC GGCGATATCA TCGAAGGCCA GGCCACGGTC
GCAGCGGAAA TAGCCGAGCA GATGCCGGTG GATCTTACCG CCGACCTCGT CGTCCTTCCC
GTCGGCGGCG GTGGATTGGC CGCGGGCGTT ACCGGCTTTC TTGGCGACCG CCTCGCGGCC
GATTGCTTTC TGTTTTGCGA GCCGGAGGGC GCGCCGAGTT TGAGGCGCAG CCTGGAATTC
GGCAGCGTCG TCACTCTCGA TCAGGTGGAC AACTTCGTCG ATGGTGCCGC CGTTGCACGG
ATCGGGGATC TGAACTTTGC CGCGCTGCGT GAATTTTCGC CGGAACAGGT GATGCTGCTG
CCGGAAAATG CAATCTGCCT GACGATCACC GAAATGCTGA ATGTCGAGGG CGTGGTGCTC
GAGCCAGCGG GCGCTCTTGC AATCACCGCC CTGGAAACGC TCGGACGCGG AGGCTTGGAG
GGCAAAATTG TCGTGGCGGT CGTTTCCGGG GGCAATTTCG ACTTCGAGCG TCTTCCGGAC
GTGAAGGAGC GCGCCATGCG ACACGCAGGG CTCAAGAAGT ACTTCATTCT GCGCATGGCG
CAGCGCCCGG GTGCGCTTCG CGATTTCCTC AGCCTGCTCG GAGAAGAGGA CGACATTGCC
CGTTTCGAAT ATCTGAAGAA GTCGGCGCGA AACTTCGGCT CCGTGCTTAT TGGCATCGAG
ACGAAGCATG CGGAGAATTT CCCCATCCTC AAACAGCGTT TCGATGCGGC GGGGCTGCGC
TACCAGGACA TTACCGACAA CGAAATGCTC GCCAATTTCC TCATTTGA
 
Protein sequence
MKQDVEKAAA AMREIFPPTP LQLNEHLSAR YGATVFLKRE DLSPVRSYKI RGAFNFFRKS 
LGSGAAGKTF VCASAGNHAQ GFAFVCRHFG VAGVVFMPVT TPQQKIDKTR IFGGEFITIR
LVGDIFDQCY HAAREHVEAI GGVMVPPFDH GDIIEGQATV AAEIAEQMPV DLTADLVVLP
VGGGGLAAGV TGFLGDRLAA DCFLFCEPEG APSLRRSLEF GSVVTLDQVD NFVDGAAVAR
IGDLNFAALR EFSPEQVMLL PENAICLTIT EMLNVEGVVL EPAGALAITA LETLGRGGLE
GKIVVAVVSG GNFDFERLPD VKERAMRHAG LKKYFILRMA QRPGALRDFL SLLGEEDDIA
RFEYLKKSAR NFGSVLIGIE TKHAENFPIL KQRFDAAGLR YQDITDNEML ANFLI