Gene Smed_0748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0748 
Symbol 
ID5321585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp801944 
End bp803143 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content60% 
IMG OID640789685 
Productaminodeoxychorismate lyase 
Protein accessionYP_001326439 
Protein GI150395972 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.309418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGACT CAAACGATAA CAGCGCGGTA CAATTCGGCC GCAATGAAAC CGGGAGCAAC 
GGGCCGATCA TTCCGAAGTC GGCCAATGAA GCGTTGCGTC CCGAGAGGGT TCCACACCCG
CCGAAGCGTT CACGCAAGGC GCGCAGCCAG GTCGTCATCT TTCTCAACTT CGTCATGACC
GCGGTTGTGT TCGTGGCGCT GGCGGCGGCC GGCGCCGTCT ATTATGCGAT GCATGAATAT
GAGAAGGCGG GGCCCCTGCA GGCGAACAAG AATTTCATCG TTCGCAGCGG CGCCGGCATC
AGCGAAATCG CCAGCAATCT GGAGCGCAAC GAGATCATCA CCGACAGCCG TGTTTTCCGC
TTCGTTTCGG AAGCTTATCT GAGCAATGAC ACGCTCAAGG CCGGCGAGTA TGAGATCAAG
GCGCATGCAT CGATGCAGGA GATCATGCAG CTCCTGAAAT CCGGAAAGTC GATCCTCTAT
TCCGTCTCGT TGCCGGAAGG TCTGACTGTG AAGCAGATGT TTCGCAAGCT TTCCGACGAC
CCGGTTCTGG TCGGCGACCT GCCGGCGGAA CTGCCGCCGG AAGGTTCTCT GAAACCGGAC
ACTTACAAGT TCACCCGAGG AACCGACCGT AACGAGATCG TCAAGCAGAT GATCGCTGCG
CAAAAGGCCC TGGTGCAGCA GATCTGGGAG AAGCGGGATC CGGACCTGCC CGTGTCAACC
ATCGAGGAAT TCGTCACTCT TGCCTCTATT GTGGAGAAGG AGACCGGTCG CGCGGACGAG
CGGCCGCGAG TGGCCTCCGT ATTCATCAAT CGCCTGGAAA AGGGCATGAG GCTGCAGTCG
GATCCGACGA TCATCTACGG CATCTTCGGT GGAGAAGGTA AACCCGCCGA CCGGGCTATC
CTGAGGTCCG ACCTCGACAA GCAGACCCCA TACAACACCT ACCTCATCAA GGGTCTTCCG
CCGACGCCGA TCGCAAATCC GGGTCGCGCC GCGCTTGAAG CCGTCGCCAA CCCGTCGCGC
ACGCCCGAAC TCTATTTTGT CGCCGATGGC ACTGGCGGAC ATGTCTTTGC CGAGACGCTC
GACGAACACA ACGCCAATGT CCGGCGCTGG CGCAAGCTCG AGGCGGAGAG GGCAGCGGAA
GCGGCCAAGG CCACCGAAGC GGCTGAAGAC GCCGTAACGC AGACTGGAAC GCAGCAGTAA
 
Protein sequence
MSDSNDNSAV QFGRNETGSN GPIIPKSANE ALRPERVPHP PKRSRKARSQ VVIFLNFVMT 
AVVFVALAAA GAVYYAMHEY EKAGPLQANK NFIVRSGAGI SEIASNLERN EIITDSRVFR
FVSEAYLSND TLKAGEYEIK AHASMQEIMQ LLKSGKSILY SVSLPEGLTV KQMFRKLSDD
PVLVGDLPAE LPPEGSLKPD TYKFTRGTDR NEIVKQMIAA QKALVQQIWE KRDPDLPVST
IEEFVTLASI VEKETGRADE RPRVASVFIN RLEKGMRLQS DPTIIYGIFG GEGKPADRAI
LRSDLDKQTP YNTYLIKGLP PTPIANPGRA ALEAVANPSR TPELYFVADG TGGHVFAETL
DEHNANVRRW RKLEAERAAE AAKATEAAED AVTQTGTQQ