Gene Smed_2793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2793 
Symbol 
ID5323663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2913150 
End bp2914526 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content61% 
IMG OID640791738 
Productpeptidase S41 
Protein accessionYP_001328458 
Protein GI150397991 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.227908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.76429 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCCAG GATGGCTCGC AATCCCTCGA ACTGCCGCGA TGGCCCTCAC GTTAGCTGTT 
ACCGGTTCTG CGCAGGCGGC TGATGTTATT TTGACGCCGG CCCAGATGCG AGAGGATCTT
ACATTCTTAG TAGAGAAATG GGCGCCGCTC GATAAAAGCT TCAGCGACAG CCAGAGGGCG
GAGTTCGGCC GGCATGTCGA TGAGACCGCG TCTGCGGCGG AAAATCTGGC GCCGGAAGAG
TTCGCGCTTG AAGTGATGAA AGCAGTGGCC ATCGCGCGCA ACGGACACAC CAACGCCAAT
GTCGGGACGT TGCTTGGCGC CGACCTTCCC GTGCGCGTGT GGACGTTTTC CGATGGGCTT
TATATCGTCA AGACTCACCC CGATTTCAAA CGGTTGCTTG GCGCGCGCGT CGTCGGGATC
GGCTCCCTGA CCACGCAGGA AGCGCTCGAC CGGGTGCGTG CCTACCTTCC CGGCACGGAC
CAGCGGATTC GTTTTCTCTC GCCCGGCTAT CTGGTTGCGC CCGCCGTCCT CGAGCGGATC
GGCGCCGTCG ACGGCACGGA CCGCATTCCG TTGACCTTGC AGTTTGACGA TGGACGGACG
GAGATCGTCG AGCTTGCGCC TATGAAAGGC GAAGATCCCG GCGATGAGCG CAAGGCAAGC
CTGAACAGGG GCTATTCCGT GCTTATTCCC GACGATCCGG ACTTGCCGGG ACGCTGGCGC
CATCTGCTCG ACGGCCGCAA GGAGCTGCCG CCGATCTACG GCAAGCGCAG CGACGTGCAG
GCGCGCTTCC TCGATGATCA CATGAAGGTG CTCTACATCC GCAACGACAC GGCAAGAAGC
ATCGACGAAA CGCCCCTGCC GGACAAGTTG GCCGGCGTTG TCCTCGACAA AATCGTGCCG
ATGCAGCCCA AGCACATCAT CGTCGACCTC CGCCTGAACA ATGGCGGCGA CTTCTTCAAC
ACGATCCTGT TTTCCCGGGC GATACCTCGA CTTGTGCCGC GCGACGGACG TGTGTTCGTG
CTTGTTGGGA GAGCAACCTT CTCTGCTGGC ATAACGACCG CCGCCATGCT CAAGGGAGAG
GGAGGCGACA AGGTCACCCT TATCGGCGAA CCGATGGGGG ACGGCGGCCA GTTCTGGTCC
GAGGGCAAGT ATGTCAAGTT GCCAAACTCC CGGATCGCTG TTCGCTACAG CCCGCAATTT
CATGACTACG AGACCGGCTG CTTCGACATC GACGACTGCT ATTGGGCGAC TGTCGCCTTC
GGCCCGCGCG GTATTTCGAT CGCGCCGGAA ATCACGGTTG AGCTGAGTTT CAGGGCTTAT
GCGGAGGGGC GAGACCCTGT CCAGGAGAAA GTGCTGGACT TGGCCGGCAG GCACTAA
 
Protein sequence
MSPGWLAIPR TAAMALTLAV TGSAQAADVI LTPAQMREDL TFLVEKWAPL DKSFSDSQRA 
EFGRHVDETA SAAENLAPEE FALEVMKAVA IARNGHTNAN VGTLLGADLP VRVWTFSDGL
YIVKTHPDFK RLLGARVVGI GSLTTQEALD RVRAYLPGTD QRIRFLSPGY LVAPAVLERI
GAVDGTDRIP LTLQFDDGRT EIVELAPMKG EDPGDERKAS LNRGYSVLIP DDPDLPGRWR
HLLDGRKELP PIYGKRSDVQ ARFLDDHMKV LYIRNDTARS IDETPLPDKL AGVVLDKIVP
MQPKHIIVDL RLNNGGDFFN TILFSRAIPR LVPRDGRVFV LVGRATFSAG ITTAAMLKGE
GGDKVTLIGE PMGDGGQFWS EGKYVKLPNS RIAVRYSPQF HDYETGCFDI DDCYWATVAF
GPRGISIAPE ITVELSFRAY AEGRDPVQEK VLDLAGRH