Gene Smed_3889 details


Gene Information

Locus tag: Smed_3889
Symbol:
ID: 5318683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 346824
End bp: 347975
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 65%
IMG OID: 640775701
Product: glycosyl transferase group 1
Protein accession: YP_001312634
Protein GI: 150376038
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
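
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both are assumptions, though they agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 346824
end_bp = 347975
stop_codon_nt = 3  # one terminal stop codon, not counted in the protein

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# One codon (3 nt) per residue, after dropping the stop codon.
protein_length_aa = (gene_length_bp - stop_codon_nt) // 3

print(gene_length_bp, protein_length_aa)  # 1152 383
```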


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0066878
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTCC ATGCCCTTTC CGGCTCAAGA CAAGAGGTCC GCCACGACGC CCCGAGCCCG 
TTGCCGCGGC GGATATTGAT GACCGTCGAT GCGGTGGGCG GTGTCTGGCG CTATGCGGTG
GATCTCGCCG AAGCCATGCG CCGTTCCGGC GTCGAGACGC TGATCGTCGG CTTCGGCCCG
GCCCCTTCCC CCGAGCAACG GCGCGAGGCG GAGAGCATCG GCCGGCTCGA ATGGTCGGAT
CAGCCGCTCG ACTGGATGGT GGAAGATGAA AGTGAACTCT GCGGCGTTCC GGACCTCTTG
TCCGAACTGG CGCTCAAGCA CTCCGTCGAT CTCATGCACC TCAACCTGCC TTCCCAGGCT
TCGGGAATAA CGGCTGAGAT TCCCGTCGTC ACCGTTTCGC ATTCCTGCGT AGCCACCTGG
TTCGAGGCGG TGCGCGGATC CGGCTTGCCA ACCGGCTGGT GCTGGCAGAA GCGATTGAAC
CGAATTGGAT TCGACCGCGC CGACCTGGTG CTTGCCCCGA GCCGGAGTCA TGCCATGGCG
GTGACGCGTT GTTACGGTCC GGTTCGCGAT CTCGCGGTCG TCTACAATGC GAGCCGGAAC
GCATCCTCCA TCTCCGCCAA GGAGAACTTT GCTTTTGCGG CCGGGCGCTG GTGGGACGAA
GGCAAGAATG GAGCCGTACT GGACAGGGCG GCCGCCGATA TAGGCTGGCC GGTCGTCATG
GCCGGTGCCT GTGACGGCCC CAACGGGCAA CGCCTGGCGA TTGCGCATGC CGATCACAAG
GGCGAACTCA GTCACGACAG GGCGATTTCG CTGATGTCGA GGGCGGCGAT CGTCGTTTCC
CCTTCGGTCT ACGAACCATT CGGTCTGGTG GCGCTGGAGG CAGCACGCGC CGGTGCGGCA
TTGGTTCTCG CCGACATCGA AACCTATCGC GAGCTCTGGG AGGGCAGCGC TCTTTTCGCC
GATGCCGAGG ATCCCGCCGC CTTCGCCGGG GCCGTCAACC GGCTTGCGGA GGATGCGGGG
TTCAGGGCCG AGCTCGGCCG GCGAGCGCAG TCGCGCGGTT CCGACTTCAC CCTTGAGGCC
CAGCGCGAGG CGATATTCGC CGCCTATAGG CGCGTGATGC GCGGGGAAAA TCGACGGACG
GCAGCGGAGT GA
 
Protein sequence
MSVHALSGSR QEVRHDAPSP LPRRILMTVD AVGGVWRYAV DLAEAMRRSG VETLIVGFGP 
APSPEQRREA ESIGRLEWSD QPLDWMVEDE SELCGVPDLL SELALKHSVD LMHLNLPSQA
SGITAEIPVV TVSHSCVATW FEAVRGSGLP TGWCWQKRLN RIGFDRADLV LAPSRSHAMA
VTRCYGPVRD LAVVYNASRN ASSISAKENF AFAAGRWWDE GKNGAVLDRA AADIGWPVVM
AGACDGPNGQ RLAIAHADHK GELSHDRAIS LMSRAAIVVS PSVYEPFGLV ALEAARAGAA
LVLADIETYR ELWEGSALFA DAEDPAAFAG AVNRLAEDAG FRAELGRRAQ SRGSDFTLEA
QREAIFAAYR RVMRGENRRT AAE
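
The relationship between the two sequences can be illustrated by translating fragments of the gene sequence. This is a minimal sketch, not the database's own method: it builds a codon table from the standard amino acid assignments (translation table 11 uses the same assignments and differs only in permitted start codons), strips the 10-character grouping used in the listing, and translates the first 30 nt and the final 12 nt. The final fragment is in frame because the 1140 nt preceding it are a multiple of three. The function names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon. Trailing partial codons are ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the gene sequence above.
first_30_nt = "ATGAGCGTCC ATGCCCTTTC CGGCTCAAGA"
last_12_nt = "GCAGCGGAGT GA"

print(translate(first_30_nt))  # MSVHALSGSR
print(translate(last_12_nt))   # AAE*
```

The first result matches the opening block of the protein sequence, and the second matches its final three residues followed by the stop codon.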