Gene Smed_4357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4357 
Symbol 
ID5318206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp855905 
End bp857107 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content63% 
IMG OID640776162 
Productglycosyl transferase group 1 
Protein accessionYP_001313095 
Protein GI150376499 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGAA AATCGGAGAC CTCCGGAGAT CGTCGTCCGG CACAGGCGGA TGCGTTGACT 
TTGACGGGTC GCCAGGCGGC GAGAGTCGCC AAGCGCCGGC GCGTGATGAT GCTTGGGCTT
CGCGGCATCC CGAATGTCCA GGGCGGCGTC GAAAAGCACG TGGAGATGCT TTCCTCCCGG
CTGACAGGGC ATGGCTGGGA GGTCGAGGTC GTCGGCCGAC GGCGCTATCT GCCGGTCTCC
GGGCCGCATC TCTGGAATGG CGTTCGCGTC TCGCCTCTCT GGGCTCCGCG CATGATGGCG
CTCGAAGCCA TTATTCATAC CTTTCTCGGC ATATGCTTCG CCGCCATGCG GCGTCCCGAT
GTCCTGCACA TCCACGCCGT CGGTCCGGCA CTCCTCGTGC CGCTTGCCCG GCTCATGCGG
CTGAATGTCG TCGTGACCCA CCACGGTTAC GATTACGATC GGCAGAAATG GGGCGCCTTT
GCCAAGAAGA TGCTGCGGAC AGGCGAACGG ATGGGCATGC GCCTCGCCCA TGGGCGCATT
GCGATCTCAA GGGAAATCGC GGAGACGATG GGCGAACGCT ATCGCGTGCC GGTTGCCTTC
GTGCCGAATG GCGTTGCCGT TTCGCATTGC GACGTTGAAA CCGGGGTTCT CGGCGAATTC
GGGCTTACAC GCCGTCGCTA CATCCTGCTT GCAGCGCGGC TCGTGCCGGA GAAGCGGCAG
ATCGATCTCA TCAAGGCCTA TGCAAAACTG GGAAATACCG GATTCAATCT CGTGCTCGCG
GGCGGTGCCG AATTCGAGAG CGCTTACGAA GACGAGGTAA GGGCGCTGGC CAGCCGCGTA
CCGGGCGTCG TTCTGACGGG GTTCCAAACC GGCGATCGTC TGGCGGAGCT CTTCGCCAAC
GCGGCGCTCT TCGTTCTGCC GTCCAGTCAC GAAGGCATGC CGATCGCGCT TCTGGAGGCC
ATGGCTCACG GCCTCCCGGT TCTGGCAAGC GACATCGTCG CCAATCGCCA GCTCGACCTG
CCCGCGGGCG ACTACTTTCC GCTTGGCGAT ATCGATGCTC TTGCGTCCGC CATTTCTGAA
AAGACTGCCA CGTCCCTGGA TGAACAAGAG GTGCTCGCGC AGACTGAACG CGTCGAGTCA
GCCTATAGCT GGTCGAGCGT GGCCTTGAAG ACGCTCGATG TCTACCGGGC GGTGACGAAA
TGA
 
Protein sequence
MSRKSETSGD RRPAQADALT LTGRQAARVA KRRRVMMLGL RGIPNVQGGV EKHVEMLSSR 
LTGHGWEVEV VGRRRYLPVS GPHLWNGVRV SPLWAPRMMA LEAIIHTFLG ICFAAMRRPD
VLHIHAVGPA LLVPLARLMR LNVVVTHHGY DYDRQKWGAF AKKMLRTGER MGMRLAHGRI
AISREIAETM GERYRVPVAF VPNGVAVSHC DVETGVLGEF GLTRRRYILL AARLVPEKRQ
IDLIKAYAKL GNTGFNLVLA GGAEFESAYE DEVRALASRV PGVVLTGFQT GDRLAELFAN
AALFVLPSSH EGMPIALLEA MAHGLPVLAS DIVANRQLDL PAGDYFPLGD IDALASAISE
KTATSLDEQE VLAQTERVES AYSWSSVALK TLDVYRAVTK