Gene Smed_5686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5686 
Symbol 
ID5319988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp652389 
End bp655631 
Gene Length3243 bp 
Protein Length1080 aa 
Translation table11 
GC content60% 
IMG OID640777413 
Productglycosyl transferase group 1 
Protein accessionYP_001314345 
Protein GI150377750 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0438] Glycosyltransferase
[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAC GTTTGCGCAT AGTGCTTGCG ACCGACAGCG TGGATCCATC CGGAATGGGT 
GAGCACATGC TGACGCTCGG TCGGGCGCTC CAGGGCCAGT TCGATGTGAC GCTTGCCGCC
ATCGATGGGA TCGAAGCCGC TCTGCTGACA AGGGCTGCGC GTTGCGGCGT CGCGGTCAAA
GGCATTGATG ACAATGCATC GTTCGAACAT TGGCTGGAGT TTTCGGGCGT TTCTTTGCTC
CACGTTCACG CTGGCATCGG TTGGGAGGGG CATGAGATAG CGCGCGCCGG GGTCGCCTGC
GGCATCCCGG TTATCCGTAC GGAGCATCTC CCCTATCTGC TCACCGATGC CGAGCAGAAG
GAGCGCTATG CCAGAGAAAG TGATGCTCTT ACGCATCATA TTGTCGTTTC GGAAGCATCG
AAGTTAAGCT TCGAGAGCAA GGGGGTCAAG GGCAACCGAA TGACGGTCGT CCGCAATGGA
ATCTTTCCGC TAGGCGCACG GGACATTGCC GGTCGCAGCA AAGTCGCACT TGGGCTCGAC
GGGAAAAGCG TCCTCATTAC GGTGGCGCGC TTTTCTAAGC AGAAGGACCA CGCCACCCTG
ATTAAGGCAA TGCCGGCAGT ATTAGCCGCG GACCCGTCGG TCGTTTTGCT TCTGGTAGGC
AAAGGCGAGG AATTGGAGGC CGTTCGGGCC CTGGTCGAAG ACCTGTCGCT TGGCCCGCAT
GTCCAATTTC TGGGGCATCG CATCGAGGTC GACCAGTTGA TGGGCAACGC CGACCTTTTC
GTCCTGCCGT CGCGATTCGA AGGGCTTCCC TTAGCAGTGC TCGAAGCAAT GTCGATCGGA
CTACCGGTCG TCGCAACCCG GATCGGCGGC ACCGTCGAGG CGCTTGGATC GGAACATCCG
TTCCTGGCTG AGTGCGAAGA TCCGTCCTCA TTGGCTCGCG TGCTGATCGA AGCTTTGAGC
GACCCGGAAC GGGCGAAAAC GATCGGCCGG GCCGGCCGGG CCAGGTTCGA CACTGAATTT
TCAGCGCGGC GAATGGCGGA CGAAACCGCG GCTGTCTATC GGCGATTTCT TTCCGAGCGG
ACGGAGAATA AACAAGGACA CCATTTTATG GACAAGACAC GTGTCGGCTT CATTGGCGTC
GGGGGCATCG CCCACCGGCA TCTTGATATT TTCGCCGGTT TCGAGGACGT GGCGCTCGTT
GCTTTTGCAG ACCCGGACTT CGCACGCGCC GAGCATGCCG CGTCGCGGTT CGGTGCGAAG
GCGTTCGAGA GCCACAGTGC GATGCTGGAG AAAGAGGCGC TGGATGCGGT CTATATCTGC
ATTCCCCCCT TTGCCCATGG CGACGCCGAA CGGGATCTGA TCGCGCGTGG CATCCCGTTC
TTTGTCGAGA AGCCCATTAC GCTCGATATC GAGCTGGCGG AGGAACTCTG TGCCGCGATT
GAAGCCGCAA AGCTGATCAC GGCTGTCGGT TATCATTGGC GCAATCTCGA TACAGTAGAG
GAGGCGCGCC GGCTTCTGGC CGAAAATCCA GCTCAACTTC TTTCGGGCTA TTGGCTGGAC
CAGACGCCAC CGCCGCAATG GTGGTGGAAG AACGACCGCT CAGGCGGCCA GATGGTTGAG
CAGGCGACCC ACATCATTGA TCTGGCGCGA TACCTCATTG GTGAAGTCAC CGACGTTTAT
GGCCGCGTCG GCTTCAAAGA CCGCCCCGAA TTTCCAGGCC TCGATGTGCC GGCGGTGACC
ACTGCGAGCC TCACCTTCCA ATCGGGCGTT ATCGGCAACA TCTCCTCGAC ATGTCTTCTC
GGCTGGAGCC ACAGGGTCGG ACTGAACATC TTCGCCGACC GACTTGCGAT CGAGCTGACC
GACCATGACA TCATGATCGA TGTGGGTGCT GGTCGACCAG TGCGGCATGC CCAGGGCGAT
CCCGTCTGGC GAGAAGATCG CGATTTTGTC GACGCCGTGC GCGGGGGAAA TAATAACATT
CGCTGTGCCT ACGCGGATGC GCTGGCGACG CACCGGCTCG CGCTGGCGGT CGCTTCCTCG
GCGCGCAGCG GCGAGCCGAT AAAGCTCGAT CCGCCTGCCA TAGTCCGCAA CGCTGTGACG
ACGCTGCAGT ATCCACCGTC GTCCGAAGCC TGTCAGGGAT TATCCCCAGG CCACCGTGCC
ATTCGCTCTC TTGGCATTGA AGGGCCCGGA AAAGCATTCT TCTTCGACTA TCAAGAGGGT
CCGCCCGCTG ACGGCCATGT CCGGTTGGAA ACGCTCTTTA CCGGCTTTTC CGCCGGCACC
GAACTTACCT TCATGAAGAA CACTAATCCT TACTTCCACT CCCGTTTCGA TAGCGAGCGT
GGCGTGTTCT TCGAGAACGA GCCGGATCTT CACTACCCGG TGCCTTTTCT GGGATACATG
GAAGTGGCGC GCGTCTCGGA GTCGAAGGCT GCAGGCTTCA AGGAAGGGGA TGTCGTTGCC
GCGACCTACG CGCACAAAAG CGGCCATACT GCCGATCCAT TCCTCGACTT GCTGGTGCCC
TTGCCAGCCG AAATCGATCC CGTTCTTGGC GTCTTCGTAG CGCAAATGGG CCCTATCGCG
GCCAACGGCA TTCTACATGC GGATGCGGAC GCTCTGGGGA GCAATGTATC CTGTCTCGGA
GTGGGCGTTG CGGGGCGTCA CGTGGTCGTG CTCGGAGGAG GCACGGTTGG ATTGATGACG
GCGCTTTTTG CGCAGAAGGC GGGGGCCTTG GAAATCGTCG TTGCCGATCC TTCGGCGTTT
CGGCGGAACA AGGCGCGGGG CATGGGTTTG ATTGCGATGA CCGAGGATGA GGTCTGGCAG
CACGCGAAAA CGCGATGGCA TAACGGCGGA AGTGATCGCG GCGCCGATGT CGTGTTTCAG
ACACGGGCGC ATCCGTGGAG CCTCCACGTC GCGCTGAAGG CACTGCGCCC GCAGGGCACG
GTCATTGATC TCGCATTCTA TCAGGGCGGC GCCGAGCGGC TGCGGCTAGG CGAAGAGTTT
CATCACAATG GCCTCAACAT CCGCTGCGCA CAGATTAACC GCGTACCCAG AGGACTGGCG
CCGCTGTGGA ACAGGCGCCG GCTTGCCGAG GAAACGGTCC AGCTCTTAAA AATCTACGGA
GCTCTCATTC GCGAGCATAT GATCACGCAT GTCGTTCCAT TCGACGATGG GCCAAAATTC
CTCGCTGATC TGGTGGAGAA CCGGCCGGAA TTCCTGCAGA TCGTCTTCAA GGTCAGCGGA
TGA
 
Protein sequence
MNKRLRIVLA TDSVDPSGMG EHMLTLGRAL QGQFDVTLAA IDGIEAALLT RAARCGVAVK 
GIDDNASFEH WLEFSGVSLL HVHAGIGWEG HEIARAGVAC GIPVIRTEHL PYLLTDAEQK
ERYARESDAL THHIVVSEAS KLSFESKGVK GNRMTVVRNG IFPLGARDIA GRSKVALGLD
GKSVLITVAR FSKQKDHATL IKAMPAVLAA DPSVVLLLVG KGEELEAVRA LVEDLSLGPH
VQFLGHRIEV DQLMGNADLF VLPSRFEGLP LAVLEAMSIG LPVVATRIGG TVEALGSEHP
FLAECEDPSS LARVLIEALS DPERAKTIGR AGRARFDTEF SARRMADETA AVYRRFLSER
TENKQGHHFM DKTRVGFIGV GGIAHRHLDI FAGFEDVALV AFADPDFARA EHAASRFGAK
AFESHSAMLE KEALDAVYIC IPPFAHGDAE RDLIARGIPF FVEKPITLDI ELAEELCAAI
EAAKLITAVG YHWRNLDTVE EARRLLAENP AQLLSGYWLD QTPPPQWWWK NDRSGGQMVE
QATHIIDLAR YLIGEVTDVY GRVGFKDRPE FPGLDVPAVT TASLTFQSGV IGNISSTCLL
GWSHRVGLNI FADRLAIELT DHDIMIDVGA GRPVRHAQGD PVWREDRDFV DAVRGGNNNI
RCAYADALAT HRLALAVASS ARSGEPIKLD PPAIVRNAVT TLQYPPSSEA CQGLSPGHRA
IRSLGIEGPG KAFFFDYQEG PPADGHVRLE TLFTGFSAGT ELTFMKNTNP YFHSRFDSER
GVFFENEPDL HYPVPFLGYM EVARVSESKA AGFKEGDVVA ATYAHKSGHT ADPFLDLLVP
LPAEIDPVLG VFVAQMGPIA ANGILHADAD ALGSNVSCLG VGVAGRHVVV LGGGTVGLMT
ALFAQKAGAL EIVVADPSAF RRNKARGMGL IAMTEDEVWQ HAKTRWHNGG SDRGADVVFQ
TRAHPWSLHV ALKALRPQGT VIDLAFYQGG AERLRLGEEF HHNGLNIRCA QINRVPRGLA
PLWNRRRLAE ETVQLLKIYG ALIREHMITH VVPFDDGPKF LADLVENRPE FLQIVFKVSG