Gene Smed_4803 details

Gene Information

Locus tag: Smed_4803
Symbol:
ID: 5318690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1322312
End bp: 1323628
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 61%
IMG OID: 640776597
Product: glycosyl transferase group 1
Protein accession: YP_001313529
Protein GI: 150376933
COG category:
COG ID:
TIGRFAM ID:
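
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated on this page.

```python
# Values copied from the Gene Information fields above.
start_bp = 1322312
end_bp = 1323628

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: one unspliced CDS whose final codon is an untranslated stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result is 1317 bp with no leftover bases and 438 aa, which agrees with the listed Gene Length and Protein Length.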


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.70984
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGGGA GGTCGTCTTT GCAGCGCGAG ATCTTTTCGA TGAGCGCAAC AATGAAAGAG 
GGCTTTGCCC TTTGCGGCGC GGCCGCTTCG GTCTTCCGTG AAATACCGCG GCAGTTTGGT
CGCCGCACCC GCCAGATCAG CCACATTGTG GCGACGACTG GGTTTCAGGG GGTCATCTGC
CGCGCCAGAT TCAAGGCATC GGACTGGATA AGACCTCGCG AGCCGGTCTG GCCAGTTGTT
CCCGATGACA TTATCGCCGC CGACCTCTCA CAGCCCTTCT GCGCACGGGT TCCGGAGATC
GATCGGGAGG CCCCGATAAC GGTGAACTGG GTAACGGGGC CGGCCGGACG CGGATCGGGC
GGACACACGA CACTGTACAG GATCGTCAAG CAACTCCAGA ATAGCGGCTA TTTGAACCGC
GTCTACCTGT ACGATCCGTA CGCAGGGGAT CCCAAGTACT ATCAGGGGCT CGCACGTGAG
CATTATGGGC TCACCTGCGA GATAGGCGAC ATCCGCGACG GCATGAAGGA TGCCGACGCA
CTGGTAGCGA CGAGTTGGCC AACGGCATAC GCCGTCTTCA ATGCGCGCTG CGCCGGCAAG
CGGTTCTATT TCGTTCAGGA CTACGAACCA TACTTCTATC CGGTGGGCAC AAACAGTGTG
CTTGCCGAAA ACACCTATCG AATGGGCTTC CATGGCATCA CCGCTGGGCG CTGGTTGGCT
GAAAAGCTCG CCCGGGAGTT CGGCATGCAG AGCGACTATT TCCCGTTCGG TTGCGATACG
GCCCTGTATC GCCGGGACCC CGCCTCGAAG CGGTCCGGGG TCGCTTTTTA CGCGCGGGTC
GGTACCCCGC GTCGCGCCGT CGAGCTGGGC CTTCTGGCAC TCGAGTTGTT CGCAAAACGG
CAACCCCAGA TCGAATTGCA CCTGTTCGGC GAGCGGTTCG ACAATCTGCC GTTCCGCGTC
ACCAATCATG GGCTCGTATC CCCCCAAAGG CTCAACGAGA TCTACAATCG CTGTTTTGCC
GGCCTGAGCC TCTCGCTGAC GAATGTCTCG CTCGTGCCGC AGGAGATGCT CGCGTCCGGC
TGTCTCCCCG TCGTCAACGA CGCGGTGCAA AATCGAATTG TCCTCGATAA CTCGTACGTG
CGGTACGCTC CACTGACGCC CCATCTGCTT GCGGCCGCAT TGGAGAGCGT GGTGAGCATG
CCTGACTTTG CAAGCGTATC GAAGAAGGCA TCCGAGAGCG TTGCGCCAAC GTCCTGGAAC
ATGGCGGGGG CGGCGGTAGA CAGGGCATTT CGCGTGGCGC TCCGGCAGGC TCTTTGA
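
A GC-content figure can be recomputed from a sequence printed in this layout. The following is a minimal sketch: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first 60 bases of the gene are used as sample input, so the printed value describes that fragment and not the whole gene.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here only as sample input.
FRAGMENT = "ATGGGCGGGA GGTCGTCTTT GCAGCGCGAG ATCTTTTCGA TGAGCGCAAC AATGAAAGAG"

bases = clean_sequence(FRAGMENT)
print(len(bases), round(100 * gc_fraction(bases), 1))
```

Passing the full gene sequence to `clean_sequence` in the same way would give the figure to compare against the reported GC content.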
 
Protein sequence
MGGRSSLQRE IFSMSATMKE GFALCGAAAS VFREIPRQFG RRTRQISHIV ATTGFQGVIC 
RARFKASDWI RPREPVWPVV PDDIIAADLS QPFCARVPEI DREAPITVNW VTGPAGRGSG
GHTTLYRIVK QLQNSGYLNR VYLYDPYAGD PKYYQGLARE HYGLTCEIGD IRDGMKDADA
LVATSWPTAY AVFNARCAGK RFYFVQDYEP YFYPVGTNSV LAENTYRMGF HGITAGRWLA
EKLAREFGMQ SDYFPFGCDT ALYRRDPASK RSGVAFYARV GTPRRAVELG LLALELFAKR
QPQIELHLFG ERFDNLPFRV TNHGLVSPQR LNEIYNRCFA GLSLSLTNVS LVPQEMLASG
CLPVVNDAVQ NRIVLDNSYV RYAPLTPHLL AAALESVVSM PDFASVSKKA SESVAPTSWN
MAGAAVDRAF RVALRQAL
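
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the pipeline that produced this record: `translate` is a hypothetical helper, and it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code, differing only in which start codons it permits. Alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above. Every full line holds
# 60 bases, so each line begins on a codon boundary.
FIRST_LINE = "ATGGGCGGGA GGTCGTCTTT GCAGCGCGAG ATCTTTTCGA TGAGCGCAAC AATGAAAGAG"
LAST_LINE = "ATGGCGGGGG CGGCGGTAGA CAGGGCATTT CGCGTGGCGC TCCGGCAGGC TCTTTGA"

print(translate(FIRST_LINE))  # MGGRSSLQREIFSMSATMKE
print(translate(LAST_LINE))   # MAGAAVDRAFRVALRQAL
```

The two outputs match the first 20 and the last 18 residues of the protein sequence listed above, and the final TGA codon is read as the stop.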