Gene Smed_4993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4993 
Symbol 
ID5318714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1507504 
End bp1508805 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content65% 
IMG OID640776775 
Productcapsule polysaccharide biosynthesis protein 
Protein accessionYP_001313707 
Protein GI150377111 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3563] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0145163 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTC ACGCCGGCGA GGGCTCACGA GCGACGGATT TCGGTAGCGG CGCTGCGCCT 
CAGGGGCGCC GAACGACCTT TGCCGTCCAG ATCACCGAGT GGAAGCGGGA GCCTTTCGAA
CGATATTTCC CCGACAGGCA CTTCCACTTC CTTCCGATGA ATCTCGGCGA ACACGAGTTC
GAGCGCGTCT GGAAGCCGCG AATTCTCGCC GAGAGCAGCG CCGAGATACT CGCCTGGGGG
CCGGAGCTAC CAGGCCCGCT GGATGCGCTG GCCAAAGCGC GGAACATTCC TGTTACTTTC
ATTGAGGACG GCTTCCTTCG TTCGGCCAGG CCGAGCGCCA GCCGCACGCC TCCCCTCTCT
CTGGCCCTCG ACAGCAAGGC GATCTATTTC GACTGCCGGC ATCCCTCGCA GCTCGAGGAG
CTGTTGGGAA CCTACGATTT CGAGGCGGAT GCGGAATTGA TGACGCGCGC GCGCGCCGGC
ATCGCGCTTC TCACCGAAAG CGGCATCAGC AAATATAATG GCGGCCGGCA GCGGACGGCC
GAAGAGGTTT ACGGCGAGAA GACGCGCAAG CGTGTTCTGG TTGTCGGCCA GGTGGAGGAC
GATGCTTCCG TCCGCTACGG CTGCCTCAGC CGGATGACCA ACAACGACCT GGTGCGGCTC
GCGGCCTCAG AGCAGCCGGA CGCCCAGATC CTCTACAAGC CGCATCCGGA CGTGCTGAGC
CGCGTGCGGC CGGCAAGGTC CGATCCCGCG GAAGTCGCGC ACCTTTGCAC CCTGGTCACC
GAGAGCCTGC CGCTTGCGGA GGCGCTGCGC ACCGTCGATC ACGTCTATAC GATCACCTCG
CTCGCGGGTT TCGAGGCGCT TATCCGCGGC ATCGAGGTCA GCACCGCCGG CTGTCCCTTC
TATTCCGGAT GGGGGCTGAC GGACGACCGC CAGCCTAACC CGCGCCGCGG GCGGCGCTTG
AGCATCGAGG CGCTTTTCGC GGGCGCCTAT CTGCTTTATC CTTGCTATTT CGACCCCGAG
ACCGGAGAAC GATCATCATT CGAAGCAACC GTCGCGACAC TCAGAAGTCA GCTGGAAGAG
CCGCAGGGCC TTGCCCGGCC GCGGCCGGCC TGGCGCGCCT GGGGGCCCTA TGGCCTCCTC
GGCTGGCGCC ACCTGCTGCC GCCCTTCGTC ACGCCCGTGA TCCGCAAGAT AGGCAGTGAC
CGCGATGTCG AAGACTTCAG GGCCGATCCG ATCCGCTTCT TCCGCACCCT CTCCGACCGG
AAGTTCCGCG TCATCGGCCG TATTCTCTAT CCGTTCGGGT GA
 
Protein sequence
MKPHAGEGSR ATDFGSGAAP QGRRTTFAVQ ITEWKREPFE RYFPDRHFHF LPMNLGEHEF 
ERVWKPRILA ESSAEILAWG PELPGPLDAL AKARNIPVTF IEDGFLRSAR PSASRTPPLS
LALDSKAIYF DCRHPSQLEE LLGTYDFEAD AELMTRARAG IALLTESGIS KYNGGRQRTA
EEVYGEKTRK RVLVVGQVED DASVRYGCLS RMTNNDLVRL AASEQPDAQI LYKPHPDVLS
RVRPARSDPA EVAHLCTLVT ESLPLAEALR TVDHVYTITS LAGFEALIRG IEVSTAGCPF
YSGWGLTDDR QPNPRRGRRL SIEALFAGAY LLYPCYFDPE TGERSSFEAT VATLRSQLEE
PQGLARPRPA WRAWGPYGLL GWRHLLPPFV TPVIRKIGSD RDVEDFRADP IRFFRTLSDR
KFRVIGRILY PFG