Gene Smed_4810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4810 
Symbol 
ID5318697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1329561 
End bp1331069 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content63% 
IMG OID640776604 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_001313536 
Protein GI150376940 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.800331 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCGC ATCCCAGGCC TGAGGAAGGT CATAGCTACG GTCAAATCCT CAAGTCGACG 
GCGCTGATCG GCGGTTCGTC GGCCGTCAAC GTGGTATTTG CGATCTTCCG CAACAAGGCG
ACGGCTTTGC TGCTCGGTCC CGCGGGCGTA GGGCTGATGG GTCTCTACAG CTCCATCGCC
GATATCGCCT GCGCACTCGC CGGGCTTGGA ATCCAGGCGA GCGGCGTACG CCAGATCGCC
ATAGCCGTCG GCAGCGGCGA CGCGGACGCG ATCGCGCGGA CGGCCACTGC GTTGAGGCGC
GTATCGGTCC TGCTGGGGCT TGTTGGCGCC CTTCTCCTAA CTGCTCTGGC AGTGCCAATC
GCATGTTTCA CTTTCGGCGG TCATGGCTAT GCCGGATGCG TCACTCTACT CTCGGCCGCG
ATCTTCCTCC GCCTGCTGGC GGACGGACAG ACTGCCTTGA TCCAGGGCAT GCGAGATATC
GCCAGCCTTG CCCGCATCAA TGTCCTCGCC GCCTTCTTCA GCACGGTTGT CACAATCCCG
CTGATCTATT TTTTTGGCGC GTCGGGCATC GTGCCCTCGC TCGTGGTCGT TGCTGCGGCT
TCGCTCGCGA CCTCCTGGTG GTACGGCCGG CGACTGCGGG TAACCGCGCG CCCGATGTCG
ACAGCACAAC TCCGCCGAGA GGTGGAAGCC CTCTTGAAGC TCGGCTCCGC CTTCATGGTC
AGCAGCTTTC TAACATTGGG CGCAGCCTAT GCGGTGCGCA TCTTCGTGCT GCGCGCCGAA
GGCTTGACGG CGGCCGGCCT CTACCAGGCA GCCTGGACAC TCGGCGGTCT CTATGCCGGC
TTCATCCTGC AGGCGATGGG AACCGATTTC TACCCGCGCC TGACGGCGGT GGCGGAAGAC
AATGGCGAAT GCAACCGCCT CGTCAACGAG CAAGCCCAGG TCAGCATGCT ACTCGCCGGC
CCTGGCCTCA TAGCAACGCT CACCGCCGCG CCATTGGTGG TCAGGCTGCT GTATTCGCCC
GAATTCTACC CCGCTGTGGA ACTCCTTCGC TGGATCTGCA TGGGCATGAT GCTGCGGATC
ATTTCATGGC CAATGGGGTT CATCGTTCTC GCAAAAGGTG CCAGGAGAGC CTTTTTCTGG
ACGGAGGTTA CGGCAACCGT GGTCCATGTC GGCCTCGCAT GGCTCTGTGT GGGCGTGTTT
GGATCGGCCG GCGCAGGCCT GGCGTTTGTC GGTCTATATG TCTGGCACGG CTTGCTAATC
TATGCGATCG CACGTCACCT CTCGGACTTC CGCTGGTCCG CCACCAACCG AAAGCTAGCC
CTGTTCTTCC TGCCTGCGTC AGGCTTCGTC TTCGGTGCTT TCGTCGCTCT GCCGCCTTGG
CCGGCGACGA TATTCGGCAT GCTGACAACC GCGCTGAGCG GAGCCTATTC ACTGCGGATG
CTCATGGAAC TCGTCCGGCT GCCGTCCTTG CCGGCCGCAG TCCGCGCCTG GTGCTCCCGG
TCGACCTGA
 
Protein sequence
MQSHPRPEEG HSYGQILKST ALIGGSSAVN VVFAIFRNKA TALLLGPAGV GLMGLYSSIA 
DIACALAGLG IQASGVRQIA IAVGSGDADA IARTATALRR VSVLLGLVGA LLLTALAVPI
ACFTFGGHGY AGCVTLLSAA IFLRLLADGQ TALIQGMRDI ASLARINVLA AFFSTVVTIP
LIYFFGASGI VPSLVVVAAA SLATSWWYGR RLRVTARPMS TAQLRREVEA LLKLGSAFMV
SSFLTLGAAY AVRIFVLRAE GLTAAGLYQA AWTLGGLYAG FILQAMGTDF YPRLTAVAED
NGECNRLVNE QAQVSMLLAG PGLIATLTAA PLVVRLLYSP EFYPAVELLR WICMGMMLRI
ISWPMGFIVL AKGARRAFFW TEVTATVVHV GLAWLCVGVF GSAGAGLAFV GLYVWHGLLI
YAIARHLSDF RWSATNRKLA LFFLPASGFV FGAFVALPPW PATIFGMLTT ALSGAYSLRM
LMELVRLPSL PAAVRAWCSR ST