Gene Smed_4786 details


Gene Information

Locus tag: Smed_4786
Symbol:
ID: 5318408
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1303585
End bp: 1305144
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 57%
IMG OID: 640776582
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_001313514
Protein GI: 150376918
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
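
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes one terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the last codon is a stop codon that is not counted in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - (1 if has_stop else 0)


length = gene_length(1303585, 1305144)
print(length)                           # 1560
print(expected_protein_length(length))  # 519
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.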


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.419568
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAATG GTGCGGATGC ACGCTTTTAT TTTTCAATCC TTCTGAGGAG ATTGCCGTAT 
CTGGTGGCGA TCGTCGGTTC CGTGATTGCG CTTACCGTCA TTGTCGCTAG CATTTTACCG
CCCCGTTATC GTGCGAGCGC CAAAATTCTC GTAGAGGCTC CGCAAATTCC TGTGGAACTG
GCGCGATCGA CTGTCCCAAT ACAGGCGGCC CAGCAATTGC AGATTATTCG GCAGCAGATC
ACGACGCGCG ACGACCTTCT CGCGCTTGCC GACATGCTCG ACATTTACGG CAAAGAAGAA
GACGAACTGT CCAAGGATGA CATCGTAGAT AATATGCGCT CCCGCATCAC ATTCGAGGAA
CTCGCGTTGA GCGCGCCATA CGGTGATACT GGTGCCTCTG TGGCAAGCGT GAGCTTCACC
GCGGCAGATC CTGATCTTGC GGCCAGGGTC GCTAACGAGC TTGTCGACTT TATTCTGCTG
AAGCAGCAGC AACAACGCAC CAGTCGTGCC GCGGACACGG TCAAGTTCTT TGATCAAGCG
GTCGCGAGAC TTGGCACGGA TCTGAGTAGG GCCGAACTCG AAATTCTAAG ATACAAGAAC
GAACATGCAG ACACTCTTCC CGAAAGTCTT GATTTTCGCC GCAGTCAACA AACAGGCCAG
CAACAGAGAT GGATCACGCT CGAGCGCGAA GAGTCGGATC TCCGGGCTAA GCGGAGCACC
CTTGTCGAGA GTTACGTCCT TGGTGGCCAA GCTCCCGATG GTAAGGCGGC GACACCGGAA
CAGCTGGCCC TGCAAGAGCT GACGCGTGCG CTTGCCGAGC AGCGTGCGAT TTTCTCGGAG
AACAGCCCCA ATATAATGGC CCTTCGCGGC CGCATTGCTT CATTGCAGGC CACATTGCGC
ACGACGCAGA CAAGCGAAGC TAGTTCGGCC CAGGACAGGG TCGCGCGTTC CCCGCTGGAC
CAACAGTTGG CGTATATCGA CGAGCGCTTG CGCGCGATTG TCGGGGAGAA GGCCGCAATC
ACCGATCGCA TAGACGAACT GAGCAAATCA ATCAGTGCGA CGCCGGAAAG TGAAACCGTT
CTTTATTCGT TCGAGCGCGA CCGGGCAAAT CTTCAATCAC AATACAATAC TGCGATAGCC
CGACGCGCCG AGGCGATCAT CGGCCAGCAG ATCGAGAGGC GGTCCGACGG GAGTAGTTTC
TCAGTGCTTG AGCGCGCGAC CGCACCTGAG ATGGCGGAGA GCCCAAATCG CCGCCGTATC
GTGCTCCTTG GCGCGCTGGC CGGAACGGCT CTCTCCGTGG CCTTCATCGG GCTGCTTGAG
TTTTTCAACG CGGCCATACG CAGACCCAAT GAACTTGCGC GGCTGCTCGA CCGTCAGCCG
CTTGCCACTA TACCGTATAT TTCGACTGTA GCCGAGGTAC GTAGTCGGAT CAGACGAACG
GTCGCCGCGG TACTTGCTGC TGCGGCTGCT CCCGCGGCAT TGATCGTAGT CCACCAGTTC
TATATGCCGC TGCCGATAGC ATTTCAGAAT CTCTATCGGT GGCTCACCAC GCTGGCCTAG
 
Protein sequence
MMNGADARFY FSILLRRLPY LVAIVGSVIA LTVIVASILP PRYRASAKIL VEAPQIPVEL 
ARSTVPIQAA QQLQIIRQQI TTRDDLLALA DMLDIYGKEE DELSKDDIVD NMRSRITFEE
LALSAPYGDT GASVASVSFT AADPDLAARV ANELVDFILL KQQQQRTSRA ADTVKFFDQA
VARLGTDLSR AELEILRYKN EHADTLPESL DFRRSQQTGQ QQRWITLERE ESDLRAKRST
LVESYVLGGQ APDGKAATPE QLALQELTRA LAEQRAIFSE NSPNIMALRG RIASLQATLR
TTQTSEASSA QDRVARSPLD QQLAYIDERL RAIVGEKAAI TDRIDELSKS ISATPESETV
LYSFERDRAN LQSQYNTAIA RRAEAIIGQQ IERRSDGSSF SVLERATAPE MAESPNRRRI
VLLGALAGTA LSVAFIGLLE FFNAAIRRPN ELARLLDRQP LATIPYISTV AEVRSRIRRT
VAAVLAAAAA PAALIVVHQF YMPLPIAFQN LYRWLTTLA
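
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute GC content, and translate a gene sequence. It assumes the sense-codon assignments of translation table 11 (which match the standard genetic code) and does not handle alternative start codons; the helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# The sense-codon assignments of table 11 are the same as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon.

    Trailing bases that do not form a complete codon are ignored.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGATGAATG GTGCGGATGC ACGCTTTTAT TTTTCAATCC TTCTGAGGAG ATTGCCGTAT"
print(translate(first_line))  # MMNGADARFYFSILLRRLPY
```

Applied to the first line of the gene sequence, this reproduces the first twenty residues of the protein sequence; passing the full gene sequence to `gc_percent` and `translate` allows the GC content and Protein Length fields to be checked in the same way.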