Gene Smed_4578 details

Gene Information

Locus tag: Smed_4578
Symbol:
ID: 5318025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1071872
End bp: 1073125
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 59%
IMG OID: 640776379
Product: polysaccharide export protein
Protein accession: YP_001313311
Protein GI: 150376715
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1596] Periplasmic protein involved in polysaccharide export
TIGRFAM ID:
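
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
print(gene_length(1071872, 1073125))  # 1254
print(protein_length(1254))           # 417
```

Both results agree with the Gene Length and Protein Length fields listed for Smed_4578.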


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.653885
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.998934
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGATTT CACCGGCCGT TTTCGGAACG ATTTTGCCCC GCTTAATGAA TGGGGCGTTC 
CTGCTGCTGA TGGTTCTATG GGTGCTCGCC TGTGCGATGA TAGGAGCAAT GCCGGCTCGA
GCCAGTGATT ACCGGCTTAA CACTGGAGAT GTTCTGACCT TCGACTTCCT TGATGATACA
GAGTTGCCGG TCACCGCGAC CGTTTCGGGC GAAGGCGAAG CACAATTTCC GCTCATTGGC
GCCGTGGGCG TCGTCGGGCT CACGGTGCCG GAGGCGCTGG AGAGGCTTCG CGGCGAGTAC
CGCAAGCGCG AAATTCTCGT CGATCCGAAG ATCTCTCTCG ATATATCCAC CTTCCGGCCG
ATCTTCGTTC TCGGAGAAGT CAAGACGCCG GGCTCGTTTC CCTTCTACAG CGGGTTGACG
GTCGAGCAGG CCGTCGGTCT GGCGGGAGGC ATGCAGGTGG TTGCGGCAAA CGCCTCAGAC
AGGATCATCG CACGGGCGCG TTTGCGCGGG GACATCGAGG GTGCTCGCGC CGAGATCGTG
CACGAAGCCA TCTATGCCGC GAGGCTCGTG GCGCAATTGA AGTCCTCTGA CAAAATCGAC
CTCGCCGACG TCCCCGAGGT CGCGCGCGAC TATGTGACCA GCGTGCCGCT CGATGGTGTC
GTCGAGCTCG AAGAAAAGAT CCTGAAAGCC GACCTTGCGG CCAACAAGTC GCAGGCGCAG
ATCCTGACCG AAGGTATCGC TCAGGCCGAG GGTGGAATAG ATATTCTGAA CCAACTGGTT
CTGCAGCAAA AGGACGTGGT GCAGAACAGC AAGGAGGATG TGGACCGTAC CGCTACCTTG
CGCAAGCGCG GCCTGAATAC AGAGAGCGAC CTGTCGCGCG CCGAGAACAA TGCCTCGGCC
GAACAGGCGC AGCTTCTCGA GACTTTCGCC ACCCTTGCAC GGTCGCGTCA GGAAATGAGT
GAATTGAAGC TGCAACTGGC GAAGCTTGCG GCCGACCGGG AGAAGGATAT CCTGACCCAA
CTCCAGGCGC GTGAGATCGC GATCAAAAAG CTGATTTCCC AACAGCACTC CGCCGAAGAA
CAAATTCTCC TCATGACCGC TGTGGCCGAA GACGAGTCGA AGAAGAAGCA GATTTCCTAT
ACCTACGAGA TCCGTCGAAA CCCGGTTGGC GGCACGCCGG CCAGCATAAA GGCGTCGCCC
TTGACTGAAC TCGTTCCTGG CGACGTGCTG ACGGTGGCTA TTGCCGGAAT GTAA
 
Protein sequence
MGISPAVFGT ILPRLMNGAF LLLMVLWVLA CAMIGAMPAR ASDYRLNTGD VLTFDFLDDT 
ELPVTATVSG EGEAQFPLIG AVGVVGLTVP EALERLRGEY RKREILVDPK ISLDISTFRP
IFVLGEVKTP GSFPFYSGLT VEQAVGLAGG MQVVAANASD RIIARARLRG DIEGARAEIV
HEAIYAARLV AQLKSSDKID LADVPEVARD YVTSVPLDGV VELEEKILKA DLAANKSQAQ
ILTEGIAQAE GGIDILNQLV LQQKDVVQNS KEDVDRTATL RKRGLNTESD LSRAENNASA
EQAQLLETFA TLARSRQEMS ELKLQLAKLA ADREKDILTQ LQAREIAIKK LISQQHSAEE
QILLMTAVAE DESKKKQISY TYEIRRNPVG GTPASIKASP LTELVPGDVL TVAIAGM
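
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the block formatting, estimate GC content, and translate the gene sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the translation simply reads codons from the first base and stops at the first stop codon, without the special handling of alternative start codons that translation table 11 allows.

```python
# Codon-to-amino-acid assignments shared by NCBI translation tables 1 and 11
# (the two tables differ in which codons may act as start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon.

    Assumption: the first codon is read literally (ATG -> M); alternative
    start codons are not converted to methionine in this sketch.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGGGGATTT CACCGGCCGT TTTCGGAACG"
print(translate(first_blocks))  # MGISPAVFGT
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.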