Gene Smed_4999 details


Gene Information

Locus tag: Smed_4999
Symbol:
ID: 5318720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1514089
End bp: 1515114
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 51%
IMG OID: 640776781
Product: capsule polysaccharide biosynthesis protein
Protein accession: YP_001313713
Protein GI: 150377117
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3563] Capsule polysaccharide export protein
TIGRFAM ID:
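
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tooling.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1514089
END_BP = 1515114
GENE_LENGTH_BP = 1026
PROTEIN_LENGTH_AA = 341


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length and protein length are mutually consistent with the start and end coordinates.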


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.851528
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0115997
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACAG CATTTGTCTT TGGGCTAAGC CCTTGGAAGG ACTTCATTAG AAGCTGGCTT 
CCCGAAGAAA GGATATATTG CCAAAAGCGG ACAATTTCCT GGCTGGAATT TCACGCGGTC
TGGGCACCGT TGATATTAGT ATCAAAGGAT CCGAAAATAT ACGTATGGGG CTACAAGCAC
CCTCCCTTCA TCGAAAGGTT CTCAAGGCTA AGTCGCGTTA ATCTCATCCG AATAGAAGAC
GGATTCATTC GCTCAGTTGC TTTGGGCGCC AGCAAGGCGC CACCACTTTC TCTTTGCTTC
GATTCTCCTG TCCTTTACTA CGATCCCAGC TCACAGTCGA CACTTGAACG CCTCATAGAG
ACCTACCACT TTTCGGCGGA CCCAGCTCTG CTGTTGCGAG CACGAACGGG AATGAACCGC
TTGGTCAGCA GCCGGTTGAG CAAGTACAAC ACGTCCCAAG ACGTTGATGT CCACCGTATC
TACGGCCCGA AAGACTGCAA GCGAATACTG GTTCTCGGTC AAGTAGAGGA TGACATGTCG
ATCATCAAGG GCTGTTCGCG CCTGATGACT AACAACGACC TCGTCCGTCT CGCCGTTCAA
GAGAATCCCG GTGCACAAGT AATCTACAAG CCCCATCCGG AAGTATTACA CGGCACCAGA
CTCGCCCGAT CGAGTCCGGA AGAGGTTAGA CCAATCGCGC AGGTTCTCGA TGACGACATT
GCTTTGGCGG ATGCCTTCGA AACAATCGAT CACGTTTATA CGATCACCTC ACTCTCGGGA
TTCGAAGCGC TGATAAGAGG AATAAAGGTA ACTTGCCTTG GCATGCCGTT CTATGCGGGT
TGGGGACTTA CTGATGATCG CCAATCCTGC TTGCGCCGCT CGGCGAAGCG TAGCGTGGAA
GAGGTGTTCG CTGCAGCCTA TCTACTCTAT CCCAAATATT TCCATCCGCA TGAGAAGAAG
ATGATTTCAT TTGAAGAGGC GCTGGAACTT CTCCATTCCA TGAAACACGC TTCGGCTACT
CCCTAA
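
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how a GC percentage such as the one in the GC content field could be recomputed; `clean_sequence` and `gc_content` are hypothetical helper names, and the exact rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence from this record (60 bases).
    first_line = (
        "ATGACCACAG CATTTGTCTT TGGGCTAAGC "
        "CCTTGGAAGG ACTTCATTAG AAGCTGGCTT"
    )
    print(len(clean_sequence(first_line)))
    print(round(gc_content(first_line), 1))
```

To check the whole gene, paste all sequence lines into one string and pass it to `gc_content`.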
 
Protein sequence
MTTAFVFGLS PWKDFIRSWL PEERIYCQKR TISWLEFHAV WAPLILVSKD PKIYVWGYKH 
PPFIERFSRL SRVNLIRIED GFIRSVALGA SKAPPLSLCF DSPVLYYDPS SQSTLERLIE
TYHFSADPAL LLRARTGMNR LVSSRLSKYN TSQDVDVHRI YGPKDCKRIL VLGQVEDDMS
IIKGCSRLMT NNDLVRLAVQ ENPGAQVIYK PHPEVLHGTR LARSSPEEVR PIAQVLDDDI
ALADAFETID HVYTITSLSG FEALIRGIKV TCLGMPFYAG WGLTDDRQSC LRRSAKRSVE
EVFAAAYLLY PKYFHPHEKK MISFEEALEL LHSMKHASAT P
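
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own method. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ only in permitted start codons), and it assumes the printed gene sequence is already the coding strand read 5' to 3'. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence in this record.
    print(translate("ATGACCACAG CATTTGTCTT TGGGCTAAGC"))
```

The first ten codons translate to MTTAFVFGLS, which matches the first block of the protein sequence above.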