Gene Smed_4996 details


Gene Information

Locus tag: Smed_4996
Symbol:
ID: 5318717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1510238
End bp: 1511569
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 61%
IMG OID: 640776778
Product: capsule polysaccharide export protein-like protein
Protein accession: YP_001313710
Protein GI: 150377114
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3524] Capsule polysaccharide export protein
TIGRFAM ID: [TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family
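
The start, end, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon, neither of which the record states explicitly. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1510238
end_bp = 1511569

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1332 0 443
```

Under these assumptions the computed values agree with the reported gene length (1332 bp) and protein length (443 aa).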


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0773002
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCAA GCAAAGACGC GGCTGTGAAC AAGGCTGATT CCGCGCAAGC CGCCGGGAAG 
GACGTGTCGG CAACGGCCGC GGCGAAAAGC AAGGGCATAG CGGTTCTCGA AGGGCTTCTC
GGCAAGGAAA GCGCGCCGGT CATCCCCCTG CCGGGACGCC AAAAGGCCTT TCCCCCCAAG
AAGACAGGCT GGAAAGCCGG CTGGCTCAAG AAGCTCCGCT GGCGTCACGC GATCATCGGC
GGCACCTTCC TGGGTCTTGT TGCCGTTCCC GCGACGCTTG CTTCGCTCTA CATGGCCTTC
ATCGCCGCCG ACCAGTATCA CAGCACCACG TCTTTTGCCG TGCGCAGCAT CGAGGGCGGC
GTTTCAAGCG ACATCCTGGG AATGTTCACA CAGGCATCCG GCGGCAGCAC GGTTTCGGAT
AGCTATATCC TCATGGATTA TATCCTGAGC GAGCGCATGG CGGCGGACGC GGACCGCCGG
TTCAAGCTGG AAGACGTCTA CGCGACGCGC GGACTGGACT ATTTCTACGG TATCGGCTCC
GAATTGCCGA TCGAGGATAA GCTCGACTAT TGGCGCGACA TGGTGAACGT CAATTTCGAC
CACGCCTCCG GCATCATGCA GGTAACCGTC AAGGCCTTTG AACCGCGGCA GGCGCGTGAG
ATCGCGAAAT TCATCGTGGA CCAGAGCGAC AACCTCGTGA ACAGCCTCTC GCTCTCCGCC
CGCAACGACG TGCTGCGTGC GGCGCAGGAC GAAGTGCTTG CGGGCGAAGC GCGGCTTTCC
AAAGCGCGCG CGGCACTGCG CGACTATCGC GACAAATCGC AGGAAATCAG TCCGGAAGAG
GGGGCAAAGC TTGCCGTTCA GCTCATCGGA TCGCTGGAGC AGCAGCTGAC GCAGCTCAAT
GCCGATCTTG CGACGGCCAA GAGCCAGATG GGCGAAGACA CGCCCCGAAT CCGTGTCCTC
AAGACGCGCA TAGAGAGCCT GGAGCAGCAG CTCGACGTGG AGCGCCAACG CCTGGGCGCC
GGCGAAAAGT CCGCGGCCGG AAACGACCCC AATTCACCGG ATGTCGCGGG TCGCATCGCC
GAGTTCGAGG AATTGGAGAC GGAGCGCGAG TTCGCGGAAC GTGCCTATAC GGCAGCGTTG
GGATCGCTGG AGAAGGCACG CATCGACGCA AACAACCGCC AGCGCTATCT GGCACTTTTC
ATCGAGCCGA CGCTTTCGGA ACTTGCTCAG TATCCGGCGC GCCTGCTGAA CTCGTTTCTG
GTGATGCTGG GACTACTATT TGCCTGGGGC ATCGGCGTGA TGGGATATTA TAACATCCGC
GATCGGGCGT AG
 
Protein sequence
MAASKDAAVN KADSAQAAGK DVSATAAAKS KGIAVLEGLL GKESAPVIPL PGRQKAFPPK 
KTGWKAGWLK KLRWRHAIIG GTFLGLVAVP ATLASLYMAF IAADQYHSTT SFAVRSIEGG
VSSDILGMFT QASGGSTVSD SYILMDYILS ERMAADADRR FKLEDVYATR GLDYFYGIGS
ELPIEDKLDY WRDMVNVNFD HASGIMQVTV KAFEPRQARE IAKFIVDQSD NLVNSLSLSA
RNDVLRAAQD EVLAGEARLS KARAALRDYR DKSQEISPEE GAKLAVQLIG SLEQQLTQLN
ADLATAKSQM GEDTPRIRVL KTRIESLEQQ LDVERQRLGA GEKSAAGNDP NSPDVAGRIA
EFEELETERE FAERAYTAAL GSLEKARIDA NNRQRYLALF IEPTLSELAQ YPARLLNSFL
VMLGLLFAWG IGVMGYYNIR DRA
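
The sequence blocks above are printed in space-separated groups of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the method used to produce this record: `clean`, `gc_content`, and `translate` are hypothetical helper names. It assumes the gene sequence is given in the coding direction, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts). An alternative start codon would need special handling as methionine, which is not required here because the gene begins with ATG.

```python
# Compact codon table: bases in T, C, A, G order, amino acids listed
# in the matching order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a sequence block, upper-case it."""
    return "".join(block.split()).upper()


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence, as printed above.
first_groups = "ATGGCGGCAA GCAAAGACGC GGCTGTGAAC"
print(translate(first_groups))  # MAASKDAAVN
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence, or passing it to `gc_content` and comparing with the reported 61%, would be a reasonable consistency check on the record.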