Gene Smed_1146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1146 
Symbol 
ID5321992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1217427 
End bp1218404 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content62% 
IMG OID640790087 
ProductTonB-system energizer ExbB type-1 
Protein accessionYP_001326832 
Protein GI150396365 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0811] Biopolymer transport proteins 
TIGRFAM ID[TIGR02797] tonB-system energizer ExbB 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.200277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATC GGGTCCGGTC GAAATTGAAC CTGTTGCTGA CCGCAATGCT CACTGTCTTT 
CTTTTCGGTC CGGTTGGTGC CGGGCTTGCG CAGACCGCAC AACAGCCCAA TTCCGTTTCG
GTCGACGCCC AGCCTGCGGC GCCGGACGTG TCGGGCGCCG ATGGACCGCT CCTGCAGGCG
GGCGAGTCGG TCGAGGCAGC GACGGACGGC GCAACGGCGG AGGGAGCGAA CCCGGTGCTT
CCGCACGATC TCTCGCCGGT TGGAATGTTT CTTGCCGCCG ATATCGTCGT TAAAGCGGTG
ATGATCGCTC TTGCGCTTGC ATCCGTCGCA ACCTGGGCGA TCTTCATCGT CAAGACGCTG
GAACTCGCCT ATGCCAAGTC GCGTCTCAAG CGCGCCGTAG CAAATCTCGT TTCGGCAAAT
GGCCTGGCCG AGGTTCATTC CAAGCTCGAG CGCCGCTCCG GCGTCGCCGG AAACATGGTC
ACTGCGGCGA TCGACGAAAT GACACGCTCC GAGGCCGTTC TGGATCTTAC GCCGTCAGCC
GGGGTGAAGG AACGCGTCTC TTCGCTGCTT ACGCGTATAG AGGTTCGCGC CGGCAAGAGG
ATGAGCGCCG GTACCGGGAT TTTGGCCTCC ATCGGGTCCG TCGGACCGTT CGTTGGCCTC
TTCGGTACCG TCTGGGGTAT CATGAATTCC TTCATAGGCA TCAGCAAGGC GCAGACAACC
AACCTCGCCA TTGTTGCGCC GGGTATTGCA GAGGCGCTGC TGGCGACGGC AATAGGACTC
GTCGCGGCGA TACCTGCGGT GGTGATCTAC AATTACTTCG CCCGGTCGGT CGGGGGCTAC
AAGCTCATCC TTGCGGATGC GGGAGCAGCC GTTGAGAGGT TGGTAAGCCG CGATCTGGAT
CATCGTCACG CCCGCAAAGC GTCGCGCCGC CAGGACAGCT TCACCCACGG CCCAGACGCT
ATCGCCAGAA TCGGATAA
 
Protein sequence
MSDRVRSKLN LLLTAMLTVF LFGPVGAGLA QTAQQPNSVS VDAQPAAPDV SGADGPLLQA 
GESVEAATDG ATAEGANPVL PHDLSPVGMF LAADIVVKAV MIALALASVA TWAIFIVKTL
ELAYAKSRLK RAVANLVSAN GLAEVHSKLE RRSGVAGNMV TAAIDEMTRS EAVLDLTPSA
GVKERVSSLL TRIEVRAGKR MSAGTGILAS IGSVGPFVGL FGTVWGIMNS FIGISKAQTT
NLAIVAPGIA EALLATAIGL VAAIPAVVIY NYFARSVGGY KLILADAGAA VERLVSRDLD
HRHARKASRR QDSFTHGPDA IARIG