Gene Smed_5679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5679 
Symbol 
ID5319981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp645196 
End bp646284 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content58% 
IMG OID640777409 
ProductHipA domain-containing protein 
Protein accessionYP_001314341 
Protein GI150377746 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.75549 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCTCAG GAACGGTCGG TGCAGACAGG CTGCTGCCGT GGCTCGCAAA CCTGCTTCCG 
GAAACCCATC TCGCAGAAAT CGGTCAACGG CTGAAAGTAT CTCCTCAGGA CATCGTGGGC
CTGCTCGGCC ACATCGGCCG AGACACGGCG GGAGCGCTGT CGATCGGCGA ACCAAGAAAG
GCAGGCGTCA GCCTGGAGCC TATTCTGGAC AAGGTGACGC TGGAGCGCAT CCTCAACGAG
CTTCCAGCGA AACCCTTTCT GGTGGGAGAA CGAGGGGTCT CGATGTCGCT TGCAGGTGTG
CAGGAGAAGC TGCCCGTATT TGTCGATGGA GATCGCATCA TCTCGATACC GGTAGACGGC
ACGCCATCGA CCCAAATCAT CAAGCCGGAT AACGCCCGTC TGGCCCTTGC ACGGGCATGT
GGACTGGAAG CGGCCGAAGC TTCGATCGGC GTAGCCGGTA AAAGGCGCTA TTTGCTGGTG
AAGCGAGATG ACCGTTTCGC GGCCCCGCAG GGCGAGATCC GCAGGCTGCA CCAGGAAGAC
CTTTGCCAGC TGAAAGGACA TTTTCCATTA CAGAAATACG AGCGATCCTC GACAGGTGGC
GGCGTGACGT TGAAGATGAT GTTCGATGCC GTCAGTGATC TGGTTTCCCC CGGCGAGCGC
GTGAAGCTTC TGGATGCGAT GATTTCCAAC GTGCTGATCT GCAACTCCGA CTCGCACGCA
AAGAACTATT CCATCCTGAT CGGTGCGGCG GGATCTGCGA AGATCGCGCC ACTTTACGAT
TTAATGTGTG CTGCTGTTTA CCGTCAGGTC GATCAGAGCC TACCTCAAGG CATTGCCGGG
CGCTTCATCG CGGCTGACTT GGGGCGACGC GATTGGCAAG CAGTAGCTGA GGAGATTGGG
TTGAGTTGCG CATCAACTGT CAGAAGGGTC GGAGAACTTT CCGCTGTGGT CGCAGACGCC
TGCGAAGATG TTACGAAGCG GACTTCTGAA ATCGTTGGCG ATCCCACAAG GATTCTGGAG
CGCGTCACCC ACCAAATTCA AAAGCGATGC AGGCGAATTC AACGGCACCT TTACGTGGCG
CGCAGTTGA
 
Protein sequence
MRSGTVGADR LLPWLANLLP ETHLAEIGQR LKVSPQDIVG LLGHIGRDTA GALSIGEPRK 
AGVSLEPILD KVTLERILNE LPAKPFLVGE RGVSMSLAGV QEKLPVFVDG DRIISIPVDG
TPSTQIIKPD NARLALARAC GLEAAEASIG VAGKRRYLLV KRDDRFAAPQ GEIRRLHQED
LCQLKGHFPL QKYERSSTGG GVTLKMMFDA VSDLVSPGER VKLLDAMISN VLICNSDSHA
KNYSILIGAA GSAKIAPLYD LMCAAVYRQV DQSLPQGIAG RFIAADLGRR DWQAVAEEIG
LSCASTVRRV GELSAVVADA CEDVTKRTSE IVGDPTRILE RVTHQIQKRC RRIQRHLYVA
RS