Gene Smed_5117 details


Gene Information

Locus tag: Smed_5117
Symbol:
ID: 5319419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 69044
End bp: 70681
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 61%
IMG OID: 640776895
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_001313827
Protein GI: 150377232
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0275586
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCGAT TGGCGAATAG CGTATATGGT ACTTTGGGGC TGGCAATGGT CCTGTTCGTG 
GCCGCATCGG CGCTGCTCGT CGGGTCCAGC TTCATTGCTC TGGAACGCCT GCGGTCCGAT
CTTGCGGCGA CATCGTCGCT GGGCAAGGAG CGACTCGCCT ATCGGATGAT CTACGCTGCC
AGTCACCTGG AGCGAGCGCA AGGACCTGCA AGGGGGGCCG CTGCCGATGA GCTGCGCCAA
TTGATGGACC AGAACGAGCG GTTTCTCGAG CGGTTGGGAA GTGACCGCGA GAGGATCGGG
CTGGCGACAA CGAGCGATCC CGCTGCTTTC GCGCAGCTTG AACGAGTCCG GGAACAATGG
CGGGATGATG TCAGGCCAGC TCTTGAGAGC GCCATCGCCG AAGTTCCGCT CACTCACGCA
GCACTGGATG CGCTCGATCC CAAGGTCAGA GTCTTCGCGG AGCAGCTGGA AGATTTGATC
AATCCGATCG AACAGGCAGG AATCGCCCGA CTCAAGCGGT CCCAGTTGCT GCAACTCGGT
TTCTCGATCC TTTCCCTCCT GCTGCTCATC CACATATTGC GGGTTGTGCG CCGTCTTGCA
CGCCGCACGC GTGCACTCGC CGGCCTCGCG GAAAGGGTCA GCACCGGTGA CTTAGGACAG
AAAGCTGCCG TCGAGGGTAC CGACGAGCTC GCGGTGCTCG GCGCTTCCTT CAATGCGATG
ACCACAAGGC TTGCCGCGAT GATCGACAGC GAGCGGGGCA GCAGGGAAAG ATTGGAAGCG
TTGCTCGCCA CTATCTCGGA AACGGCACAA CACCTCTCAT CATCCGCTGC CGAGATTCTC
GCCGGCGCCA CGCAACAGGT CGAAGGGATG CGCGAACAGT CCTCGGCAGT TGCTCAAACG
GTCACGAGCG TTGATGAGGT ACTGCAAACC TCAGAACAGG CGGCGCAGCG CGCGCAGCAG
GTTGCCGCCT CCTATGACAA CGCCGTCAAG ATCAGCAATG AGGGCCGCAG AGCACTCGAC
GACACGGTGC GGGTGATGAA TGCGGTGAAC GCGCGGACGG AAGCGATCGC TGCAGATATT
CTGTCACTTG CGGAGAACAG CCTCGAGATC GGCGAGATCG TCTCGGTCGT CGCCGAGATC
GCGGACCAGA CCAACCTGCT GGCACTGAAT GCAGCAATCG AAGCATCGCG TGCCGGCGAG
CACGGCAGGG GTTTCAATGT CGTCGCTACG GAAATCCGTA CACTTGCCGA TCAGTCGAAA
TCCGCGACCG CCAGGGTCCG ACGCATCCTG ATGGAAATTC AGAAATCCAC GAACTCCGCG
GTCATTGGCG CGGAGGACGG GTCCAAGAGC GTCAGCCGTG CGCTCGAAAC GGTCAGTGAA
GCGGGCGAAA CCATCCGGCA ACTCGAGGCG ATCGTCGCCG ACTCTGCGCG ATCGGTGGCT
CAAATCGCCG CCTCGGCCGG TCAGCAACGC GCCGGCATGA AGCAGATTCA CGAGGCAATG
CATTATATCG AACAGACCAG CAGCCAGAAT CTATCGGCGA TACGGCAGGC CGAGGAAGCC
GCGAAGGATC TGAATGAGCT CGGCTCGAGG CTGAAGGAAA TGCTCACCGA CCACGGTAAC
GACCATGATA ACACCTGA
 
Protein sequence
MPRLANSVYG TLGLAMVLFV AASALLVGSS FIALERLRSD LAATSSLGKE RLAYRMIYAA 
SHLERAQGPA RGAAADELRQ LMDQNERFLE RLGSDRERIG LATTSDPAAF AQLERVREQW
RDDVRPALES AIAEVPLTHA ALDALDPKVR VFAEQLEDLI NPIEQAGIAR LKRSQLLQLG
FSILSLLLLI HILRVVRRLA RRTRALAGLA ERVSTGDLGQ KAAVEGTDEL AVLGASFNAM
TTRLAAMIDS ERGSRERLEA LLATISETAQ HLSSSAAEIL AGATQQVEGM REQSSAVAQT
VTSVDEVLQT SEQAAQRAQQ VAASYDNAVK ISNEGRRALD DTVRVMNAVN ARTEAIAADI
LSLAENSLEI GEIVSVVAEI ADQTNLLALN AAIEASRAGE HGRGFNVVAT EIRTLADQSK
SATARVRRIL MEIQKSTNSA VIGAEDGSKS VSRALETVSE AGETIRQLEA IVADSARSVA
QIAASAGQQR AGMKQIHEAM HYIEQTSSQN LSAIRQAEEA AKDLNELGSR LKEMLTDHGN
DHDNT
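
The length fields in this record are related by simple arithmetic: End bp - Start bp + 1 gives the gene length (70681 - 69044 + 1 = 1638), and an unspliced CDS that ends in a stop codon (here TGA) encodes gene length / 3 - 1 residues (1638 / 3 - 1 = 545). The following is a minimal sketch of those checks, assuming 1-based inclusive coordinates, which is consistent with the values above. The function names are hypothetical and are not part of any database API. The GC helper only shows how such a percentage can be computed from a sequence grouped in blocks of ten; it makes no claim about how the record's 61% figure was derived or rounded.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C, ignoring the whitespace used to group the sequence."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    n = gene_length(69044, 70681)
    print(n, protein_length(n))  # prints: 1638 545
```

To check the GC content field, the full gene sequence above can be passed to gc_percent as a single string; spaces and line breaks are ignored.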