Gene Smed_4060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4060 
Symbol 
ID5318883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp522359 
End bp524017 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content60% 
IMG OID640775867 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001312800 
Protein GI150376204 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.666301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATCA CCCGGAATGA CGCGACTTTG CGTGCGTATC GTCCCGCCAA GGAATTTGCC 
GGGGCGGCGC GTGAATGGTT GTTGCGTACC GACCCGGAAC GCCTGATTTG CCTGGGTCGC
GTGATCACCG CAGTCTTTGC GATACTGGCG ATCTATCTCG ATCCCACCCG TCCGAACTCG
ATGCTTTACG AGTCTCGTCT TGTTCTCGGC CTCTACCTTC TCCTCGCGGT TGCCCTGGTG
TTTTTTCCAC TGCGGTTTTC GTTCGTCAGC CCCGTGCACC TGCTGATCCA CGGCGTCGAC
GCCGCCGTCG TCGGTTGGCT GACCTTTCTG ACGAACGAAC TGGCGAGCCC GTTCTTCTCT
ATTCTGCCTT TCGTGATCCT TGCCATGACG ATGCGCTGGG GGCTCAAGGG TGCCTCGCTG
GGGGCGCTGA TCGCGCTGAT CGTTCAGCTC GTGGTGGGGC TTCCCGATCT CCTGGATGGG
GACGCCGAGC TCAACATATT CATCATGCGT TCGATCTATT TTGTGCTTAT CGCGGCGACG
CTCGGCTATT TCGGGGCGTA TCGGGAACGA AGCCGGCAGC GCCTTGCGCA GCTCGCGCAA
TGGCCGTCTG GCGCGATCGG CGAGGACCGC CTGTCCTGGT TGAGCATCGT CCTGCAGCAT
GCTTCCGGCG TTCTGGGAGA TGCGCATCTG CTCGTCATCT GGCGCGAACA GGAGTTCGAG
TCCGGATGCG CCGCATTCTG GACGAGTGGT CGGCTCCAAT TGGCTGACCT GAGGGATCCT
GAATTCTGGC GGCGCCATGA TCCCGATGGC TGCGACGAAC GCCATTCGAG GAGCGGTGAG
GCCCTGAACG GCCTTTTCGC CGATCTGCCC CAGATCCACG CGAATGCCGG CCGGCCAAAT
TGCAAGGTGG TTTCCGCGGC CTTTTCGAGC CTCCGTTACC GGGGCCGCGT TTTCGTTATA
AGCTACGCAA ATTCAACCGA CGACATGAAA GACCTGACTC AGATCATCGC AACGCGCGTC
GGGACGGAGC TCGAGCGTGT CGCTCTCATC CAGGCTGCCC GTGCCGAAGG GCGAATGCGA
CTCGCTCGCG ACCTGCATGA CAGCGTGCTT CAGAATCTCA CGGCCGCGCG CCTCAAACTG
AAGCTCATCG GTGAAGCTTT CCCCGATGGC GCAAGGCAGA AGCTGATGGA GGTGGGTTCG
CTCATTCTCG AGCAACAGCA ATGCGTGCGC AAATTCGTCG ATGAGAACCG GCCCGGAGAG
GAGGGCAATC TCGCGAGGCT CGATCAGGAC CTGCCGGAAT TTCTCGACCT TCTGCGAATG
CAGTGGAGTT GCAGCATCGA CGTTTCGATC GGATCGCCTG GAATGATGGT CCCGCGATGG
ATGCTTTTCG AGATAATGCA GCTGATTTCC GAGGCTGTCG CCAATGCAGT GCGCCATGGA
CGAGCCACAG TGGTGCGGAT CGGCTTTATC GGGAGCGCAG GCCTTCTGGA ACTGGATATT
TCCGATAACG GCACAGGAAT AGCGGATGGA CTCACGTCCA AGAAGCCCTT CTCGCTGTCG
CAACGTATTG CGGAACTCGG CGGCAGTCTG GCAATTTGCC GGAGTTCGCC GGGGATCGGT
CTGACGATCA CGCTGCCGCT AAAGCCGGGG CTCAGATGA
 
Protein sequence
MAITRNDATL RAYRPAKEFA GAAREWLLRT DPERLICLGR VITAVFAILA IYLDPTRPNS 
MLYESRLVLG LYLLLAVALV FFPLRFSFVS PVHLLIHGVD AAVVGWLTFL TNELASPFFS
ILPFVILAMT MRWGLKGASL GALIALIVQL VVGLPDLLDG DAELNIFIMR SIYFVLIAAT
LGYFGAYRER SRQRLAQLAQ WPSGAIGEDR LSWLSIVLQH ASGVLGDAHL LVIWREQEFE
SGCAAFWTSG RLQLADLRDP EFWRRHDPDG CDERHSRSGE ALNGLFADLP QIHANAGRPN
CKVVSAAFSS LRYRGRVFVI SYANSTDDMK DLTQIIATRV GTELERVALI QAARAEGRMR
LARDLHDSVL QNLTAARLKL KLIGEAFPDG ARQKLMEVGS LILEQQQCVR KFVDENRPGE
EGNLARLDQD LPEFLDLLRM QWSCSIDVSI GSPGMMVPRW MLFEIMQLIS EAVANAVRHG
RATVVRIGFI GSAGLLELDI SDNGTGIADG LTSKKPFSLS QRIAELGGSL AICRSSPGIG
LTITLPLKPG LR