Gene Smed_1089 details


Gene Information

Locus tag: Smed_1089
Symbol:
ID: 5321935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1156651
End bp: 1157793
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 61%
IMG OID: 640790030
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_001326775
Protein GI: 150396308
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID: [TIGR00229] PAS domain S-box
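
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record, but both agree with the reported values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Smed_1089.
start_bp, end_bp = 1156651, 1157793
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1143 380
```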


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGAGA AGGCACCGGC TGACGGCGCA AACGATCTTT CCATGGCCGT GCTGAACGCG 
ATTCAGAATC CGGTCATCCT GGTGGATGAG AACGGCTTTG TCGCCTTTGC GAACTGGGAG
GCGGAATCCT TCTTCGGTGC CAGCGCCAAC CATCTTGCGC GGCATGACAT CAGCGCGTTC
ATTCCCTTCG GCAGCCCGCT CCTGACGCTG ATCGATCAGG TTCGCGAGCG GCGCGCTGCG
GTGAACGAGT ACCGGGTAGA CCTCAGCTCT CCGCGCCTCG GGGCCGACAA GCTCGTCGAT
CTCTACGTGG CCCCGGTACT CTCGCAACCC GGATCGGTGG TGGTCGTCTT TCAAGAGCGG
TCGATGGCGG ACAAGATCGA TCGCCAGCTG ACACATCGGA CGGCGGCGCG ATCCGTCACG
GGGCTTGCTT CGATGCTGGC GCACGAAATC AAAAACCCTC TTTCCGGCAT TCGCGGCGCC
GCGCAGCTCC TCGAAACCTC CGTCAACGAC GAAGACAGGG CGCTCACAAG ATTGATCTGT
GACGAGACCG ACCGCATCGT CTCGCTCGTG GACCGGATGG AGGTCTTCTC CGACGAGCGT
CCCGTCGACC GCGTGCCCCT TAACATTCAT GCGATACTCG ACCACGTCAA GGCAATCGCG
AAGGCCGGCT TTGCCCGGCG GATCAAGATC TCCGAACATT ATGACCCGTC GCTTCCGCCC
GTTTTCGCAA ATCGCGACCA GCTGGTTCAG GTGTTCCTCA ATCTGGTAAA GAATGCGGCC
GAGGCGATCG GCGACAGGGC GGACGGCGAA ATTCTGCTGA CGACGGCCTA TAGGCCGGGC
ATTCGCCTCT CGGTCGCCGG TACGCGCGAA AAAATCTCGC TCCCGCTGGA ATTCTGCGTG
CATGACAACG GGCCGGGTGT ACCCGCCGAT CTTCTGCCGC ATCTTTTCGA CCCTTTCATC
ACCACCAAGA CGAACGGGTC CGGCCTCGGT CTCGCGCTTG TGGCGAAGAT CATCGGCGGC
CATGGCGGCA TCGTCGAATG CGACAGCCAG CATAGCCGCA CGATATTCCG CGTTCTGATG
CCGGCGTCCA AGGGCCCAGC GGCAGATGAC GAAACTCCGA TGACAAAAGG AACCAATGGA
TGA
 
Protein sequence
MTEKAPADGA NDLSMAVLNA IQNPVILVDE NGFVAFANWE AESFFGASAN HLARHDISAF 
IPFGSPLLTL IDQVRERRAA VNEYRVDLSS PRLGADKLVD LYVAPVLSQP GSVVVVFQER
SMADKIDRQL THRTAARSVT GLASMLAHEI KNPLSGIRGA AQLLETSVND EDRALTRLIC
DETDRIVSLV DRMEVFSDER PVDRVPLNIH AILDHVKAIA KAGFARRIKI SEHYDPSLPP
VFANRDQLVQ VFLNLVKNAA EAIGDRADGE ILLTTAYRPG IRLSVAGTRE KISLPLEFCV
HDNGPGVPAD LLPHLFDPFI TTKTNGSGLG LALVAKIIGG HGGIVECDSQ HSRTIFRVLM
PASKGPAADD ETPMTKGTNG
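
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence with translation table 11 and by recomputing the GC fraction. The following is a minimal sketch using only the Python standard library. The helper names (`clean`, `gc_content`, `translate`) are illustrative, and the codon table is built from the usual compact amino-acid string for table 11 with bases ordered T, C, A, G. It does not handle alternative start codons, which table 11 permits, because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in the record.
prefix = "ATGACAGAGA AGGCACCGGC TGACGGCGCA"
print(translate(prefix))  # MTEKAPADGA
```

Passing the full gene sequence to `translate` and `gc_content` should allow a comparison against the listed protein sequence and the reported GC content of 61%.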