Gene Smed_5321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5321 
Symbol 
ID5319623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp279301 
End bp280554 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content63% 
IMG OID640777095 
Productputative signal transduction histidine kinase 
Protein accessionYP_001314027 
Protein GI150377432 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0967984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCACA TGGCGCCGGC AGCACTGGTC GATCACTATC TCGGTATTTC GCGGCTGTTA 
GCCGGTCAGC TCGATTTTCG TTCAGCGATC CGGGCCGTCG CGGCCGAGAT TGCGCACATC
ATTCCGCACG ACCATCTGGA CGTGTGCATT TTGATCGTGG ACGGCAACTA TCACACGGCG
TACGAAACCG GCATGGACAC CGCTTGGGGA AACGCCGCCT CGGCGCCGGT CGTCAACAGT
CCGATACGCA GCCTTCTCTG GGGGGAAGTA GATTATCTGC TGACGGACGA TGCGATCAAC
GATGCGCGGT TTCACTTTGA GGGCGCCTTC AAACGGCCGA TCATCGAACA GTCCCTGCGC
AGCCGGCTGC ACGTGCCGCT GAAGGTTCAG GGCGCGATCA TCGCAGCGCT CAGTTGTTCG
TCGCAGAGTC CGGGCGTCTA TGGCATGGAG CACGTCGACC GGGCACGCAT CATTGCCGAT
CTCCTGGCAC CCTATTTCTT CGCGCTGCGT GCCGCCGAAC AGGCGCAGCA GTCGGCCATA
GTCGAGGCGG AAGCGCGGGC ACGCGAGGAA GGCCTGCGGC AAGGGGCGCT GAAGTTGACG
GAGGCGCTGG AGCAGGAGCG TCAGCGAATC GGAATGGACC TCCATGACCA GACGCTAGCG
GATCTTACCC GGCTTGCGCG CCGCGTCGAT CGGCTTGCGC GTTCCGGGGA ACTGACCAGT
GAAGCGCTGG AGCCGGTGTC GCGCGGGCTG CAACACTGCA TGCAGGATCT CAGGCAGATC
ATCGAGCAGG CGAAACCATC TGTTCTCCAG CTTTTCGGCC TCGCGCAGGC TATCGAGAAC
CATCTCGACC GGTCGGTTCG CGATAGCAAC ACGCCGGTTG AATGGGCGAT CGTCGACGAG
ACCGCAGGCG CCCTCGACAC ACTCGAACCG ACCGTCAGCG TTGCGCTCTT CCGGATCGCC
CAGGAAGCGA TCAACAATGC GGTCCGTCAC GCCCAGCCGC TCGCAATCAC CGTTCGGCTT
CGGGCCGAAG CGAAGCAGCT TGCGTTGGAG ATAACGGACG ACGGGCGCGG CCTTGCGCGA
TCTCGCGGTC GCGTCGGCGG CGGCATTGAC AACATGAAGA CGCGCGCGCG GCTAATCTCG
GCGAAGTTCG TGATCGGCCC CGGACGCAAT AACCGCGGAA CGACGGTCAC CGTCTCGTTG
CCGCTCGAGC GGGATGCGGA AATTGCAGCA ATGGGCCAGG AGGATCGGCA ATGA
 
Protein sequence
MPHMAPAALV DHYLGISRLL AGQLDFRSAI RAVAAEIAHI IPHDHLDVCI LIVDGNYHTA 
YETGMDTAWG NAASAPVVNS PIRSLLWGEV DYLLTDDAIN DARFHFEGAF KRPIIEQSLR
SRLHVPLKVQ GAIIAALSCS SQSPGVYGME HVDRARIIAD LLAPYFFALR AAEQAQQSAI
VEAEARAREE GLRQGALKLT EALEQERQRI GMDLHDQTLA DLTRLARRVD RLARSGELTS
EALEPVSRGL QHCMQDLRQI IEQAKPSVLQ LFGLAQAIEN HLDRSVRDSN TPVEWAIVDE
TAGALDTLEP TVSVALFRIA QEAINNAVRH AQPLAITVRL RAEAKQLALE ITDDGRGLAR
SRGRVGGGID NMKTRARLIS AKFVIGPGRN NRGTTVTVSL PLERDAEIAA MGQEDRQ