Gene Smed_5456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5456 
Symbol 
ID5319758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp422750 
End bp423856 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content65% 
IMG OID640777218 
Productalanine racemase 
Protein accessionYP_001314150 
Protein GI150377555 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000537083 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.747323 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAGGAA ATTTCCCCGG GGCTCCGAAA AGCAAGGCGA CGATCGCGGA TGTTGCACGC 
ACAGCAGGCG TTTCCACGGC GACCGCGGGC CGCGTTCTTG GCGGCTACGG CTATACTAGC
GAAAAGAAGA GGGAGCAGGT GCTCAAGGCC GCCCAGGATC TCGGCTACCG GCCTAATTCG
CTCGCGCGCA GCCTCATCAC CGGCAAAACC CGCACGCTCG GCGTCGTTGC CGGCGATATC
CAGAACCCGT TCTACGCCTC GGTGCTGCGC GGTATTTCCA ATGTCGCGGA GGCCAACGGC
TTCGGCCTGC TGATCACCAA CAGCGACGAA ACACAGCTCA AAGAGGTTCA TTCGGTCGAG
TTGTTGGCGC AGAAGCAGGT GGACGGACTG ATCGTCACCC CCAGCGACAC GCGCAAGGCG
CGGCACCTGC ACAATCTGCG GACCGTGGGC GTCCCGCTCG TTCTTATCGA CCGTGCGGTC
GCCGGCCTGA TGGTGGACCG CGTAGCAACA GACAACATCG CCGCCGCCGA ACATGCGGTA
CGCCAGTTGA TTGCAGCCGG ACACCGCCGG ATCGCCATCG TGGCGGAACT CGTCGACGAA
GGAAGCGGCG GGTTGGATAC ATTCCTCGCC CGCGCCGTGG CGGGCGATCC GATCGAGACC
GATACGCTCT ATCCGAGCTG GCAGCGCCTC CTCGGCTATA TCCGGGCGCA TAGAATCGAG
GGCCTGCCCG TCGACCAGCG CCTGATACTG CAAGCGGGCA GCTATTCCGC GCTCGCGGCG
CAAGCGGTCG TCCCGCGCCT GATGATAGCG TCGGACCCGC CAACGGCGCT GTTCACCACG
GACGGCACAA TGTCCGAAGG CGCCATGCGG GCGCTCACGG AGCTGAAGCT TTCGATCCCG
CAGGATCTCT CTATCATCTG CTTCGACGAT CTCGACTGGA TGAGTTTCCA CCGCCCCGGC
ATCACCACCG TGGCACAGCC GCGTCTCGCC ATGGGCGAAG CCGCCGCGCG GATGCTGCTT
GAGCGCATTC GCGGCGAGGA CTATCCGCCT CGCACGGTGT TGATGCCCGC CGAACTGATC
GAACGCGGCT CCGTCGCCCG GCTGTAA
 
Protein sequence
MRGNFPGAPK SKATIADVAR TAGVSTATAG RVLGGYGYTS EKKREQVLKA AQDLGYRPNS 
LARSLITGKT RTLGVVAGDI QNPFYASVLR GISNVAEANG FGLLITNSDE TQLKEVHSVE
LLAQKQVDGL IVTPSDTRKA RHLHNLRTVG VPLVLIDRAV AGLMVDRVAT DNIAAAEHAV
RQLIAAGHRR IAIVAELVDE GSGGLDTFLA RAVAGDPIET DTLYPSWQRL LGYIRAHRIE
GLPVDQRLIL QAGSYSALAA QAVVPRLMIA SDPPTALFTT DGTMSEGAMR ALTELKLSIP
QDLSIICFDD LDWMSFHRPG ITTVAQPRLA MGEAAARMLL ERIRGEDYPP RTVLMPAELI
ERGSVARL