Gene Smed_5472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5472 
Symbol 
ID5319774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp441133 
End bp442053 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content60% 
IMG OID640777233 
Productdihydrodipicolinate synthetase 
Protein accessionYP_001314165 
Protein GI150377570 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID[TIGR00674] dihydrodipicolinate synthase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGATT CCACTGGACT TCGCGGGATT CTTCCGGCGT TGGTGACCCC CGTCAAATCG 
GACGACACGA TCGACACCAA AGCGACCGAT GCGCTTTTCA ACTGGCTGCA GAGGCAAGGC
GTCGACGGGG TCGTTCCGCT CGGCGGAACC GGCGAATACG GTGCGCTGTC GCGCGGTGAA
CGCATCCGCT TTGTCGAGCT ATCGGCCAAG GCATTCGGTG GAAAGGTGCC CGTCGTTCCC
GGAGTGCTCG ACCCCGGCTT CCACGACGCA ATGGAGTCCG CGCGCGATTT TGCTGCGGCA
GGCGCCGACG CGTTGCTTGT TATTACGCCG TACTACACAA ACCCGACCCA GGCTGGCATT
CGCGATTATT TTTTGCGCTA CGCGGATCAG TCTCCTGTGC CGATCCTGAT CTATGAAATT
CCCTATCGGA CGAGGATCGC GATCGATCCC GAGGTTCTGC ACCAACTCTC CGCTCACGAG
CGGATCATCG GTATGAAGGC ATGCAACACG GATATGTACC ACTACCTGCG GGTCATGGCA
GGACTGGCGC CTTCCTTTTC CATGCTCAGC GGCGAAGATT CGCTGTTTCC GTTCCATGTT
GCGGCTGGCG CCAAGGGCGG AATCGTAGTC ACTGCAAACC TGCTGCCGAA GGTATGGCGC
CGGCTCTTCG ACCTCGCCGA AAGCGGCAAC GCGGCCGACG CCCTGGCGCT GCATCGTGAA
TTGATTCCGT TCATGAACAT GGCGTTTGCC GAAACCAATC CAGGTCCGAT GAAGTCCGTG
ATGGACCTGA TCGGCGTGGA TGCGCCTCAC ATGCTCGCAC CGCTGCGCCA GCCCGCATCC
GAACTCCGGG ACGCGCTGCA CAAGGAATGT AGCCGCCTCC TCGAAAAGTA CGAACTGGAT
AACACCAAGC TTGCATCGTA G
 
Protein sequence
MLDSTGLRGI LPALVTPVKS DDTIDTKATD ALFNWLQRQG VDGVVPLGGT GEYGALSRGE 
RIRFVELSAK AFGGKVPVVP GVLDPGFHDA MESARDFAAA GADALLVITP YYTNPTQAGI
RDYFLRYADQ SPVPILIYEI PYRTRIAIDP EVLHQLSAHE RIIGMKACNT DMYHYLRVMA
GLAPSFSMLS GEDSLFPFHV AAGAKGGIVV TANLLPKVWR RLFDLAESGN AADALALHRE
LIPFMNMAFA ETNPGPMKSV MDLIGVDAPH MLAPLRQPAS ELRDALHKEC SRLLEKYELD
NTKLAS