Gene Smed_5076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5076 
Symbol 
ID5319378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp22476 
End bp23675 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content64% 
IMG OID640776856 
Productdiaminopropionate ammonia-lyase 
Protein accessionYP_001313788 
Protein GI150377193 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01747] diaminopropionate ammonia-lyase family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0463571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCTGC TCAATAACCA TCCCGAATAC AGGCAACCAC TCGATCCGGC CGATGCTGAA 
ACGCTCGGTG TTGCCGCAGC CAATAAGGTC GAGAGGTTTC TCTCATTCCG TGAAAATCAT
GCCGAGACGC CACTGGTGGC CCTGCCGCGG CTGGCGGCCG AGATCGGGGT TAGCGCGATT
CACGTCAAGG ACGAAGCTTA TCGTCTCGGG TTGGGGAGCT TCAAAGCATT GGGCGGCGCC
TACGCCGTGA TCCGGCTGCT TCTCGAGGAG GCAGGAGAAA GCCTCGGGCG CGCGGTTGAC
GTTTCCGAAC TGTATTCAGC CGAAGTCCGC CCGGTCGCGT GTTCCATGAC CTTTGCCTGC
GCAACGGACG GTAATCACGG TCGCTCGGTC GCCCAAGGCG CTCAGCTCGT CGGGGCCAAG
GCGGCGATCT TCGTACACGC CGGTGTGAGC AAGGAACGTG TCGCCGCGAT CGCCCGGTTC
GGGGCGGAGA TAATCGGGGT TGATGGCTCT TATGATGACT CCGTGCGCGA ATCCTCGCGC
GTCGCGGAGG CGAATGGCTG GACAGTCGTT TCGGACACCT CATGGCCGGG ATATGAGCGT
ATCCCGGGCC TGGTCATGCA GGGTTACGTG GCGCTTGTTC GCGAATCCTT GCGCCAAATG
CCGGAACCGC CGACGCATGT GTTCATTCAG TCGGGCGTTG GCGGAATTGC CGCGGCTGTG
GCTGGGCATC TGGCGGTCGA GCTTGGCGCC AGGCGTCCGA CCTTCACGGT GGTCGATCCT
GCCCGCGCAG CCTGCATCGT CGAGACGGCG CGCGCGGGAC GTCCGGTGAC TATTGCCCAT
GGCGAACCGA CCGTCATGGC GATGCTCGAA TGCAACACCC CCTCGCTGCT GGCCTGGCGC
ATTCTCGCGC GCGCTGCCGA TGCCTTCATG ACGGTGGACG AAGACGACGC AATTTCGGCC
ATGCGGCAGC TCGCCGATCC GGTGGCGGAT GATCCGGCGA TCGTGGCCGG CGAGAGCGGA
GGGGTTGGTC TCGCAGGGTT GCTGAAGGCG GCTTCCGACC CGGAGATGAG GGCTGCACTG
CGAATCGATG GACACTCGCG CATCTTCCTC GTCAACACCG AAGGTGCGAC CGACCCCGGC
AAATATGAGG AGATCGTCGG GGCTTCGCCG GCAGCGATCG CGACGAAGAC CAGGATGTGA
 
Protein sequence
MFLLNNHPEY RQPLDPADAE TLGVAAANKV ERFLSFRENH AETPLVALPR LAAEIGVSAI 
HVKDEAYRLG LGSFKALGGA YAVIRLLLEE AGESLGRAVD VSELYSAEVR PVACSMTFAC
ATDGNHGRSV AQGAQLVGAK AAIFVHAGVS KERVAAIARF GAEIIGVDGS YDDSVRESSR
VAEANGWTVV SDTSWPGYER IPGLVMQGYV ALVRESLRQM PEPPTHVFIQ SGVGGIAAAV
AGHLAVELGA RRPTFTVVDP ARAACIVETA RAGRPVTIAH GEPTVMAMLE CNTPSLLAWR
ILARAADAFM TVDEDDAISA MRQLADPVAD DPAIVAGESG GVGLAGLLKA ASDPEMRAAL
RIDGHSRIFL VNTEGATDPG KYEEIVGASP AAIATKTRM