Gene Smed_5721 details

Gene Information

Locus tag: Smed_5721
Symbol:
ID: 5320023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 692074
End bp: 693489
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 58%
IMG OID: 640777442
Product: aldehyde dehydrogenase
Protein accession: YP_001314374
Protein GI: 150377779
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
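
The coordinates and lengths in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(692074, 693489)
print(length_bp)                  # 1416
print(protein_length(length_bp))  # 471
```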


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCACATCA TCGATCGAAC CTACATCGAC GGCAAGTTCA TCGAGCCTCA TGGGGACGAA 
TGGGCGCCAC TGTTCAATCC GGCGACCGAA GATCAAATTG GTAGCGTGCG GCTCGCGGAC
GCGCAGGACG CAGACCGTGC GGTCGCCGCC GCCAAAAGGG CATTTTCGAC CTTCTCGAAT
ACGGCGAAGG AAGAGCGGAT CGACATGCTG AACCGCTTGC ATGCTGCGGT ATTGGCGCGA
ACCGACGATC TCGCCTGCGC CATGCGCGAA GAATATGGCG CGCCATCCAA TTTCATCGGC
TTTTCTGCAC CAAGGGCAGG CATGGTCTTT CTGGAGATGG CAAGAACGCT GAAAGACTAT
GAATTTCGAC GTCGGATGGG AAGCGCCGAC GTGGAGATGC GGCCCTCAGG GGTTGCCGTC
GCGATCACGC CCTGGAACAG CAATTATAGT TTTATCTGCG GCAAGCTATC TGCCGCCATT
GCTGCCGGCG CAACGATGGT GATCAAACCA TCGGAAAAGA GCGCTCTTCA GACCCAGGTG
ATCACCGAGT GCCTTCATGC TGCGGGCCTG CCCGATGGCA TATTCAATAT TGTTACCGGA
TACGGAAACG TCGTTGGCGC GGCTCTGACG ACCCACCGCG ACGTTTCGAA AATCACCTTC
ACAGGTTCCA CAGCGACCGG GCGCCGGATC GTCCAAACCG CCGCAGCCAC CATGAAGCGT
GTCACCATGG AACTCGGCGG CAAATGTCCG ACTGTCATTC TGGACGACGC CGATCTTGAG
GTCGTCATCC CGCAGGTGCT GGCAGCAGGG TTCCGAAACA GCGGGCAGGC ATGCATCGCC
GGAACTCGCA TCCTCGCTCC TGAAAGCAGA CTCAACGAGA TCGAGGAACG TCTCAGCCAC
ACGGTCAGGG AGTTCAGGGC AGGATACCCT GCAGACGACC AGGTCCAAGT AGGTCCAATG
GTAAGCCAAA AGCAGTGGGA TCGCGTACAA AGCTATATCC GATTAGGACA GGAAGAAGGT
GCCAGACTGC TTGCAGGCGG CGAAGGTCGG CCGGAAGGGC TTCACCGCGG CTGGTTCGTT
AAACCGACCA TCTTTACCGG CGTTCGCAAC GACATGCGCA TCGCCCGCGA AGAGATCTTT
GGACCGGTTC TCTCACTGAT CCCCTATAGG GATGAGGAAG ACGCCATCGC AATCGCAAAC
GATACCGATT ACGGCTTGCA GGCGCACGTC TTTTCCTCTG ATGTCGAGCG GGCGAAACGC
GTCGCGAACC AGATCGAAGC AGGCCGCGTG TTCATCAACG ATGCGCCGCA TGACCCGCTG
GCACCCTTTG GCGGCTTCAA GCAATCCGGT ATCGGGAGAG AATTCGGTGT GTTCGGACTC
GAGGCATTTC TCGAACCTCG CGCCGTACTG TCATGA
 
Protein sequence
MHIIDRTYID GKFIEPHGDE WAPLFNPATE DQIGSVRLAD AQDADRAVAA AKRAFSTFSN 
TAKEERIDML NRLHAAVLAR TDDLACAMRE EYGAPSNFIG FSAPRAGMVF LEMARTLKDY
EFRRRMGSAD VEMRPSGVAV AITPWNSNYS FICGKLSAAI AAGATMVIKP SEKSALQTQV
ITECLHAAGL PDGIFNIVTG YGNVVGAALT THRDVSKITF TGSTATGRRI VQTAAATMKR
VTMELGGKCP TVILDDADLE VVIPQVLAAG FRNSGQACIA GTRILAPESR LNEIEERLSH
TVREFRAGYP ADDQVQVGPM VSQKQWDRVQ SYIRLGQEEG ARLLAGGEGR PEGLHRGWFV
KPTIFTGVRN DMRIAREEIF GPVLSLIPYR DEEDAIAIAN DTDYGLQAHV FSSDVERAKR
VANQIEAGRV FINDAPHDPL APFGGFKQSG IGREFGVFGL EAFLEPRAVL S
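
The gene sequence starts with GTG while the protein starts with M. Under translation table 11 (the bacterial code), GTG is an accepted start codon and is read as methionine when it initiates the CDS. The sketch below is a minimal, dependency-free illustration of that rule, not the database's own translation pipeline. The names `CODON_TABLE`, `START_CODONS_T11`, and `translate_cds` are hypothetical, and the input is assumed to begin at the first base of a codon.

```python
# Codon-to-amino-acid assignments shared by NCBI translation tables 1 and 11,
# built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Start codons permitted by translation table 11.
START_CODONS_T11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    A table 11 start codon in the first position is emitted as 'M'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS_T11:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "GTGCACATCA TCGATCGAAC CTACATCGAC GGCAAGTTCA TCGAGCCTCA TGGGGACGAA"
print(translate_cds(first_line))  # MHIIDRTYIDGKFIEPHGDE
```

Passing the full gene sequence through `translate_cds` and comparing the result with the protein sequence listed above is a quick consistency check on the record. Because the first-codon rule assumes the input starts at the initiator codon, fragments taken from the middle of the gene that happen to begin with a start codon would be mistranslated at position one.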