Gene Smed_6141 details

Gene Information

Locus tag: Smed_6141
Symbol:
ID: 5320443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 1067101
End bp: 1068441
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 63%
IMG OID: 640777772
Product: aldehyde dehydrogenase
Protein accession: YP_001314704
Protein GI: 150378109
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
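
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal Python sketch. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration. The sketch assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming it ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1067101, 1068441)
    print(length)                  # 1341
    print(protein_length(length))  # 446
```

Both results agree with the Gene Length and Protein Length fields of this record.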


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTTCA AACATAACGC CGCCCGCCGC CATCGCATCG GCAGTATGAA ATTCAAGGTG 
ACGAATTGGC CGTCGCGCTC CGGCATCCTC GAGCGCCATG CGATCCTGAA GAAAACCGCC
GACGAAATCC TCGCCCGCAA GGACGAGCTG GGACGGCTGC TGTCGCGTGA GGAGGGCAAG
ACCCTCGCCG AGGGGATCGG CGAAACGGTC CGGGCCGGTC AGATCTTCGA ATTCTTCGCC
GGCGAAACTT TGCGCCTCGC TGGCGAGGTC GTCCCATCGG TCAGGCCGGG CATCGGCGTC
GAGATCACCC GCGAGCCGGT CGGCGTCGTT GGCATCATCA CGCCCTGGAA CTTCCCCATC
GCCATTCCCG CCTGGAAGGT CGCTCCGGCG CTCTGCTACG GCAACACCGT CGTCTTCAAG
CCGGCGGAAC TGGTACCGGG CTGTTCATGG GCGATCGTCG ATATCCTCCA TCGTGCCGGC
CTGCCAAAGG GCGTATTGAA CCTCGTCATG GGCAAGGGTT CGGTCGTAGG CCAGGCAATG
CTCGACAGCC CGGACGTTCA GGCGATAACC TTCACAGGCT CGACCGCAAC CGGAAAACGG
GTCGCCGTCG CCTCGGTCGA ACATAACCGC AAATACCAGT TGGAGATGGG AGGCAAGAAC
CCGTTCGTCG TGCTCGACGA CGCCGATCTT TCCGTTGCCG TCGAAGCGGC CGTCAATTCC
GCCTTTTTCT CGACCGGTCA GCGTTGCACC GCCTCGTCGC GCATCATTGT CACGGAGGGA
ATCCATGACC GGTTCGTCGC CGCCATGGGC GAGCGGATCA AGGGCCTCGT CGTCGACGAT
GCGCTGAAGG CCGGCACCCA TATCGGACCG GTGGTCGATC AGAGCCAGCT CAACCAGGAC
ACCGACTATA TCGCCATCGG CAAACAGGAA GGCGCGAAGC TCGCCTTCGG CGGCGAGCTG
ATCTCGCGCG ACACGCCCGG CTTCTATCTG CAGCCGGCGC TGTTCACCGA GGCGACCAAC
GAGATGCGCA TCTCGCGCGA GGAAATCTTC GGACCGGTCG CGGCCGTCAT CCGGGTCAAG
GATTACGACG AAGCGCTGGC CGTCGCCAAT GACACGCCCT TCGGTCTGTC TTCGGGTATC
GCCACCACCA GCTTGAAACA CGCGACGCAC TTCAAGCGCA ATGCTGAGGC CGGCATGGTG
ATGGTCAACC TGCCCACGGC GGGTGTCGAC TTCCACGTGC CGTTCGGCGG CCGCAAGGCT
TCCTCCTACG GTCCTCGCAA GCAGGGCAAA TACGCCGCTG AATTCTACAC CAACGTTAAG
ACAGCCTACA CGCTGGCGTG A
 
Protein sequence
MPFKHNAARR HRIGSMKFKV TNWPSRSGIL ERHAILKKTA DEILARKDEL GRLLSREEGK 
TLAEGIGETV RAGQIFEFFA GETLRLAGEV VPSVRPGIGV EITREPVGVV GIITPWNFPI
AIPAWKVAPA LCYGNTVVFK PAELVPGCSW AIVDILHRAG LPKGVLNLVM GKGSVVGQAM
LDSPDVQAIT FTGSTATGKR VAVASVEHNR KYQLEMGGKN PFVVLDDADL SVAVEAAVNS
AFFSTGQRCT ASSRIIVTEG IHDRFVAAMG ERIKGLVVDD ALKAGTHIGP VVDQSQLNQD
TDYIAIGKQE GAKLAFGGEL ISRDTPGFYL QPALFTEATN EMRISREEIF GPVAAVIRVK
DYDEALAVAN DTPFGLSSGI ATTSLKHATH FKRNAEAGMV MVNLPTAGVD FHVPFGGRKA
SSYGPRKQGK YAAEFYTNVK TAYTLA
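
The gene and protein sequences above can be cross-checked against each other with a short Python sketch. It is a sketch under stated assumptions, not the database's own method: `clean`, `translate`, and `gc_percent` are hypothetical helper names, and the codon table is built from the standard NCBI amino-acid string, which translation table 11 shares for internal codons. Alternative start codons permitted by table 11 are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as printed in this record.
    print(translate("ATGCCTTTCA AACATAACGC CGCCCGCCGC"))  # MPFKHNAARR
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) gives a quick consistency check; `gc_percent` on the same input can be compared with the GC content field.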