Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_6141 |
Symbol | |
ID | 5320443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 1067101 |
End bp | 1068441 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640777772 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001314704 |
Protein GI | 150378109 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTCA AACATAACGC CGCCCGCCGC CATCGCATCG GCAGTATGAA ATTCAAGGTG ACGAATTGGC CGTCGCGCTC CGGCATCCTC GAGCGCCATG CGATCCTGAA GAAAACCGCC GACGAAATCC TCGCCCGCAA GGACGAGCTG GGACGGCTGC TGTCGCGTGA GGAGGGCAAG ACCCTCGCCG AGGGGATCGG CGAAACGGTC CGGGCCGGTC AGATCTTCGA ATTCTTCGCC GGCGAAACTT TGCGCCTCGC TGGCGAGGTC GTCCCATCGG TCAGGCCGGG CATCGGCGTC GAGATCACCC GCGAGCCGGT CGGCGTCGTT GGCATCATCA CGCCCTGGAA CTTCCCCATC GCCATTCCCG CCTGGAAGGT CGCTCCGGCG CTCTGCTACG GCAACACCGT CGTCTTCAAG CCGGCGGAAC TGGTACCGGG CTGTTCATGG GCGATCGTCG ATATCCTCCA TCGTGCCGGC CTGCCAAAGG GCGTATTGAA CCTCGTCATG GGCAAGGGTT CGGTCGTAGG CCAGGCAATG CTCGACAGCC CGGACGTTCA GGCGATAACC TTCACAGGCT CGACCGCAAC CGGAAAACGG GTCGCCGTCG CCTCGGTCGA ACATAACCGC AAATACCAGT TGGAGATGGG AGGCAAGAAC CCGTTCGTCG TGCTCGACGA CGCCGATCTT TCCGTTGCCG TCGAAGCGGC CGTCAATTCC GCCTTTTTCT CGACCGGTCA GCGTTGCACC GCCTCGTCGC GCATCATTGT CACGGAGGGA ATCCATGACC GGTTCGTCGC CGCCATGGGC GAGCGGATCA AGGGCCTCGT CGTCGACGAT GCGCTGAAGG CCGGCACCCA TATCGGACCG GTGGTCGATC AGAGCCAGCT CAACCAGGAC ACCGACTATA TCGCCATCGG CAAACAGGAA GGCGCGAAGC TCGCCTTCGG CGGCGAGCTG ATCTCGCGCG ACACGCCCGG CTTCTATCTG CAGCCGGCGC TGTTCACCGA GGCGACCAAC GAGATGCGCA TCTCGCGCGA GGAAATCTTC GGACCGGTCG CGGCCGTCAT CCGGGTCAAG GATTACGACG AAGCGCTGGC CGTCGCCAAT GACACGCCCT TCGGTCTGTC TTCGGGTATC GCCACCACCA GCTTGAAACA CGCGACGCAC TTCAAGCGCA ATGCTGAGGC CGGCATGGTG ATGGTCAACC TGCCCACGGC GGGTGTCGAC TTCCACGTGC CGTTCGGCGG CCGCAAGGCT TCCTCCTACG GTCCTCGCAA GCAGGGCAAA TACGCCGCTG AATTCTACAC CAACGTTAAG ACAGCCTACA CGCTGGCGTG A
|
Protein sequence | MPFKHNAARR HRIGSMKFKV TNWPSRSGIL ERHAILKKTA DEILARKDEL GRLLSREEGK TLAEGIGETV RAGQIFEFFA GETLRLAGEV VPSVRPGIGV EITREPVGVV GIITPWNFPI AIPAWKVAPA LCYGNTVVFK PAELVPGCSW AIVDILHRAG LPKGVLNLVM GKGSVVGQAM LDSPDVQAIT FTGSTATGKR VAVASVEHNR KYQLEMGGKN PFVVLDDADL SVAVEAAVNS AFFSTGQRCT ASSRIIVTEG IHDRFVAAMG ERIKGLVVDD ALKAGTHIGP VVDQSQLNQD TDYIAIGKQE GAKLAFGGEL ISRDTPGFYL QPALFTEATN EMRISREEIF GPVAAVIRVK DYDEALAVAN DTPFGLSSGI ATTSLKHATH FKRNAEAGMV MVNLPTAGVD FHVPFGGRKA SSYGPRKQGK YAAEFYTNVK TAYTLA
|
| |