Gene Information |
Locus tag | Smed_6251 |
Symbol | |
ID | 5320553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 1170983 |
End bp | 1172446 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640777851 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001314783 |
Protein GI | 150378188 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.392289 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGCT TCCAGTGTTA CATCAACGGT GAATTCGCAG ACGGCGAAGC CCGCTTCGAA AGCATCGATC CCACCACCGG CCGTGCCTGG GCAGAGATGC CAGAGGCACG GGAAGCGGAC GTCAATCGTG CCGTCGAGGC TGCACGTATC GCCCTTCACG ACCAGTCATG GTCCACACTG ACGGCCACGC AGAGAGGCAA GCTCCTCTAC AAGCTTGCCG ATCTCGTCGC TGAGAATGCC GGAAGACTTG CCGAGCTGGA GACCCGCGAC ACGGGCAAGA TCATCCGCGA GACCTCGTCG CAGATCGCCT ATGTCGCCGA CTACTATCGC TACTACGCCG GGATCGCCGA CAAGATTGAG GGCTCCTATC TGCCGATCGA CAAGCCCGAC ATGGATGTCT GGCTGCGCCG CGAGCCGATC GGCGTCGTGG CCATGGTCGT GCCGTGGAAC AGCCAGCTTT TCCTTTCGGC CGTAAAGATC GGTCCGGCAC TGGCGGCCGG CTGCACCATG GTGGTGAAGG CCTCGGAGGA CGGGCCGGCG CCGCTTCTCG AATTTGCCCG GCTGGTGCAT GCGGCGGGTT TTCCCGCCGG TGTCGTCAAC ATCGTCACCG GCTTCGGCCC ATCATGCGGC GCGGCGCTCA GCCGCCATCC GCAGGTCGAT CACATAGCCT TCACTGGCGG GCCGGAGACG GCGCGCCATA TAGTCCGCAA TTCAGCGGAA AATCTCGCCT CGACCTCGCT CGAGCTCGGC GGCAAATCGC CCTTTATCGT CTTTGCCGAC GCAGATCTTG AAAGTGCGGC CAATGCCCAG ATCGCCGGGA TCTTTGCCGC GACCGGGCAG AGCTGCGTGG CCGGCTCACG GCTGATCGTC GAAAAAAGTG TCAAGGACCG CTTCTTGCAG ATCCTAAAGG CCAAGGCAGA GACAATTCGC ATCGGCAGCC CGCTCGAGAT GTCGACGGAG GTGGGACCGC TCGCGACGGA GCGCCAGTGC AACCACGTCA AGGCCCTTAT CGCACGCTCG CTGGCTGCTG GCGCGAAGCT GGTGACCGGA GGCACAGCGC CGGAGGGCAC CGGGTTCTAT TATCGCCCGA CCATTCTCGA TTGCGACGGC AGCGCATCGC CGTCCCTCGA GAACGAATTC TTCGGTCCTG TGCTCTCGGT TCTGTCTTTC GAGACAGAAG CTGAAGCTCT CCATCTCGCC AACGGCTCCC GCTTCGGCCT TGCAGCCGGG GTCTTTACGC AGAATCTCAC CCGAGCGCAC CGCCTCATGA AGGGAATTCG CGCGGGAATC GTCTGGGTCA ATACCTATCG GGCGGTCTCC CCGGTCGCGC CCTTCGGCGG CTTCGGGCTC TCGGGTCACG GGCGTGAGGG CGGCCTGGAG GCGGCGCTCG ACTATACCCG GAGCAAGACC GTTTGGCTCA GGACGTCGGA CGATCCAATT CCTGATCCCT TCGTGATGCG GTGA
|
Protein sequence | MQRFQCYING EFADGEARFE SIDPTTGRAW AEMPEAREAD VNRAVEAARI ALHDQSWSTL TATQRGKLLY KLADLVAENA GRLAELETRD TGKIIRETSS QIAYVADYYR YYAGIADKIE GSYLPIDKPD MDVWLRREPI GVVAMVVPWN SQLFLSAVKI GPALAAGCTM VVKASEDGPA PLLEFARLVH AAGFPAGVVN IVTGFGPSCG AALSRHPQVD HIAFTGGPET ARHIVRNSAE NLASTSLELG GKSPFIVFAD ADLESAANAQ IAGIFAATGQ SCVAGSRLIV EKSVKDRFLQ ILKAKAETIR IGSPLEMSTE VGPLATERQC NHVKALIARS LAAGAKLVTG GTAPEGTGFY YRPTILDCDG SASPSLENEF FGPVLSVLSF ETEAEALHLA NGSRFGLAAG VFTQNLTRAH RLMKGIRAGI VWVNTYRAVS PVAPFGGFGL SGHGREGGLE AALDYTRSKT VWLRTSDDPI PDPFVMR
|
| |
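The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not part of the source database: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene sequence ends with a stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced here for illustration only.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1170983
END_BP = 1172446


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the final stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces between 10-base groups are ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1464, matching the Gene Length field
    print(protein_length(length))  # 487, matching the Protein Length field
```

To check the GC content field, paste the full Gene sequence value into `gc_percent`; the grouped, space-separated layout used in the record can be passed as-is.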