Gene Information |
Locus tag | Msil_1476 |
Symbol | |
ID | 7091819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 1594630 |
End bp | 1596144 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643464810 |
Product | 2-hydroxymuconic semialdehyde dehydrogenase |
Protein accession | YP_002361796 |
Protein GI | 217977649 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase |
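
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The record does not state either convention, but the listed values are consistent with both. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above. Treating them as 1-based and
# inclusive is an assumption; the listed lengths are consistent with it.
START_BP = 1594630
END_BP = 1596144


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")                  # 1515 bp
    print(protein_length(length), "aa")  # 504 aa
```

Both results agree with the Gene Length and Protein Length rows.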
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 83 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGTTC AGGCATTGCA GGGGCTGGCG CAAATTCGCG TCGATACAGC GCGCCGCGAC GCGGCGCATT TCATCAATGG CGAGTTCACG CAGGGATTGA CCAGCAAGGG CTGGTGGGAG AACCGCTCGC CGCTCGACAA CAGCGTGATA GGCCGCGTGC CGGAGGGCGG CCAGGCGGAG GTTGACGCGG CGGTGCACGC CGCCCGTGCG GCGCTCGACG GCACCTGGGG CAAGATGACG GTGGCCCAGC GGACCGATCT GCTCGCCGCA GTTGCCAACG AAATCGACGC CCGCTTTGAT GAATTCCTCG CCGCCGAATG TCTCGATACC GGCAAGCCCT ATAGCCTCGC CTCGCATATC GATATTCCGC GCGGCGCCGC CAATTTCAAG ATGTTCGCCG ATACGGTGAA GAACGTCTCG ACCGAAACAT TCATCCTCGA CACGCCGGAC GGCAAGAGCG CCGTCAATTA CGGCCTCCGC CGACCCAAGG GCCTGATCGC GGTAATTTCG CCGTGGAACC TCCCGCTGCT GCTCATGACC TGGAAAGTCG GCCCGGCGCT CGCCTGCGGC AACACAGTTG TGGTGAAGCC GTCGGAAGAA ACGCCGTTGA CTGCGACGCT GCTCGGCGAA GTGATGAACA AGGTCGGCGT GCCGAAGGGC GTCTATAACG TCGTCCACGG CCTCGGCCCG AATTCCGCCG GCGAGTTCCT AACCCAGCAT CCGCTGGTCA ACGGCATCAC TTTCACCGGC GAGACCCGGA CCGGCGAGGC GATCATGCGC CAGGCGGCGC TCGGCGTGCG GCAGGTGTCG TTCGAATTGG GCGGCAAGAA TCCGGCGATT GTGTTCGCGG ATTGCGATCT GGACAAGGCG ATCGAGGGCA CGATGCGCTC GGCTTTCGCT AACTGCGGCC AAGTTTGTCT GGGCACCGAG CGCGTCTATG TCGAGCGCCC GATCTTCGAC TCCTTCGTGG CCCGCATGAA GGGGGACGCC GAGGCCTTGA GGCTCGGACG GCCGGAAGAC GGCGCGACTA ATCTTGGCCC ACTGATCAGT CAGGAACATC GCAGCAAGGT GCTGTCCTAT TACAAGTTGG CCCTGGAGGA AGGCGCGACC CTTGTCACGG GCGGTGGCGT TCCCGACATG CCCGGCGAGC TTGCGCTTGG CGCCTGGGTT CAGCCGACCA TCTGGACCGG ACTCAAGGAT GACGCCCGCG CCGTCAACGA GGAGATTTTC GGGCCGTGCT GCCATATCCG CCCGTTTGAC ACGGAAGAAG AGGCGGTCAG GCTAGCCAAT TCCACGCCCT ACGGACTCGC CGCGGCGGTG TGGACGGAGA ACGTCTCCCG CGCCCATCGC GTCGCGTCGA AAATGGATGT CGGCATCTGC TGGGTAAACT CCTGGTTTCT GCGCGACCTG CGCACGGCCT TCGGCGGCGC CAAGCAGTCC GGCATCGGCC GGGAGGGCGG CCTTCACTCG CTGGAGTTCT ATACCGAACT CTCCAACGTC TGCATCAAGC TTTGA
|
Protein sequence | MSVQALQGLA QIRVDTARRD AAHFINGEFT QGLTSKGWWE NRSPLDNSVI GRVPEGGQAE VDAAVHAARA ALDGTWGKMT VAQRTDLLAA VANEIDARFD EFLAAECLDT GKPYSLASHI DIPRGAANFK MFADTVKNVS TETFILDTPD GKSAVNYGLR RPKGLIAVIS PWNLPLLLMT WKVGPALACG NTVVVKPSEE TPLTATLLGE VMNKVGVPKG VYNVVHGLGP NSAGEFLTQH PLVNGITFTG ETRTGEAIMR QAALGVRQVS FELGGKNPAI VFADCDLDKA IEGTMRSAFA NCGQVCLGTE RVYVERPIFD SFVARMKGDA EALRLGRPED GATNLGPLIS QEHRSKVLSY YKLALEEGAT LVTGGGVPDM PGELALGAWV QPTIWTGLKD DARAVNEEIF GPCCHIRPFD TEEEAVRLAN STPYGLAAAV WTENVSRAHR VASKMDVGIC WVNSWFLRDL RTAFGGAKQS GIGREGGLHS LEFYTELSNV CIKL
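
The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that formatting, compute the GC fraction, and translate the gene sequence so the result can be compared with the listed protein. The helper names are hypothetical. The codon table is built from the standard genetic code, whose codon assignments translation table 11 shares; table 11 differs only in the set of permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 uses the same codon
# assignments and differs only in its alternative start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop.

    Raises KeyError on ambiguous bases such as N; this sketch does not
    handle them.
    """
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 21 bases of the gene sequence, with the original block spacing.
    print(translate("ATGTCGGTTC AGGCATTGCA G"))  # MSVQALQ
```

Passing the full gene sequence to `translate` and the cleaned protein sequence to a string comparison is a quick way to confirm that the two fields match. `gc_fraction` can be checked the same way against the GC content row.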