Gene Msil_1476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1476 
Symbol 
ID7091819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1594630 
End bp1596144 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content64% 
IMG OID643464810 
Product2-hydroxymuconic semialdehyde dehydrogenase 
Protein accessionYP_002361796 
Protein GI217977649 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGTTC AGGCATTGCA GGGGCTGGCG CAAATTCGCG TCGATACAGC GCGCCGCGAC 
GCGGCGCATT TCATCAATGG CGAGTTCACG CAGGGATTGA CCAGCAAGGG CTGGTGGGAG
AACCGCTCGC CGCTCGACAA CAGCGTGATA GGCCGCGTGC CGGAGGGCGG CCAGGCGGAG
GTTGACGCGG CGGTGCACGC CGCCCGTGCG GCGCTCGACG GCACCTGGGG CAAGATGACG
GTGGCCCAGC GGACCGATCT GCTCGCCGCA GTTGCCAACG AAATCGACGC CCGCTTTGAT
GAATTCCTCG CCGCCGAATG TCTCGATACC GGCAAGCCCT ATAGCCTCGC CTCGCATATC
GATATTCCGC GCGGCGCCGC CAATTTCAAG ATGTTCGCCG ATACGGTGAA GAACGTCTCG
ACCGAAACAT TCATCCTCGA CACGCCGGAC GGCAAGAGCG CCGTCAATTA CGGCCTCCGC
CGACCCAAGG GCCTGATCGC GGTAATTTCG CCGTGGAACC TCCCGCTGCT GCTCATGACC
TGGAAAGTCG GCCCGGCGCT CGCCTGCGGC AACACAGTTG TGGTGAAGCC GTCGGAAGAA
ACGCCGTTGA CTGCGACGCT GCTCGGCGAA GTGATGAACA AGGTCGGCGT GCCGAAGGGC
GTCTATAACG TCGTCCACGG CCTCGGCCCG AATTCCGCCG GCGAGTTCCT AACCCAGCAT
CCGCTGGTCA ACGGCATCAC TTTCACCGGC GAGACCCGGA CCGGCGAGGC GATCATGCGC
CAGGCGGCGC TCGGCGTGCG GCAGGTGTCG TTCGAATTGG GCGGCAAGAA TCCGGCGATT
GTGTTCGCGG ATTGCGATCT GGACAAGGCG ATCGAGGGCA CGATGCGCTC GGCTTTCGCT
AACTGCGGCC AAGTTTGTCT GGGCACCGAG CGCGTCTATG TCGAGCGCCC GATCTTCGAC
TCCTTCGTGG CCCGCATGAA GGGGGACGCC GAGGCCTTGA GGCTCGGACG GCCGGAAGAC
GGCGCGACTA ATCTTGGCCC ACTGATCAGT CAGGAACATC GCAGCAAGGT GCTGTCCTAT
TACAAGTTGG CCCTGGAGGA AGGCGCGACC CTTGTCACGG GCGGTGGCGT TCCCGACATG
CCCGGCGAGC TTGCGCTTGG CGCCTGGGTT CAGCCGACCA TCTGGACCGG ACTCAAGGAT
GACGCCCGCG CCGTCAACGA GGAGATTTTC GGGCCGTGCT GCCATATCCG CCCGTTTGAC
ACGGAAGAAG AGGCGGTCAG GCTAGCCAAT TCCACGCCCT ACGGACTCGC CGCGGCGGTG
TGGACGGAGA ACGTCTCCCG CGCCCATCGC GTCGCGTCGA AAATGGATGT CGGCATCTGC
TGGGTAAACT CCTGGTTTCT GCGCGACCTG CGCACGGCCT TCGGCGGCGC CAAGCAGTCC
GGCATCGGCC GGGAGGGCGG CCTTCACTCG CTGGAGTTCT ATACCGAACT CTCCAACGTC
TGCATCAAGC TTTGA
 
Protein sequence
MSVQALQGLA QIRVDTARRD AAHFINGEFT QGLTSKGWWE NRSPLDNSVI GRVPEGGQAE 
VDAAVHAARA ALDGTWGKMT VAQRTDLLAA VANEIDARFD EFLAAECLDT GKPYSLASHI
DIPRGAANFK MFADTVKNVS TETFILDTPD GKSAVNYGLR RPKGLIAVIS PWNLPLLLMT
WKVGPALACG NTVVVKPSEE TPLTATLLGE VMNKVGVPKG VYNVVHGLGP NSAGEFLTQH
PLVNGITFTG ETRTGEAIMR QAALGVRQVS FELGGKNPAI VFADCDLDKA IEGTMRSAFA
NCGQVCLGTE RVYVERPIFD SFVARMKGDA EALRLGRPED GATNLGPLIS QEHRSKVLSY
YKLALEEGAT LVTGGGVPDM PGELALGAWV QPTIWTGLKD DARAVNEEIF GPCCHIRPFD
TEEEAVRLAN STPYGLAAAV WTENVSRAHR VASKMDVGIC WVNSWFLRDL RTAFGGAKQS
GIGREGGLHS LEFYTELSNV CIKL