Gene Msil_3881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3881 
Symbol 
ID7092577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4258811 
End bp4260334 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content65% 
IMG OID643467166 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002364125 
Protein GI217979978 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.159111 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTAC GAGCCAATGT TTCCGCCGCA GCGGCGGTCA TTCCGGATGA GATCCGCAAT 
TGGCTATCCG GTCCGCGGCC AATGTTGATC GACGGCAAAT GGGTCAAATC GGTCTCTGGC
AAGACGTTCG ACGTATTCGA TCCGGCGACG GGGGAAAAGA TCGCTTCGGT CGCGGAGGGC
GACGCCGCTG ACGTCGACCT CGCGGTGGCG GCTGCGCGCC GCGCGTTTGA AAGCGGCCCC
TGGTCGCGCA TGACGCCCTC CGAACGCGGC CGCATTATCC ATCGCATCGG CGATCTCATC
CTCGACCATG CCGATGAACT GGCCGCGATC GAATCGCTCG ACAATGGCAA GCCCAAAGCT
GTCGCGAAGG CCGCAGACGT TACTTTGTCA GCCGACATGT TCCACTATAT GTCTGGCTGG
GCGACCAAGC TTGAGGGCAA GCATATCCCG ATTTCGGCGC TGACCGCCCC GGGCATGGAA
TTCGTATCGA TGACGCGGCT CGAGCCGATC GGCGTCGTGG GCCAGATCAT TCCATGGAAC
TTCCCGCTTC TGATGGCGGC GTGGAAGCTG GCCCCGGCCC TGACCACCGG CTGCGCGGTC
GTGCTCAAGA TCGCCGAGGA GACGCCGCTT TCGGCGCTGC GACTTGGCGA GCTTCTGATC
GAGGCTGGCG TTCCGGACGG CGTCGTCAAT ATTGTCCCGG GCTTTGGCGA AACGGCGGGG
GCGGCGCTGG CCGGCCATCC GGGAGTCGAC AAGGTCGCCT TCACCGGCTC AACGGAAGTG
GGCCGCCTGA TCGTTCAGGC CGCCTCGCGC GATCTCAAGA AAGTCTCGCT GGAGCTTGGC
GGTAAATCGC CGAACATCGT CCTTGGCGAC GCCGATCCGG AGATGGCGAT CGCCGGCGCG
ACCGCCGCGA TCTTCTTCAA TCATGGTCAG TGCTGCAACG CCGGTTCGCG GCTGTTCGTG
CAGCGCAATC TGTTCGACAA GGTCGTCGAG GGCATCGCGG CGCAGGCGGA AAAGATCAAA
TTGGGCCACG GCCTCAACGC GGAAACGGAG ATGGGCCCGC TGGTGTCGCG CGTCCAATAT
GACCGCGTCA CGGGCTTGCT CGCCTCGGGC CGTCAGGAAG GCGCCCGCGC CGTCTGCGGC
GGGGAAGGGC TTGGCGGCGC CGGTTATTTC GTGCCCCCGA CGGTCCTCGT CGACACAAAC
CCGGGCATGC GGGTGGTGCG CGAGGAGATC TTCGGGCCTG TGCTCGTCGC CACCCCGTTC
GATGAGCCCG ACGACGCGCT GATCGCCGAA GCGAACAACA CGATCTACGG CCTCGCCGCG
GGCGTCTGGT CGGGCAATAC CGGGCGGGCG CATCAGATCG CCAACCGGCT GCGCGCCGGC
ACGGTGTGGA TCAACTGCTA CCATGTCTTC GACGCGGCGC TGCCCTTCGG CGGCTACAAG
CAGTCGGGAT GGGGCCGCGA GATGGGACAG GCGGTTCTGT CGAACTACCT CGAGGCGAAG
GCGATCACGA CGCGGATCGG CTGA
 
Protein sequence
MNVRANVSAA AAVIPDEIRN WLSGPRPMLI DGKWVKSVSG KTFDVFDPAT GEKIASVAEG 
DAADVDLAVA AARRAFESGP WSRMTPSERG RIIHRIGDLI LDHADELAAI ESLDNGKPKA
VAKAADVTLS ADMFHYMSGW ATKLEGKHIP ISALTAPGME FVSMTRLEPI GVVGQIIPWN
FPLLMAAWKL APALTTGCAV VLKIAEETPL SALRLGELLI EAGVPDGVVN IVPGFGETAG
AALAGHPGVD KVAFTGSTEV GRLIVQAASR DLKKVSLELG GKSPNIVLGD ADPEMAIAGA
TAAIFFNHGQ CCNAGSRLFV QRNLFDKVVE GIAAQAEKIK LGHGLNAETE MGPLVSRVQY
DRVTGLLASG RQEGARAVCG GEGLGGAGYF VPPTVLVDTN PGMRVVREEI FGPVLVATPF
DEPDDALIAE ANNTIYGLAA GVWSGNTGRA HQIANRLRAG TVWINCYHVF DAALPFGGYK
QSGWGREMGQ AVLSNYLEAK AITTRIG