Gene Msil_3750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3750 
Symbol 
ID7093104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4107326 
End bp4108393 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content69% 
IMG OID643467035 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_002363994 
Protein GI217979847 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATGC ATCCAATTTC GCCGCGACGC GCGCTGGGGC GCGTTTTCGT GCCGATGGTG 
ACGCTCGCCC ATGGCGGCGG CGGCAAGGCG ATGAAGGATC TGATCGACGA CGTGTTCATC
AGCGCCTTCG CCGATGCGAC GCCGCAAGTG CTGGAAGATC AGGCGCGGTT CGACCTTGCG
GGCCTCGCCG CGCACGGCGA CCGGCTGGCT TTCACCACCG ACAGTTTTGT CGTCGATCCG
CTGTTCTTCC CGGGCGGCGA CATCGGCAAG CTCGCGGTCT GCGGCACGAT CAACGATCTC
GCCGTCGGCG GCGCCAAGCC CCTCTATCTC TCCTGCGCCG TGGTCATCGA GGAAGGGATG
CAGGTGGAGC TGCTGCGTCG CATCGCGCAA TCCATGGCGC ATGCGGCGCG CGAGGCGGGC
GTTTCCATCG TCACCGGCGA CACCAAGGTG GTGCAGCGCG GCGCCTGCGA CAAGATCTTC
ATCACCACGA CGGGCATTGG CGTGATCGCG CCGGGCGTCG ACCTTGGCGT CGATCGGGTG
AGGCCCGGCG ACGGAATATT GGTCAATGGC CTGCTCGGCG ACCATGGCGC CGCCATCCTC
TGCGCGCGCG GCGACCTCGC GCTGGAGACA GCGATCGAAA GCGACTGCGC CGCGCTGCAC
GGCTTGATCG CGGCGCTGCT GCGGGCCGCG CCGGGCGCGC GCTGCATCCG CGACGCGACG
CGCGGCGGCC TCGCCACGGT GCTCAATGAA ATCGCCGACG CCTCCGGCGT CTCGATCGAG
ATCGAGGAAG CGCTGACGCC GCTGCGCGAG GAGGTGCGTG GTTTCTGCGA AATCCTTGGC
CTCGACCCGC TCTATCTCGC CAATGAGGGC AAGATCGTCA TCGCCGTGCC GCGAGACGAG
ATCGCCGCGG CGCTGGCTGC GCTCGGCGCG CATCCGCTCG GCGCGGGCTC CGCGCTGATC
GGCTATGCCG GCTGCGGCGA GCCGGGGCGC GTGACCATGC AGACCGTGTT CGGCGGGCGG
CGCATCGTCG ACATGCTGGT CGGCGAACAG CTGCCGCGAA TCTGTTGA
 
Protein sequence
MNMHPISPRR ALGRVFVPMV TLAHGGGGKA MKDLIDDVFI SAFADATPQV LEDQARFDLA 
GLAAHGDRLA FTTDSFVVDP LFFPGGDIGK LAVCGTINDL AVGGAKPLYL SCAVVIEEGM
QVELLRRIAQ SMAHAAREAG VSIVTGDTKV VQRGACDKIF ITTTGIGVIA PGVDLGVDRV
RPGDGILVNG LLGDHGAAIL CARGDLALET AIESDCAALH GLIAALLRAA PGARCIRDAT
RGGLATVLNE IADASGVSIE IEEALTPLRE EVRGFCEILG LDPLYLANEG KIVIAVPRDE
IAAALAALGA HPLGAGSALI GYAGCGEPGR VTMQTVFGGR RIVDMLVGEQ LPRIC