Gene Msil_2740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2740 
Symbol 
ID7092193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3017723 
End bp3018772 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content68% 
IMG OID643466053 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_002363023 
Protein GI217978876 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGAGCGC CAAGGCTCCG GACCCGCAAG CTGGATCTCC AGAATGGTCG CGTCGACCTT 
TCCCACGGCG CCGGCGGCCG GGCGATGGCC CAGCTCATCG ACGAGATCTT CCGCGAGGCG
TTCGACAATC CGATGCTCGA TCAAGGCAAC GACCAGGCTG CGTTCGACGT CCCGGCCGGC
CGCATGGTGA TGTCGACGGA CGGCTACGTG ATCTCGCCGC TTTTCTTTCC CGGCGGCGAT
ATCGGATCGC TGGCTGTGCA TGGCACGATA AACGACATCG CGATGGCCGG CGCACGCCCC
CTGCATCTGG CCGCCAGCTA TATCATCGAG GAGGGCTTTC CGCTCGCGGA CCTTCAGCGG
ATCGCGGGTA GCATGGGGTG CGCTGCGCGC GACGCCGGCG TGGCGATCGT GACCGGCGAC
ACCAAGGTGG TCGAGCGTGG AAAAGGCGAC GGCGTCTTCA TCGCGACGAC CGGCATCGGC
GTCGTCCCGC CGGGCCTCCA TCTCTCGGGC GAGCGCGCCC GCCCGGGCGA CCGGGTGATC
ATTTCCGGCT ACATCGGCGA TCACGGCGTC GCGGTCATGT CGACAAGGCG CGATCTCGGA
TTCGAGACGG AACTCCTCTC GGACAGCGCC GCCTTGCACG GGCTGGTCGC CGAAATGGCG
CGCGTCGCGG GTTCCTCGCT CCGGCTCTTG CGCGACCCAA CGCGCGGCGG CCTGGCCACG
ACCCTCAACG AGATCGCCCA GCAATCGGGC GTCGGATTCC TCATCGATGA GGGCGCGATC
CCCGTTCGGG CGGAGGTCGC CGCCGCCTGC GAACTCCTCG GATTGGACCC GCTCTATGTC
GCCAATGAGG GCAAGCTGGT CGCCATCGTG GCGCCGGACG CCGCGGAGAC CCTCGTTGCC
GCGATGCGCG CGCATCCCCT CGGCCGCGAC GCGGCTCTGA TCGGAGAAGC GACCGCCGAC
GAACAGCGCT TCGTACAGAT GACGACTTCG TTCGGAGGCG GCCGGATTGC GGATTGGCTG
ATGGGCGAGC AATTGCCCCG GATCTGCTGA
 
Protein sequence
MRAPRLRTRK LDLQNGRVDL SHGAGGRAMA QLIDEIFREA FDNPMLDQGN DQAAFDVPAG 
RMVMSTDGYV ISPLFFPGGD IGSLAVHGTI NDIAMAGARP LHLAASYIIE EGFPLADLQR
IAGSMGCAAR DAGVAIVTGD TKVVERGKGD GVFIATTGIG VVPPGLHLSG ERARPGDRVI
ISGYIGDHGV AVMSTRRDLG FETELLSDSA ALHGLVAEMA RVAGSSLRLL RDPTRGGLAT
TLNEIAQQSG VGFLIDEGAI PVRAEVAAAC ELLGLDPLYV ANEGKLVAIV APDAAETLVA
AMRAHPLGRD AALIGEATAD EQRFVQMTTS FGGGRIADWL MGEQLPRIC