Gene Msil_3751 details

Gene Information

Locus tag: Msil_3751
Symbol:
ID: 7093105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 4108390
End bp: 4109547
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 62%
IMG OID: 643467036
Product: hydrogenase expression/formation protein HypD
Protein accession: YP_002363995
Protein GI: 217979848
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0409] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00075] hydrogenase expression/formation protein HypD
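
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length agrees with that reading) and that the final codon of the CDS is a stop codon. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4108390, 4109547)
print(length)                     # 1158
print(protein_length_aa(length))  # 385
```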


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATATG CCGACGAATT CCGCGACCGC GAGAAGGCGG CGATTTTGAT CCGGGAGATC 
GACAAGCTCG CCGCCACGAT AGAGATCGCC AAAACCCGGC CAATCAACAT CATGGAAGTC
TGCGGCGGAC ATACCCATTC GATCTTCCGC TATGGACTGG AAGGCCTGCT GCCCGGCGCC
ATCGAACTGG TGCATGGACC GGGCTGCCCC GTCTGCGTTC TACCGATGGG CCGGGTCGAC
GACTGCGTGT CCATCGCCGA GCGGCCGGAG GTCATCTTCG CCACCTTTGG CGACGCCATG
CGCGTGCCCG GATCGAAAAA AAGCTTGCAA CAGGCCAAGG CCCAAGGCGC CGACGTCCGC
ATGGTCTATT CGCCTCTCGA CGCGCTCGGC CTGGCCCGCA AAAATCCCGG CCGCGAAGTG
GTCTTCTTCG GGCTCGGCTT CGAGACCACC ATGCCGTCGA CCGCCTTGAC GGTGCTGCAG
GCCGAGGCTG ACGGCGTCGA GAATTTTTCG GTGTTCTGCA ACCACATCAC CATCGTGCCG
ACGATGAAGG CGATCCTCGA CAGCCCCGAG CTCAACCTCG ACGGCTTCCT TGGACCCGGC
CATGTCTCGA TGGTGATCGG CGCGGCGCCC TATCAATTCA TCGCCGATGT CTACAAGCGG
CCGATGGTGA TCGCCGGCTT TGAGCCGCTC GACGTGCTGC AATCGATCTG GATGGTGCTG
AAGCAGATCA AGGAGGGCCG CGCCGAGATT GAGAACCAAT ATGCGCGCGT CGCCCCCGCG
GCGGGCAACG CGGCGGCGCT GAACGCTGTC GGCAAAGTCT ATGAGTTGCG CGAATTTTTT
GAATGGCGCG GCCTCGGCTC CATCGATCAT TCCGGAGTGA AAATCCGCGA CGAATATGCG
CGTTTCGACG CGGAGCGGAA ATTCGCCATT CCCAACGTCA AGATCGCCGA TCCGAAATCG
TGCCAATGCG GCGATGTGCT GAAGGGCGTC ATCAAGCCGT GGCAATGCAA GGTCTTCGGC
GCAGCCTGCA CGCCGGAGAC GCCGCTCGGC GCGCTGATGG TGTCGTCCGA GGGCGCCTGC
GCCGCCTATT ATCAATATGG CGGCGTCAAG CGCCATGGCG CCAGTGAGGC GGCTCCGCAA
CTGGCGACAG CATCATGA
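
The GC content reported for this gene can be recomputed from the sequence above. The following is a minimal sketch, not the database's own method: `gc_fraction` is a hypothetical helper that strips the block spacing and counts G and C bases. It is demonstrated on the first 10-base block only; pasting the full gene sequence into the call would give the whole-gene figure.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First block of the gene sequence: 2 G, 0 C out of 10 bases.
print(gc_fraction("ATGAAATATG"))  # 0.2
```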
 
Protein sequence
MKYADEFRDR EKAAILIREI DKLAATIEIA KTRPINIMEV CGGHTHSIFR YGLEGLLPGA 
IELVHGPGCP VCVLPMGRVD DCVSIAERPE VIFATFGDAM RVPGSKKSLQ QAKAQGADVR
MVYSPLDALG LARKNPGREV VFFGLGFETT MPSTALTVLQ AEADGVENFS VFCNHITIVP
TMKAILDSPE LNLDGFLGPG HVSMVIGAAP YQFIADVYKR PMVIAGFEPL DVLQSIWMVL
KQIKEGRAEI ENQYARVAPA AGNAAALNAV GKVYELREFF EWRGLGSIDH SGVKIRDEYA
RFDAERKFAI PNVKIADPKS CQCGDVLKGV IKPWQCKVFG AACTPETPLG ALMVSSEGAC
AAYYQYGGVK RHGASEAAPQ LATAS
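
The protein sequence follows from the gene sequence through translation table 11, whose codon-to-amino-acid assignments match the standard code (the tables differ in which codons may serve as starts). As a hedged illustration, the sketch below builds the codon table from the compact NCBI-style amino acid string and translates the first and last lines of the gene sequence; `translate` is an illustrative helper, not part of any particular library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene and its final 18 bases (which start in frame).
print(translate("ATGAAATATG CCGACGAATT CCGCGACCGC"))  # MKYADEFRDR
print(translate("CTGGCGACAG CATCATGA"))               # LATAS
```

Both results agree with the start and end of the protein sequence listed above.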