Gene Msil_3764 details


Gene Information

Locus tag: Msil_3764
Symbol:
ID: 7090692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 4119231
End bp: 4120193
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 63%
IMG OID: 643467049
Product: NADH ubiquinone oxidoreductase 20 kDa subunit
Protein accession: YP_002364008
Protein GI: 217979861
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
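
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4119231
END_BP = 4120193
GENE_LENGTH_BP = 963
PROTEIN_LENGTH_AA = 320


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 4119231 to 4120193 covers 963 bp, and 963 bp corresponds to 321 codons, that is, 320 residues plus one stop codon.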


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAATC TGTTGTGGCT CCAGGGCGGA GCCTGTTCCG GCAACACGAT GTCGTTCCTG 
AACGCAGAGG AGCCGAGCGC CTGCGACCTA GTCACGGATT TCGGCGTCAA CGTCCTGTGG
CATCCCTCCC TCGGCATGGA GCTCGGCGAC AATCTGAAGA AGCTGCTGCG GGCGCTGACC
TCAGGCGAGA TCGCGCTCGA TATTTTTGTC TTCGAAGGCA CTGTGGTCAA CGCGCCGGAC
GGCACAGGCG AATGGAACCG CTTCGCCGGA CGGCCCATGA AGGACTGGGT CGCCGACCTC
GCCAAGGTCG CGAGCTTTAC GGTCGCGATC GGCGATTGCG CGACATGGGG CGGCATTCCG
GCGACCGCGC CCAATCCGTC AGAGAGCCAG GGCCTGCAAT TTCTCAAGCG CGCCCATGGC
GGCTTCCTCG GCAAGGACTA TAAATCCAAG GCCGGTCTGC CGGTCATCAA CATCCCTGGC
TGCCCGGCGC ATCCCGACTG GATCACGCAG ATCGTCGTGG CGGTCGCCAC CGGCCGCGGC
GGCGATTTGA CGCTCGACGA ATTTCAGCGT CCCAAAACTT TCTTCACCTC GTTCACCCAG
ACCGGCTGCA CGCGCAACAT GCATTTCGCC TACAAGGTGT CGGCAACGGA ATTCGGCCAG
CGCAAGGGTT GTCTCTTCTA CGATCTCGGC TGTCGTGGAC CGATGACCCA TTCGCCGTGC
AATCGCATCC TGTGGAACAG GCAATCGTCG AAAACTCGCG CCGGCATGCC GTGCCTTGGC
TGCACCGAGC CGGAGTTCCC CTTCTCCGAA CTCGCGCCCG GCACTGTGTT CAAGACGCAA
ACGGTGATGG GCGTGCCAAA AGACATGCCG AGCGGCGTCG ACAAGACCGG CTACGTGAAG
CTGACCGCGG CCGCCAAGGC CGCCTCGCCG CGCTGGGCCG AGGAAGACAT CTTCGTCGTC
TGA
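
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch, not the method used by the source database, of how one might strip the layout, compute GC content, and run a basic sanity check on the coding sequence. All function names are hypothetical. The start-codon check accepts only ATG, which is a simplification: translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, terminal stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First row of the gene sequence as printed in this record.
first_row = "ATGGCCAATC TGTTGTGGCT CCAGGGCGGA GCCTGTTCCG GCAACACGAT GTCGTTCCTG"
print(len(clean_sequence(first_row)))  # six blocks of 10 bases
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_percent` gives a value that can be compared against the GC content field in the Gene Information section.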
 
Protein sequence
MANLLWLQGG ACSGNTMSFL NAEEPSACDL VTDFGVNVLW HPSLGMELGD NLKKLLRALT 
SGEIALDIFV FEGTVVNAPD GTGEWNRFAG RPMKDWVADL AKVASFTVAI GDCATWGGIP
ATAPNPSESQ GLQFLKRAHG GFLGKDYKSK AGLPVINIPG CPAHPDWITQ IVVAVATGRG
GDLTLDEFQR PKTFFTSFTQ TGCTRNMHFA YKVSATEFGQ RKGCLFYDLG CRGPMTHSPC
NRILWNRQSS KTRAGMPCLG CTEPEFPFSE LAPGTVFKTQ TVMGVPKDMP SGVDKTGYVK
LTAAAKAASP RWAEEDIFVV