Gene Msil_2923 details

Gene Information

Locus tag: Msil_2923
Symbol:
ID: 7092843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 3220374
End bp: 3221684
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 64%
IMG OID: 643466236
Product: NADH dehydrogenase I subunit F
Protein accession: YP_002363201
Protein GI: 217979054
COG category: [C] Energy production and conversion
COG ID: [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
TIGRFAM ID: [TIGR01959] NADH-quinone oxidoreductase, F subunit
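
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS with one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(3220374, 3221684)
    print(length, "bp")                              # Gene Length field
    print(expected_protein_length_aa(length), "aa")  # Protein Length field
```

Under these assumptions the record is internally consistent: the coordinates give 1311 bp, and 1311 bp corresponds to 436 residues plus one stop codon.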


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGAAG ACAAGGACCG CATCTTCACC AATCTTTATG GCTTCGGCGA TTGGGGCCTG 
AAGGGCGCGC TGGCGCGCGG CGCATGGGAC AACACCAAGG GACTGATCGA GAAGGGCAAG
GACTGGATCC TGACCGAAAT GAAGGCCTCG GGGCTGCGCG GGCGCGGCGG CGCGGGTTTT
CCGACGGGAC TGAAATGGTC TTTCATGCCC AAGGCCGACG CCGCGCGGCC GTCTTATCTC
GTCGTCAACG CCGACGAGTC GGAGCCCGGC ACCTGCAAGG ACCGCGAGAT CATGCGCAAT
GATCCGCACC TTCTGGTCGA GGGCTGCTTC GTCGCCAGCT TCGCGATGGA GGCGCACGCC
TGTTACATCT ATATTCGCGG CGAATATATC CTCGAGGCGG ACCGGCTGGA GGCGGCGATT
CAGCAGGCCT ATGAGGCGGG CCTCGTCGGC AAGGACAATA TCCACGGCTG GCCGTTCGAC
ATCTACGTCC ATCGCGGCGC CGGCGCCTAT ATCTGCGGCG AGGAGACGGC GCTTCTCGAA
TCGCTTGAAG GCAAGAAGGG GATGCCGCGG CTGAAGCCGC CGTTTCCCGC CAATATGGGC
CTCTACGGCT GTCCGACGAC GGTCAATAAT GTCGAATCGA TCGCGGTCGC GCCGACCATT
TTGCGCCGCG GCGCGGCCTG GTTCTCAAGC TTCGGCGCGA AGAACAATTC CGGCACCAAG
CTTTTCTGCA TCTCCGGTCA CGTCAACAAG CCCTGCAACG TCGAGGAGGC GATGTCGATC
CCGTTCCGGG AGTTGATCGA CAAGCATTGC GGCGGCGTTC GCGGCGGCTC GGACAATCTG
CTCGCGGTCA TTCCTGGCGG GTCCTCGGTC CCCTGCGTGC CGGCGGCGCA GATCATCGAC
GCGGCGATGG ATTTCGACAC GCTGCGCGAT CTGAAGTCGG GCCTTGGCAC GGCGGCCGTC
ATCGTCATGG ATAAATCGAC CGATATCATC CGCGCCATCG CCAGGCTCAG CTATTTCTAC
AAGCATGAGA GCTGCGGCCA GTGCACGCCT TGCCGCGAAG GGACAGGCTG GCTGTGGCGC
GTCGTCACCC GCATGGCCGA GGGCCGCGCG CAAAAGCGCG AGATCGACAT GCTGCTCGAA
GTGACGACGC AGATCGAGGG CCACACCATC TGCGCGCTCG GCGACGCGGC GGCCTGGCCG
GTGCAGGGCC TCATCCGGCA TTTCCGGCCG GAGATCGAAA AGCGCATCGA TCAATATGCC
GCAAATCCGC ATTCGGAGCC CGCGCGCGGC TACCATGCGG CGGCGGAGTA G
 
Protein sequence
MLEDKDRIFT NLYGFGDWGL KGALARGAWD NTKGLIEKGK DWILTEMKAS GLRGRGGAGF 
PTGLKWSFMP KADAARPSYL VVNADESEPG TCKDREIMRN DPHLLVEGCF VASFAMEAHA
CYIYIRGEYI LEADRLEAAI QQAYEAGLVG KDNIHGWPFD IYVHRGAGAY ICGEETALLE
SLEGKKGMPR LKPPFPANMG LYGCPTTVNN VESIAVAPTI LRRGAAWFSS FGAKNNSGTK
LFCISGHVNK PCNVEEAMSI PFRELIDKHC GGVRGGSDNL LAVIPGGSSV PCVPAAQIID
AAMDFDTLRD LKSGLGTAAV IVMDKSTDII RAIARLSYFY KHESCGQCTP CREGTGWLWR
VVTRMAEGRA QKREIDMLLE VTTQIEGHTI CALGDAAAWP VQGLIRHFRP EIEKRIDQYA
ANPHSEPARG YHAAAE
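
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that in plain Python, and then to compute GC content and translate the gene sequence. It is only an illustration: the helper names are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so a standard codon table is used here.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence listed in this record.
    first_line = ("ATGCTTGAAG ACAAGGACCG CATCTTCACC "
                  "AATCTTTATG GCTTCGGCGA TTGGGGCCTG")
    dna = join_blocks(first_line)
    print(translate(dna))
    print(round(gc_content(dna), 2))
```

Applied to the complete gene sequence, the same functions can be used to compare the result against the listed protein sequence and the GC content field.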