Gene Msil_1038 details

Gene Information

Locus tag: Msil_1038
Symbol:
ID: 7091866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1121674
End bp: 1123308
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 68%
IMG OID: 643464377
Product: NADH/Ubiquinone/plastoquinone (complex I)
Protein accession: YP_002361369
Protein GI: 217977222
COG category: [C] Energy production and conversion; [P] Inorganic ion transport and metabolism
COG ID: [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit
TIGRFAM ID:
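
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1121674, 1123308)
print(length)                     # 1635
print(protein_length_aa(length))  # 544
```

Both values agree with the Gene Length and Protein Length fields of this record.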


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.0466413
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACA TTCCGTCCCT TCTTCTTGCC GCCGCGCTGG CGACCCCGCT CCTCCTCGCT 
TGCGCCTGCC TCTTTCCCGC CGCGCGCCGC CATGCGCTGA CGCTGCTGCC GCTCGCGCCG
CTTCCCGGCC TCCTCGCCGC GCTTTCGGCC CCTTTCGCCG GACCCGCCTC CTTCGCGGCG
CCGGCGTTGC GCCTTGTCCT TGCGCTCGAT GCGCCGGGGG CCGTGCTGCT CGGCGTCGCC
GCGCTGCTTT GGACGATCGC CGGGTTTTAC GCCGCCGCCG CCATGCGCGA TCGAGCGCAC
GCACTGCGCT TTGCGATCAG CTGGCTATTG ACGCTCTCCG GCAGCCTTGG CGTCTTCATC
GCCGCGGATT TGCTGACCTT CTATCTTGTC TTCGCCCTCG TCAGCCTCCC GGCCTTCGCG
CTGATCGTCC ATGACGGCGA CGCCGCCGCC GCCAGAGCGG GCGCCGTCTA TCTTGCCTTC
ACCTTGCTCG GCGAAACAGT GCTGCTGTTC GGCTTCGTGC TGCTCGCGGC GGGAGAGCCG
AACGGCAGTC CCTTGATCAC GGACGTGATG GCCGCTCTGC CGCAATCGCC CTTCGCAGCG
CCCGCGCTCG CCCTGACCAT GGCCGGCTTT TGCATGAAGA TGGCGCTTGT GCCGATGCAC
GGCTTTCTGC CGCTTTCCTA TACGGCGGCG CCGATCGCGG CCGCCTCCGT TCTGAGCGGC
GCCGCCATCA AGGCCGGCGT AATCGGGATC ATCCGTTTCC TGCCCTATGA CGCCGCCCTT
CCGGGCCTGA GCGAAGCGCT GACGGCCGCC GGGCTGTTTT CCGCCTTCTA TGGCGTCGCT
CTCGGCATCA CGCAGAAGAA TCCGAAAACG ATTCTGGCCT ATTCAAGCAT CAGTCAGATG
GGCGTCATCG CTGTGGTGCT GGGGATGGGG CTTTCCGCCG ACGATCGCGG CGTATTGATC
GACGCCGCCT TTTATGGCGC CAATCATCTC CTGATCAAGG GCGCCCTGTT TCTGGCGGTC
GGGGTAGCGG CAATCACCGG CCCGAAACGC CTGAGGCTCG TGATCTGGCC CGCGCTTCTG
CTCGCGCTGA GCCTCGGCGG ACTGCCGCTG ACCGGCGGCG CGCTGGCCAA GCTCGCCGTC
AAGGACACGC TCGGCAGCTA TATCGTGGGC GCGCTCGCCA ATCTTTCGGC GGCCGGCACG
ACCATGCTGA TGCTGCATTT CGTGGCGCGT GTCGAAGCCT GCGCCGCAAC CGACCCGGAC
GCAAAGGCGC CGGCGGGACT GGCGGCGCCT TGGCTCGCGG CGGCGCTCGC AGCGCTGGTC
GTTCCCTGGT TGATCTTCCC CGTTCTCGGC GAGACGTTCT CCTATGCGCT GAGCCCGCCG
ATCCTCGTCG ACGGACTCTG GCCGGTCCTG CTCGGAGTTC TCCTTTCGCT CGCGCTGCGC
CGCTGGGGCG ACCGCCTGCC GGAGCCGGCC GCCGGAGACA TCGTCGGCGC GGAGGAAGCG
GGTTTTCGCG CGCTCTTTCC AATTGGCGCC CTGATGGAGC GCGCCGACCT CGCCATCCGG
CGATGGCCCG CCGCGACCCT GTCGCTAGCG ACGCTGGCGA TCCTCTTCGG CGTCTTGCTG
GCGAAGGGGT TTTAG
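
The GC content field can be recomputed from a sequence laid out as above. This is a minimal sketch that assumes GC content means the percentage of G and C bases among all bases, with the display spacing ignored. The function name is hypothetical. The demonstration uses only the first row of the gene sequence, so its result is not expected to equal the whole-gene value reported in the record.

```python
def gc_content_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks used for display)
    is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied with its display spacing.
first_row = (
    "ATGATCGACA TTCCGTCCCT TCTTCTTGCC GCCGCGCTGG CGACCCCGCT CCTCCTCGCT"
)
print(round(gc_content_percent(first_row), 1))
```

Passing the full 1635 bp sequence instead of the first row would give the figure to compare against the record.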
 
Protein sequence
MIDIPSLLLA AALATPLLLA CACLFPAARR HALTLLPLAP LPGLLAALSA PFAGPASFAA 
PALRLVLALD APGAVLLGVA ALLWTIAGFY AAAAMRDRAH ALRFAISWLL TLSGSLGVFI
AADLLTFYLV FALVSLPAFA LIVHDGDAAA ARAGAVYLAF TLLGETVLLF GFVLLAAGEP
NGSPLITDVM AALPQSPFAA PALALTMAGF CMKMALVPMH GFLPLSYTAA PIAAASVLSG
AAIKAGVIGI IRFLPYDAAL PGLSEALTAA GLFSAFYGVA LGITQKNPKT ILAYSSISQM
GVIAVVLGMG LSADDRGVLI DAAFYGANHL LIKGALFLAV GVAAITGPKR LRLVIWPALL
LALSLGGLPL TGGALAKLAV KDTLGSYIVG ALANLSAAGT TMLMLHFVAR VEACAATDPD
AKAPAGLAAP WLAAALAALV VPWLIFPVLG ETFSYALSPP ILVDGLWPVL LGVLLSLALR
RWGDRLPEPA AGDIVGAEEA GFRALFPIGA LMERADLAIR RWPAATLSLA TLAILFGVLL
AKGF
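
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table by hand rather than relying on a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and the final 15 bases of the gene sequence.
print(translate("ATGATCGACA TTCCGTCCCT TCTTCTTGCC"))  # MIDIPSLLLA
print(translate("GCGAAGGGGT TTTAG"))                  # AKGF
```

The two fragments reproduce the first ten and last four residues of the protein sequence. The final fragment is in frame because the 27 preceding rows of 60 bases total 1620 bases, a multiple of 3.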