Gene Msil_0152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0152 
Symbol 
ID7090468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp147236 
End bp148243 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content66% 
IMG OID643463485 
ProductNADH dehydrogenase (ubiquinone) 
Protein accessionYP_002360495 
Protein GI217976348 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0702] Predicted nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.274944 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGATC AGCTGATCAG GACGAAACGG CTTGCAGTCG TATTCGGCGG CTCGGGCTTC 
ATTGGCCGGC ACGTCGTTCG CGCGCTGGCC AAGGACGGCT GGCGCGTCCG CGTGGCGTCG
CGTCGGCCCG ATCTTGCGTT CCATTTGCAG CCGCTGGGAA ACGTCGGCCA GATCCACGCC
GTGCAGGCCA ATCTGCGCTA TCCCGACTCG ATTGAGCGCG CCCTGCGCGG CGCGGACGCC
GCGGTCAATT GCGTCGGCAT TTTGAGCCCC GCGGGCGAGC AGACGTTTGA CGCGATCCAC
GCCTCGGGCG CCGAGGCCAT CGCCAAGGCG GCAAAGGCGG CGGGCGTGAA ATCCTTCGTG
CAAATCTCGG CGATCGGCGC TGATGACGCC AGCGCCTCCG CCTATGCGAA GACCAAGGCC
CAAGGCGAGG CGCTCGTCGC CGCGGCCTTC CCCGGCGCGG TCATTTTGCG CCCCTCCGTC
GTGTTTGGCC CAGAGGATGA ATTCTTTAAT CGCTTCGCCG CCATGGCCCG CTTCATGCCC
GTTCTGCCGC TGATCGGCGG CGGCGAAACC AAGCTGCAGC CGGTGTTCGT CGGCGATGTC
GCCCGCGCCG CGGCGCTTGC GCTCGACGGC AAGGCAAAGC CCGGCGCCAT CTACGAGCTC
GGCGGGCCGG AAGTCGCGAC CATGCGCCGA ATCATGGAGT TCGTCTTAAA GGTGACCGAA
CGCAAGCGGC GGCTCGTGAC GCTGTCCTTC GATCAGGCCA GAAGCGTCGG CGGCGTGACG
GAAGTTCTCT CAAAACTGTC GCTCGGCCTG CTGCCCAAAA TGTTCGAGAT CACCCGGGAT
CAGGTCGAGC TTTTGAAACA CGACAATGTC GTCTCGAAAG CCGCAATCGT CGAGGGACGG
ACATTGCAGG GCCTTGGCCT GGCGCCGGAA TCCTTCGAGG CCTTCACGCC CACCTATTTG
ACCCGCTACC GCGCGACCGG ACAATACGCC GACCGCCGCA TGGCCTGA
 
Protein sequence
MADQLIRTKR LAVVFGGSGF IGRHVVRALA KDGWRVRVAS RRPDLAFHLQ PLGNVGQIHA 
VQANLRYPDS IERALRGADA AVNCVGILSP AGEQTFDAIH ASGAEAIAKA AKAAGVKSFV
QISAIGADDA SASAYAKTKA QGEALVAAAF PGAVILRPSV VFGPEDEFFN RFAAMARFMP
VLPLIGGGET KLQPVFVGDV ARAAALALDG KAKPGAIYEL GGPEVATMRR IMEFVLKVTE
RKRRLVTLSF DQARSVGGVT EVLSKLSLGL LPKMFEITRD QVELLKHDNV VSKAAIVEGR
TLQGLGLAPE SFEAFTPTYL TRYRATGQYA DRRMA