Gene Msil_0138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0138 
Symbol 
ID7090454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp133942 
End bp135006 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content67% 
IMG OID643463472 
ProductTonB family protein 
Protein accessionYP_002360482 
Protein GI217976335 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCAGA AGACTTTTCG CGGCGATGAT GCCTCAGCGA TCCTGACCGG CGCCTGCGCG 
GCGCCGGGCG GCCAAGATGA CTCGCTCTTG ACGGATGAGT CGGCGCTTAC GGCCGGCGCC
GAGGACCATC TGGCCGAGCC GGCTCCCGCT GCGGAGGCGT CGCGGCGGCG GTTCTGGCTC
ATTCTCGCCG CCTGCCTCTT CGCCCATGCG CTGATTCTCG CCGCGATCCT TTACGAAAAC
AATGTGCAGC CGCCGATCGC CCCGGTCGAG GAGATCCCGG TCGAGCTCGT GCAGGAGATT
CCGCAGCCCA AGGTCGAACC TCCGCCTCCG CCGCAGCCCC CTAAAAAAGA GGAAAAGCGG
CCGAAGCAAA AAATAGAAGA CGACGACCGC GTCGCCTACG ACGCGCCGCG TGCGGAGAAC
AAGGAAAAGA TCGAGCGCGA GGCGCCAGAT CCCGAGACCA AGGCGCAGCG CCAGGCGCCG
CCCTCCGAGC AGACGGCCGA GACCCCGTCC CCGCCGCAAA AGGCGGAGGC GCCGCCTATC
GCCACAGTGA TCGCGCCGCC CGAGGAAGCG CCGGCGAAAA TCGCCGACGA CAAACCGGAC
GCCGAGCCTC TCGACAAGGC CACGCCCTCG CCAAAGAAGA AGCCGACCGA GGCGAAGTCG
CCGGTCGTCT CAAAGGCGCC GCCGACCAAA TCCAAGAAGC AGAGCGTCGC GGACCAGCTC
GCCTCGCTGG CGCCGACGCC CGACTACAAG GTGGGATCGG CGGCAAAGCC CTCGCCCGTC
GCCGGCGGCG CGGCCAAGAC GACCTATCTC TCGATCCTCT ACGGCCTCAT CATGCGCCAG
ATGCATGTGC CGGCGGACCT TCAGAATGGC CATCAGCAGG CCGACGGCAT CGTCGCCTTT
TATGTCGACG AAAGAGGCAA TCTCACGCAT CAGGCGATCT ATCGCGCCAG CGGGCGCCCG
GACTTTGACG CGGCGGCGCT GAATGCGGTG CGCCGCGCCG CGCCCTTCCC TGCCCCGCCG
CGAGGCGATC CACACTCGAT CTGGTTTCAC TACGATACGC GGTGA
 
Protein sequence
MLQKTFRGDD ASAILTGACA APGGQDDSLL TDESALTAGA EDHLAEPAPA AEASRRRFWL 
ILAACLFAHA LILAAILYEN NVQPPIAPVE EIPVELVQEI PQPKVEPPPP PQPPKKEEKR
PKQKIEDDDR VAYDAPRAEN KEKIEREAPD PETKAQRQAP PSEQTAETPS PPQKAEAPPI
ATVIAPPEEA PAKIADDKPD AEPLDKATPS PKKKPTEAKS PVVSKAPPTK SKKQSVADQL
ASLAPTPDYK VGSAAKPSPV AGGAAKTTYL SILYGLIMRQ MHVPADLQNG HQQADGIVAF
YVDERGNLTH QAIYRASGRP DFDAAALNAV RRAAPFPAPP RGDPHSIWFH YDTR