Gene Msil_0613 details

Gene Information

Locus tag: Msil_0613
Symbol:
ID: 7093692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 663624
End bp: 665111
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 67%
IMG OID: 643463946
Product: TonB family protein
Protein accession: YP_002360947
Protein GI: 217976800
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0810] Periplasmic protein TonB, links inner and outer membranes
TIGRFAM ID: [TIGR01352] TonB family C-terminal domain; [TIGR02794] TolA protein
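
The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 663624
end_bp = 665111
gene_length_bp = 1488
protein_length_aa = 495

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last one is assumed to be the stop codon,
# which is not translated into an amino acid.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # prints: 1488 496 495
```

Under these assumptions the reported gene length (1488 bp) and protein length (495 aa) agree with the start and end coordinates.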


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.00466387
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCATTCGG CGTCGCCTCG AAGCTGTGCG AAAGCCGCCG TGCGCGGCTT TGCGGCCGGC 
GCTGCCCTGA AGCTCGAAGA TGATCCAACG CAATATGGCG ATTTTCCGCA GGATTTCTGG
GCCACGGAGT TTGACGGCGC CGAGCTGAAT TTTGCGGCGC GCGACGAATC TGGCGCCGGC
GGCCTGAGCG ACGCGGAGTC GGAAATCGCC GTCAGCGCGG ATGACAGGCT CGCCGCCGAT
CGTCCGGCCG GCTCCTTCTT TGACAGAGGC GTGATCTTCG TGCTGGCCGC ATCGCTTGTG
GCGCATGCCT GCTTGCTGCT CGCTTTTCTC TGGGCAAGCC CCTGGGACAC ACCGCCCGAG
GCGTCAAAGG AAACGCCGAT CGAAGTCGTG ATCGAACAAC CAAAGCCCGA CCCCGCGAAA
GAAGCCGCTG CGAAGCAGTC CGCGAACCGG CAGGCTGCAA GCAAGCAGGC CGAAGCCAAA
GCCGCGGAAG AACAGGCAAG GCAAGCCGCG GCCAAGGCGG CCGAGCAGGC CAAGGCTGCG
GAGCAAGCCA AGGCGACGCA GGCGAAGCAA CAGGCCGATC AAGCCGCTAA AGCGGCGGAG
GCCGCTGAAG CCAAAGCCGC ACGGGACGCC GCTGCGGCGA ACGAGGCGAA AGCCGGGGCG
GCTAAAGCTG AATCGGCTAA AGCTGAATCG GCCAAAGCCG AGGCGGCCAA AGCCGAGGCG
GCAAAGGCGG CTCAGGCCAA GGCTGACGCG GCTAAGGCTG AAGCAGCCAA GGCTGACGCG
GCGAAGGCTG AGGCGGCGAA GGCCGAAGAA GCCAGGGCTG CGCAAATGAA GGCTGCCGAA
ACGAAGGCTG CTGAAGCCAA GGCCGCCGCA ACCAAGGCTG CGGAGGAAAA AGCCGCTGCC
GCGCAGGCTG CAGCCGAAAA GGCCGAACAG GCGAAAGCGG CAAGGGCGCA GGCGAGGCAG
GCCGAGCAGG CGAGCGCCGA GCAAAAACGC GCCGAACGGG CGGAAAAGAA AGCCGCGGCG
GTGGCCAAGA GCGCTGCAGC CCGGGAGGCA GCGGCGGCCA GCGCCAATCG CCAGGGCGCG
CCGCTGCAAT CCGCCGGGTT TGGCGCGTCG AAGAGCGGGC AGGAGATGCA GCTTCCCTTC
GACAACGGGC CGCCCATTTT CCGCGCCGTC GCCGTGCCTT TGCCGACCGA GGGCGGCGAC
GAACAGATGA GCTACAAGGT CATCGTGTTC GGCATGCTGG AGCGCGCCAA GCAATATCCG
CCGGCGGCGC GCGAGCGCGG GGCGCGGGGC AGCGCCGTGG TCTCTTTTAC GCTCGAGGAA
TCGGGCGAGC TCGCCAATGT GTCGCTAATG CGCTCAAGCG GCGATTCCGA CCTCGATATC
GAGAGCCTCG CGCTGGTTGC GAGAGCCTCG CCTTTCCCGA AGCCGCCGGA GGGGGCGCAG
CGCTCCTTCG CCGCCGAAAT CACTTTCGAT CTGCGGGACG AACCCTGA
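
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be normalized before any calculation such as the GC content field. The following is a minimal sketch of that step; the helper names (`clean_sequence`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the demo input is a toy string, not the record's sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough sanity check: length divisible by 3, ATG start, stop codon at the end.

    Simplification: translation table 11 also permits alternative start
    codons (for example GTG and TTG), which this check would reject.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Toy input in the same blocked layout as the record.
demo = clean_sequence("ATGGCC GCATGA")
print(len(demo), round(gc_fraction(demo), 2), looks_like_cds(demo))
```

To apply this to the record, paste the sequence lines into one string and pass it through `clean_sequence` first; the cleaned length should then match the Gene Length field.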
 
Protein sequence
MHSASPRSCA KAAVRGFAAG AALKLEDDPT QYGDFPQDFW ATEFDGAELN FAARDESGAG 
GLSDAESEIA VSADDRLAAD RPAGSFFDRG VIFVLAASLV AHACLLLAFL WASPWDTPPE
ASKETPIEVV IEQPKPDPAK EAAAKQSANR QAASKQAEAK AAEEQARQAA AKAAEQAKAA
EQAKATQAKQ QADQAAKAAE AAEAKAARDA AAANEAKAGA AKAESAKAES AKAEAAKAEA
AKAAQAKADA AKAEAAKADA AKAEAAKAEE ARAAQMKAAE TKAAEAKAAA TKAAEEKAAA
AQAAAEKAEQ AKAARAQARQ AEQASAEQKR AERAEKKAAA VAKSAAAREA AAASANRQGA
PLQSAGFGAS KSGQEMQLPF DNGPPIFRAV AVPLPTEGGD EQMSYKVIVF GMLERAKQYP
PAARERGARG SAVVSFTLEE SGELANVSLM RSSGDSDLDI ESLALVARAS PFPKPPEGAQ
RSFAAEITFD LRDEP