Gene Msil_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0235 
Symbol 
ID7090552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp262125 
End bp263255 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content64% 
IMG OID643463569 
Producthypothetical protein 
Protein accessionYP_002360578 
Protein GI217976431 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.635295 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGA TCGGCGCCGC CATCCGGGCC GAGGGCGAGG CCTTAGGTTC GATCTTCAGC 
GAGCTCGCCG CCGATCTCAG AGAGGCGAGC CTCTTTGGCC CGCGGGCGCG GTTTTGCGCC
GCTTCCGCTC TCTCCGTCGG ATTGGCGACA GTCGTCGCGC TCGCCATGCA CGTCGACGAC
GTATGGTGGG CGGCGATCAG CGCCTTCATG TGCAGTCAGG CGACCCTGCC GGCGTCCCTG
ACAAAAGGGG TCTTGCGTAT GATCGGCACC ATCGCAGGCG CCATCGCCGC CCTCATGCTC
GCGTCCTGGC TCTCCTATGA CTGGGTGGCC TGTTGTCTTT TTCTGTTTAT GTCGACCTTC
ATCGGCACGC TCGGCTTTCA GCTCAGCCCG CATGCCTACG CCTGGCTGCT TGGCTCGATC
ACGTTTAATT TCATTATCCT GTTGGCGCTG TCCTCGCCGC AGGACACATT CTATTTTTCG
ATCTATCGCA TCATGGAAGT CGCCATCGGC GTGGCGTCGG CGCTGCTGAT TGCGGTCCTC
CTTGCGCCCA AGGAGGGCGG AGCGATGCTT CCCGCCGCCG GATGGGGCAG CTTTCTCGAC
GACGCTCAAA CCATGGCGCG GCTGCATGCG CTTCGCGCGG CGTTCACCGT CATGCTCATT
CCGATCGTTT GGAGCTACGC CGAACTGCCA AGCCTCGCGC AGATGGCAAT TACGATCAGC
GCCGTGATGG CCGTGCCGGC GCCGACGGCC GCGACGCCTG ACCCCGGCCT CATGATGGTC
CGTCGTGCGC TTCACCGACT GCTCGGCTGC TTTATGGGTG GGATCATTGC GCTTGTCTGC
CTCGCCGCGC CGCTGACCAA TTTCCTCGTC TGGCTCGCAA CGCTGATGGG CGGCGTCTGG
ATCGGCTGCC ACCTTCAGGC CACCCCGCGC AAGATTGGCT ATGTCGGCAC CCAGGGAGCC
ATCGTCTTCA TCATGACGCT GGTGCAGGGA TTTGGGCCGC CGACCAGCAT CTGGCCGGCC
GTCGAACGCC TCGGCGGCGT CAGTTTCGGC CTGCTGATCC TGCTTCTAGT GTCGATCGTC
TTCGAGATCC TGGTTCCCGA GACGACGCCC GCGCGCCTCG CCGTCGATTA G
 
Protein sequence
MSGIGAAIRA EGEALGSIFS ELAADLREAS LFGPRARFCA ASALSVGLAT VVALAMHVDD 
VWWAAISAFM CSQATLPASL TKGVLRMIGT IAGAIAALML ASWLSYDWVA CCLFLFMSTF
IGTLGFQLSP HAYAWLLGSI TFNFIILLAL SSPQDTFYFS IYRIMEVAIG VASALLIAVL
LAPKEGGAML PAAGWGSFLD DAQTMARLHA LRAAFTVMLI PIVWSYAELP SLAQMAITIS
AVMAVPAPTA ATPDPGLMMV RRALHRLLGC FMGGIIALVC LAAPLTNFLV WLATLMGGVW
IGCHLQATPR KIGYVGTQGA IVFIMTLVQG FGPPTSIWPA VERLGGVSFG LLILLLVSIV
FEILVPETTP ARLAVD