Gene Msil_1988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1988 
Symbol 
ID7094186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2158078 
End bp2159268 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content60% 
IMG OID643465314 
Producthypothetical protein 
Protein accessionYP_002362292 
Protein GI217978145 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGGGG CCGGCGGAGG TAGCATAAAC GACACCCGGC GCAGATTTTT CGCCAAAGCT 
GGCGCCGCTC TCGCCTCGCT GCCTGTTTTG AGCATGGAAA GCAAGCAAGC GCAGGCGCAA
GATTTCGATA ATTCCTCGGC CAATCATCCC GCAGGCAACG GTCCCCTCGT CATCGCTCGG
CAGGGCAGCT TCATGGCAGG CGGTTCTGTG CTCCAGACGC CAGGCGTGTT CGATCCGACG
GTGACGGGCG GCCCCGGCCA GACTCTTCAC GGCGATCATG CTTACGCGCA GTTTCAAATC
CCGCAAAATC CGCGTTCTCT TCCACTGGTG TTTTGGCCGG GCGGCGGTCA GAGCGGCAAG
AGCTTTGAGA CGACGCCCGA TGGGCGCGAG GGATATCAAT CGATTTTTCT ACGCCGCAAT
TTTGGCGTCT ACATCATCGA TCAGCCGCGA CGCGCTCGCG CAGGCAACAC CACGGTAGGA
ACCACTCTGA CGCCAACGCC CGGAGAACAA GATCTTTTTG TTGCGTGGCG ACTCGGCGTC
TGGCCGAAGT TTTTTCCGAA CAGTCAATTC CCGCAGGCAA AACAGGGAAT CAACTCGCCC
GCGCTCAACC AATTCTTCCG TTGGGCGACG CCGGACACAG GGCCGCAAGA CAGAAACGTC
ATCACCGATG GAGTGGCGGC GCTTCTCGAA GAGATAGGAC CGTGCATCCT GATCATCCAC
TCCGCCAGCG GCGTCCTTGG CTGGCTTACG GCGATGAAGA GCCCGAACGT CAAGGCGATC
TATGCCTACG AGCCCGGTGG CGGAAACCCG GACTACGCTT TTCCATCCAA TGAGCTTCCG
CCCGCGCTCG GCAGCGGGCC GACCTTGCTG AGCCCGAACC CCGTGCCGTT GTCGGATTTT
CTCAAGCTCA CGAAAATCCC GATCCGGATG CAGTTCGGCG ACGGGCTTTC CGCGCAGTCA
AGCCCCTATC CGCGCGTGCA GCTCTGGCTG AACCGCTTCA AGATGGCGCA GCAAATGGTG
GCGGCGATCA ACAAACACGG CGGCAACGCC TCGATACTCC ACCTGCCGGA TATCGGGATT
CGCGGCAACA CGCATTTCTC GTTTTCCGAT GCGAACAATC TGCAGATCGC CGACATCTTT
TCACAATGGC TCGCCAAGAA TCGCCTCGAT GGTTATGGCG GCCATCGATA G
 
Protein sequence
MGGAGGGSIN DTRRRFFAKA GAALASLPVL SMESKQAQAQ DFDNSSANHP AGNGPLVIAR 
QGSFMAGGSV LQTPGVFDPT VTGGPGQTLH GDHAYAQFQI PQNPRSLPLV FWPGGGQSGK
SFETTPDGRE GYQSIFLRRN FGVYIIDQPR RARAGNTTVG TTLTPTPGEQ DLFVAWRLGV
WPKFFPNSQF PQAKQGINSP ALNQFFRWAT PDTGPQDRNV ITDGVAALLE EIGPCILIIH
SASGVLGWLT AMKSPNVKAI YAYEPGGGNP DYAFPSNELP PALGSGPTLL SPNPVPLSDF
LKLTKIPIRM QFGDGLSAQS SPYPRVQLWL NRFKMAQQMV AAINKHGGNA SILHLPDIGI
RGNTHFSFSD ANNLQIADIF SQWLAKNRLD GYGGHR