Gene Msil_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0039 
Symbol 
ID7092367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp35016 
End bp36743 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content63% 
IMG OID643463372 
Producthypothetical protein 
Protein accessionYP_002360384 
Protein GI217976237 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTAG TCAATGGACA TTTTGGCGCG CGCACCATCG CTGCCGAATC GCCGTCGAGC 
GCTGCATTGG AGCGCGGCGG CGCCGCAGCT TCCCATGCGG AGCAGGGCGC GGCGTTCGGA
AACATTATGG CGGACATCTC AAAAGCCGAC GGACCTGTTG CTCAAGGCTC CGCAACGCCG
AAGGGATTGA GCCGCCCGAT CACGGCGCGC TTTGAAGAAG GAGAGAAAGC TTGCCCTTCT
GCCGACCCAA TCCAAGGCGG GTCTGATGCG TCGGCCGACG CCGGGTCTGC GGCAGGGCCG
CAACTACCGG GAGAGGCTGC TTTTGGCGCC GCGCGGCGCG CTTTCACCGC AGAGGGCTCC
GGCGCTGGGA TCGTTAATGG AGTCGGTCAG AAGGCCGCGT CGCCAACGCG AGCCGACGCG
GGCGTCGCCG CGCAATTCAC CGGCGATCCG GCCATTGACG CTGATCTGGC GCCGTGTTTT
GGAGCTGGCG GCCGACACAA ATTTCACGAC GTGAAGGAGT CAAAAGAGAC CGCGCCAAAT
CCTTCGACGC CAGATCATGT TACGACAAAG CCAAAAAGCG GCCTCGCCGG AGACGCCGCG
ATTTCGGCGT CTGTCGTTCC GTCGGCCGGG GTTGCGAGCG GGCTGTCGCT CCGTTCCGAT
CCAGCCGCCG CGCCTGTAGA TCCCTTGTCG GGCGCTGGGA GCGACTGGCT GACCAGACAT
CCGGCCGCGC CGTCGCTTGC ACTGCCGAGC GAAATGCGTC TGGCCCGCGC GGCATCCGAA
CCTGCCAGCG ATCTCGACTT GCTGGCGGCC GGACTGTCGG AGGCCCGAGG GCCAGGCGCT
GAAAAAGCGC AACGCCGTGA AAGCCTGCTC CAGCAATCGT CTGGCTCAAG GGCGACACAG
ATCGTCGCGG GCGCTTCGGA GGACTCATCG TCGAGCGCAA TGGCGATTCT TCGCGCGAGG
CGCCCGACAG TGGCGAACCT CGCGCCCCAT TCAACGGCAC ATGAAGGCGC AACGCAACTG
GATCCGCTGG CGGCTTCACG CAGCGAGGCC GATATTGGTC GAGAGATCGC GCGGCTTGGA
TCCCAGGGTA GCGAGATCAA GGTCTCCGTC ATTGAGCGGC GGACTCATCT ACCGCCTGTG
GTCGAGGCGG CAAAGCCAAT TGAGCAAATC GGCAATCAGA TCCTGACCGA AGCATCGCTT
TTGTTGACGC CATCCGCCGG CGGGAGTGAA TCCCGAACCA CGACACAGCG TCCCGCCGGC
GCATCCACGA CTGGCGTGGA GGCCCCGCAA AAACCGATGC CCGTGATGAA AACTCTCGAT
CTGAGGCTGG AGCCCGAAAG CCTTGGCGCC GTCACCATTC GCTTGCGCCT TTCCGGCGTC
CAGCTCGAGG TGCAGGTCGA GGCGTCCCAT GTTCAAACGA TGAAGCTGAT CCGAGACGAC
AAGGATCTGC TTTCCGCCAA GCTGCGGTCG TCCGGCTATG CAATGGATCA CCTTGTGGTG
AAGCTTGCAG AGCAACAGAT CGGGCCGGCG CAAACTCGGG CGGAAGCGGG ACAAGGTCAT
ACATTCGATG GTCAATCGGG GAATTTCACG CCCTCGTCAG AATTATCGCA GCAAGGGGGG
TCCGGCGCCA ATGATCGGCA GGCCGCCAGG CGAGATCAGA CGACGGGATT TGGCAAAGGC
GATGATGCGG AGGATCATGC TCGTCGCGGC GGCGATCTCT ATCTTTGA
 
Protein sequence
MNVVNGHFGA RTIAAESPSS AALERGGAAA SHAEQGAAFG NIMADISKAD GPVAQGSATP 
KGLSRPITAR FEEGEKACPS ADPIQGGSDA SADAGSAAGP QLPGEAAFGA ARRAFTAEGS
GAGIVNGVGQ KAASPTRADA GVAAQFTGDP AIDADLAPCF GAGGRHKFHD VKESKETAPN
PSTPDHVTTK PKSGLAGDAA ISASVVPSAG VASGLSLRSD PAAAPVDPLS GAGSDWLTRH
PAAPSLALPS EMRLARAASE PASDLDLLAA GLSEARGPGA EKAQRRESLL QQSSGSRATQ
IVAGASEDSS SSAMAILRAR RPTVANLAPH STAHEGATQL DPLAASRSEA DIGREIARLG
SQGSEIKVSV IERRTHLPPV VEAAKPIEQI GNQILTEASL LLTPSAGGSE SRTTTQRPAG
ASTTGVEAPQ KPMPVMKTLD LRLEPESLGA VTIRLRLSGV QLEVQVEASH VQTMKLIRDD
KDLLSAKLRS SGYAMDHLVV KLAEQQIGPA QTRAEAGQGH TFDGQSGNFT PSSELSQQGG
SGANDRQAAR RDQTTGFGKG DDAEDHARRG GDLYL