Gene Msil_0279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0279 
Symbol 
ID7090598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp321380 
End bp322537 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content66% 
IMG OID643463612 
Productprotein of unknown function DUF195 
Protein accessionYP_002360620 
Protein GI217976473 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0843742 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCGCC TTGTCGCATG GCTCAATGGC GCGCCGCTCA CAGCTCCGCA GCTCATGCTC 
GCCGGGGTCG GCGCCGCGCT TGTGCTGCTC TTTATCTTTG CCATCAGCCT GGCGCGGTCG
TCGCGCCGCA ACAGCGTCGC GGCGATGGCC GCCGCCGGGC GCCAATATGA GATAGAGCAG
GCGCTCGGCG CGATCGCGCG GCAGAACGCT GAGCTTTCGG GCCGCATGCG CGCCGTGGCC
GATTCGTTCG GATCGCGCCA GAGCGACCTC GCGCGCTTCG TCGCGGCGCG GCTCGACGCC
GTCGGCGAGC GGGTCGGCGC CGATGTCGAG GCCTCCGGCC GCAACGCCGG CGAGCAGCTC
GCAAGGTTGA ACGAGCGCCT GGCCGTGATC GACGCCGCGC AGGCGCGCCT GACGGGTTTG
TCGCAGGACA TGGTCGGGCT CAAGGACATT CTCGCCAATA AGCAGGCGCG CGGCGCTTTC
GGACAAGGCC GCATGGAAGC GATCATCAGC GACGCCCTGC CTTCCTCCGC TTATGCTTTT
CAGCACACGC TCTCGAACAG GATGCGCCCG GACTGCGTCA TTCGGATGCC GGGCGATCCG
CGGCTGATGG TGATCGACGC CAAATTTCCG CTCGAAGCGT TCACGGCTCA TAAGGCGGCC
CAGAGTTTCG AGGCGAAAAA ACACGCCGCG GCGCGGGCGC GCGCGGATCT TGGCAAGCAT
ATCCGGGACA TCGCCGAGCG CTATTTTCTG CCCGAGGAAA CGCAGGATAT CGCGCTGATG
TTCGTGCCTT CCGAATCGCT TTACGCCGAC ATCAACGAAC ATTTCGACGA TATCGTGCAA
AAAGCCCATC GCAGCCGGAT CATCATTGTT TCGCCGTCGC TGCTGATGAT GGCGATGCAG
CTGACGCAGG CGCTGGTGCG CGACGCGCGG GTGCGCGAGC AGACCCATGT CATCCAGGCC
GAGGTGCGCC GCCTCGTCGA GGACGTCGCG CGGCTGCGGG CGCGCGCCCT GAAACTCGAC
GCTCATTTCC AGAACGCGCA GCAGGATGTC GGACAGCTCA TCGCCTCAGC CGACCGGATC
GCCCGGACCG GCGAGCGCAT CGACGAAATG GACTTTTCGG ATGCGCCTGG CGGCGACAAG
CTGAAGGCGG CCGAATAA
 
Protein sequence
MDRLVAWLNG APLTAPQLML AGVGAALVLL FIFAISLARS SRRNSVAAMA AAGRQYEIEQ 
ALGAIARQNA ELSGRMRAVA DSFGSRQSDL ARFVAARLDA VGERVGADVE ASGRNAGEQL
ARLNERLAVI DAAQARLTGL SQDMVGLKDI LANKQARGAF GQGRMEAIIS DALPSSAYAF
QHTLSNRMRP DCVIRMPGDP RLMVIDAKFP LEAFTAHKAA QSFEAKKHAA ARARADLGKH
IRDIAERYFL PEETQDIALM FVPSESLYAD INEHFDDIVQ KAHRSRIIIV SPSLLMMAMQ
LTQALVRDAR VREQTHVIQA EVRRLVEDVA RLRARALKLD AHFQNAQQDV GQLIASADRI
ARTGERIDEM DFSDAPGGDK LKAAE