Gene Msil_1733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1733 
Symbol 
ID7093193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1883457 
End bp1884758 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content68% 
IMG OID643465056 
Productprotein of unknown function DUF763 
Protein accessionYP_002362041 
Protein GI217977894 
COG category[S] Function unknown 
COG ID[COG1415] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.610359 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGTC GCAGCGGAAA CGCTGACTTG CCGCTGCACT CCGGCCGCGT GCCCGCCTGG 
CTTGCGGACC GCATGACACT GCTCGGCGCG GCGATCAGCG AGGCCGTGAT CCTGCACTAC
GGCCGCGACG AATTCCTGCG CCGGCTCGCG CATCCGTTCT GGTTCCAATC CCTTGGCGCC
GTCATGGGAA TGGACTGGCA TTCGTCGGGG ATTACGACGA GCGTGCTCGG CGCCCTGAAG
CGGGGCCTTG CGCCGCGGGC TCGGGAGCTT GGAATTCATA TCTGCGGCGG CCGCGGCAAA
CATTCCCGCC AGACGCCGGC GGAGCTGATG CGCGCCGGCG AAGAGGCGGG ATTTGATGGA
GCGCCGCTGG CGGATGCGAG CCGCCTCGTC GCCAAGGTCG ACAGCGCCGC CGTGCAGGAC
GGCTTCGAGC TTTACCTGCA CAGCTTCATC GTCGCCGATG ACGGCAGCTG GGTCGTCGTC
CAACAGGGCA TGAACGGCGC GCTTCGGCAG GCGCGGCGCT ATCATTGGCT GTCAGAGGGG
CTGCGCAGCT TCGTCGACGA CCCGCATGCG GCGATCGACG GGCGCAAGGG CGCCGACATC
ATCAACCTCA CCGATCACCG CGCGGAATTT TCACGCGCGC GACAACTCGA TTTGCTGCAA
ACCCTCGGGC CGGATGGAAT CGCCAGCGAA TTCGCTTCGC TCGCGGAACG CGCGGCGCCG
ACGCCGAAGC CGCAGCTCGA GTTCCCCTTT CTCGTCATGC CGGCGCATCA TGAGGTTCGG
CCCGGCGATG TCATGATGCG CCGGCTGCGC GGCGCGCTTG CCGCTGCGGC CGATTGCGGT
CCGCAGGATT TCGCCGATCT CTTGATGACG CCGGGCGTCG GCGAGCGCAC GGTGCGCGCT
TTGGCGCTGG TCGCCGAAGT GGTGCATGGC GCGCCCTGCC GCTTCACCGA TCCGGCGCGG
TTCTCGCTCG CGCATGGCGG CAAGGACGGT CATCCCTTCC CCGTGCCCGT AAAAGTCTAC
GACGAGACGA TCCGCGTCAT GAAATCCGCC GTCCGCAAGG CCCGGCTCGG ACGCGCAGAG
GAGCTTGACG CCTTGAAACG TCTTGACGAT CAGGCGCGGC TTGCCGAGCG CGCCGCGACG
AAACCCTCTT TTGAGGCATT TGTTGCGGAA GAACGGCGCC TCTCGCCTTC TTATGGCGGC
AGGACCGTCG CCGGCGACGC CGCCCCCGCT CGAACAAAGC CAAGGCTCCC GCCGCGTGAT
GGATGGGGAA CGGGCGGGGC GGCGTCGGCG TTGCAACCAT GA
 
Protein sequence
MTRRSGNADL PLHSGRVPAW LADRMTLLGA AISEAVILHY GRDEFLRRLA HPFWFQSLGA 
VMGMDWHSSG ITTSVLGALK RGLAPRAREL GIHICGGRGK HSRQTPAELM RAGEEAGFDG
APLADASRLV AKVDSAAVQD GFELYLHSFI VADDGSWVVV QQGMNGALRQ ARRYHWLSEG
LRSFVDDPHA AIDGRKGADI INLTDHRAEF SRARQLDLLQ TLGPDGIASE FASLAERAAP
TPKPQLEFPF LVMPAHHEVR PGDVMMRRLR GALAAAADCG PQDFADLLMT PGVGERTVRA
LALVAEVVHG APCRFTDPAR FSLAHGGKDG HPFPVPVKVY DETIRVMKSA VRKARLGRAE
ELDALKRLDD QARLAERAAT KPSFEAFVAE ERRLSPSYGG RTVAGDAAPA RTKPRLPPRD
GWGTGGAASA LQP