Gene Msil_3914 details


Gene Information

Locus tag: Msil_3914
Symbol:
ID: 7092611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 4292123
End bp: 4294015
Gene Length: 1893 bp
Protein Length: 630 aa
Translation table: 11
GC content: 62%
IMG OID: 643467199
Product: extracellular solute-binding protein family 5
Protein accession: YP_002364157
Protein GI: 217980010
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
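
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the record above.
    span = cds_span(4292123, 4294015)
    print(span, "bp")
    print(expected_protein_length(span), "aa")
```

Under these assumptions the span matches the listed gene length of 1893 bp, and 1893 / 3 - 1 matches the listed protein length of 630 aa.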


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGCGATC CATTGGCCAT CTCCCGCCGT TCCTTCGTGC AATATTCCGC CTGCGGCCTG 
ATCGCGCCCC GGTTTCTCAC GCCGCCGGCC TTTGCTGCCG AAGCCTTTGC CGCCGGAGAG
CGCGAGGTTC ACGGCCTCTC CGTGTTCGGC GATCTGGCGC TGCCTGCCGA TTTTCCGCAT
TTCGCCTATG TCAATCCAGA GGCTCCGAAG GGCGGCGAGA TCTCGCTGCA GGTGAGCTCA
ACCTCTGGCA ATCAGAATTT CACGACCTTC AATACGCTGA ACGCTTATAT CCTGAAGGGC
GACGGCGCCG CCGGCATGGG GCTCATCTTC GATTCGTTGA TGACGGGCAA TGCAGACGAG
CCGGATTCGC TTTACGGCCT CGTCGCTCGC GCGGTGCGGG TCTCGGCCGA TCGCAGCGTC
TATCGCTTCC TGTTGCGCAA GGAGGCCCGT TTTCACGATG GGTCGCCGCT GACCGCCGCG
GACGTCGCCT TTTCCCTCAA CATCTTAAAG GCCAAGGGTC ATCCCTCTAT TCGTCAGGCG
CTGCGGGACC TCGACGCCGC CGAGGACGAG GCCGACGACA TCGTGATGGT GCGGCTAAAG
TCCCAACGCA GCCGCGAAGC GCCTTTGATC GTCGCGGGCC AGCCGATTTT CAGCGCGGCT
TATTACAAGA CCCGTGATTT TGACCAGACG ACGCTGGAGC CGCCGCTCGG CTCCGGCGGC
TATAAGGTCG GCCGGTTCGA TCAGGGCCAT TTCATCAGTT TCGAACGCGT CGCGGATTAT
TGGGGCAAGG ATTTGCCGGT CAATATCGGC CAGTCCAATT TCGACCGGGT TCGTTTCGAA
TATTTCGGCG ATCGCAAGGT CGCGTTCGAG GCCTTCAAGG CGGGCGTATT CAGCTTCCGC
GAGGAATTCA CCTCGGCGGT CTGGGCCACG GGGTATGATT TCGCCGCGGT CAAGGACGGC
AGGGTGCAGC GCGCGACGCT TCCCGACGAA TCGCCGATGG GAACGCAGGG CTGGTTCCTC
AACATGCGCC GCGACAAATT CAGGGATCCG CGGATCAGGG AGGCGATCGG CCTCGCCTTT
GACTTCGAAT GGACCAACCG CAACATCATG TATGGGGTCT ATTCGCGCAC GGTTTCCTTT
TTCCAGAATT CGCCGATGGC GGCGCAGGGC AAGCCCTCCC CCGAGGAGCT GGCCTTGCTC
GAACCCTGTC GCGGCGAACT GTCGCCAGAC GTGTTCGGCG AGGTCTATAC CCCGCCCGTC
TCGGACGGCT CGGGCCAGGA CCGCGCCCTC CTGCGCCGCG CCAATGATCT CTTCGTTTCG
GCCGGATGCA AACGGCAGGG GTCCGCCCTG ATCCTGCCGG ACGGCAAGCC GTTCGAGATT
GAATTTCTCG ACTTCGACGG CGCCCTCGAG CCGCATACGG CGCCGTTCAT CAAGAACCTC
AAGCTGCTCG GAATCGAGGC GCGCTATCGC GTCGTCGACG CGGCGCAATA CAAGCGCCGG
ACCGACGATT TCGATTATGA CATCGTCACC TCGCGCTTCG GGCTCGGCCT GACGCCGGGG
GAGGGCATGC GCGCGACCTT CGGTTCCGAA GCCGCCGACA TGGCCGGCTC GCGCAATGTG
TCGGGCATCA AGAACAAGGC AGTCGACGCG CTGATCGAAA AAGCGCTTGT CGCCGAGACG
CGCGAGGAGC TGACTTTCAT CTGCCGCTCG ATCGACCGCA TCCTGCGCGC CATGCACCCC
TGGGTTCCGA TGTGGAACAA GCCCAATCAT CTCGTCGCCT ATTGGGATCT GTTCAGCCGG
CCCGAGCGCA GCGGGCGCTA CGAGATCGGC GTGCTCAATA GCTGGTGGTA TGATGAAGAG
AAGGCCAAAC GCATTAATTT CGCCGGCCGC TGA
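
The GC content listed in the Gene Information section can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown above; the function name `gc_percent` is hypothetical and the rounding used by the database is not known.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    # Remove spaces and line breaks used to lay out the sequence.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence, as an example input.
    print(gc_percent("TTGCGCGATC CATTGGCCAT"))
```

Passing the full 1893 bp sequence instead of the two example blocks gives a value that can be compared with the listed 62%.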
 
Protein sequence
MRDPLAISRR SFVQYSACGL IAPRFLTPPA FAAEAFAAGE REVHGLSVFG DLALPADFPH 
FAYVNPEAPK GGEISLQVSS TSGNQNFTTF NTLNAYILKG DGAAGMGLIF DSLMTGNADE
PDSLYGLVAR AVRVSADRSV YRFLLRKEAR FHDGSPLTAA DVAFSLNILK AKGHPSIRQA
LRDLDAAEDE ADDIVMVRLK SQRSREAPLI VAGQPIFSAA YYKTRDFDQT TLEPPLGSGG
YKVGRFDQGH FISFERVADY WGKDLPVNIG QSNFDRVRFE YFGDRKVAFE AFKAGVFSFR
EEFTSAVWAT GYDFAAVKDG RVQRATLPDE SPMGTQGWFL NMRRDKFRDP RIREAIGLAF
DFEWTNRNIM YGVYSRTVSF FQNSPMAAQG KPSPEELALL EPCRGELSPD VFGEVYTPPV
SDGSGQDRAL LRRANDLFVS AGCKRQGSAL ILPDGKPFEI EFLDFDGALE PHTAPFIKNL
KLLGIEARYR VVDAAQYKRR TDDFDYDIVT SRFGLGLTPG EGMRATFGSE AADMAGSRNV
SGIKNKAVDA LIEKALVAET REELTFICRS IDRILRAMHP WVPMWNKPNH LVAYWDLFSR
PERSGRYEIG VLNSWWYDEE KAKRINFAGR
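
The gene sequence begins with TTG while the protein sequence begins with M, which is consistent with TTG acting as an alternative start codon under translation table 11. The sketch below illustrates that relationship. It is not the database's own pipeline: the function name `translate_cds` is hypothetical, the codon table is built from the standard amino acid assignments (which table 11 shares), and only ATG, GTG and TTG are treated as initiators here, which is a subset of the start codons defined for table 11.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG codon order (shared by tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of the table 11 initiators, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading an initiator codon in first position as M."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codons are read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends the protein
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate_cds("TTGCGCGATC CATTGGCCAT CTCCCGCCGT"))
```

For the three example blocks this gives MRDPLAISRR, the first ten residues of the protein sequence above.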