Gene Information |
Locus tag | Msil_3914 |
Symbol | |
ID | 7092611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 4292123 |
End bp | 4294015 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643467199 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002364157 |
Protein GI | 217980010 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
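The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. `cds_lengths` is a hypothetical helper name used only for illustration; it is not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1          # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the record for Msil_3914
print(cds_lengths(4292123, 4294015))  # (1893, 630)
```

Under these assumptions the result agrees with the Gene Length (1893 bp) and Protein Length (630 aa) fields.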
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGCGATC CATTGGCCAT CTCCCGCCGT TCCTTCGTGC AATATTCCGC CTGCGGCCTG ATCGCGCCCC GGTTTCTCAC GCCGCCGGCC TTTGCTGCCG AAGCCTTTGC CGCCGGAGAG CGCGAGGTTC ACGGCCTCTC CGTGTTCGGC GATCTGGCGC TGCCTGCCGA TTTTCCGCAT TTCGCCTATG TCAATCCAGA GGCTCCGAAG GGCGGCGAGA TCTCGCTGCA GGTGAGCTCA ACCTCTGGCA ATCAGAATTT CACGACCTTC AATACGCTGA ACGCTTATAT CCTGAAGGGC GACGGCGCCG CCGGCATGGG GCTCATCTTC GATTCGTTGA TGACGGGCAA TGCAGACGAG CCGGATTCGC TTTACGGCCT CGTCGCTCGC GCGGTGCGGG TCTCGGCCGA TCGCAGCGTC TATCGCTTCC TGTTGCGCAA GGAGGCCCGT TTTCACGATG GGTCGCCGCT GACCGCCGCG GACGTCGCCT TTTCCCTCAA CATCTTAAAG GCCAAGGGTC ATCCCTCTAT TCGTCAGGCG CTGCGGGACC TCGACGCCGC CGAGGACGAG GCCGACGACA TCGTGATGGT GCGGCTAAAG TCCCAACGCA GCCGCGAAGC GCCTTTGATC GTCGCGGGCC AGCCGATTTT CAGCGCGGCT TATTACAAGA CCCGTGATTT TGACCAGACG ACGCTGGAGC CGCCGCTCGG CTCCGGCGGC TATAAGGTCG GCCGGTTCGA TCAGGGCCAT TTCATCAGTT TCGAACGCGT CGCGGATTAT TGGGGCAAGG ATTTGCCGGT CAATATCGGC CAGTCCAATT TCGACCGGGT TCGTTTCGAA TATTTCGGCG ATCGCAAGGT CGCGTTCGAG GCCTTCAAGG CGGGCGTATT CAGCTTCCGC GAGGAATTCA CCTCGGCGGT CTGGGCCACG GGGTATGATT TCGCCGCGGT CAAGGACGGC AGGGTGCAGC GCGCGACGCT TCCCGACGAA TCGCCGATGG GAACGCAGGG CTGGTTCCTC AACATGCGCC GCGACAAATT CAGGGATCCG CGGATCAGGG AGGCGATCGG CCTCGCCTTT GACTTCGAAT GGACCAACCG CAACATCATG TATGGGGTCT ATTCGCGCAC GGTTTCCTTT TTCCAGAATT CGCCGATGGC GGCGCAGGGC AAGCCCTCCC CCGAGGAGCT GGCCTTGCTC GAACCCTGTC GCGGCGAACT GTCGCCAGAC GTGTTCGGCG AGGTCTATAC CCCGCCCGTC TCGGACGGCT CGGGCCAGGA CCGCGCCCTC CTGCGCCGCG CCAATGATCT CTTCGTTTCG GCCGGATGCA AACGGCAGGG GTCCGCCCTG ATCCTGCCGG ACGGCAAGCC GTTCGAGATT GAATTTCTCG ACTTCGACGG CGCCCTCGAG CCGCATACGG CGCCGTTCAT CAAGAACCTC AAGCTGCTCG GAATCGAGGC GCGCTATCGC GTCGTCGACG CGGCGCAATA CAAGCGCCGG ACCGACGATT TCGATTATGA CATCGTCACC TCGCGCTTCG GGCTCGGCCT GACGCCGGGG GAGGGCATGC GCGCGACCTT CGGTTCCGAA GCCGCCGACA TGGCCGGCTC GCGCAATGTG TCGGGCATCA AGAACAAGGC AGTCGACGCG CTGATCGAAA AAGCGCTTGT CGCCGAGACG CGCGAGGAGC TGACTTTCAT CTGCCGCTCG ATCGACCGCA TCCTGCGCGC CATGCACCCC TGGGTTCCGA TGTGGAACAA GCCCAATCAT CTCGTCGCCT ATTGGGATCT GTTCAGCCGG 
CCCGAGCGCA GCGGGCGCTA CGAGATCGGC GTGCTCAATA GCTGGTGGTA TGATGAAGAG AAGGCCAAAC GCATTAATTT CGCCGGCCGC TGA
|
Protein sequence | MRDPLAISRR SFVQYSACGL IAPRFLTPPA FAAEAFAAGE REVHGLSVFG DLALPADFPH FAYVNPEAPK GGEISLQVSS TSGNQNFTTF NTLNAYILKG DGAAGMGLIF DSLMTGNADE PDSLYGLVAR AVRVSADRSV YRFLLRKEAR FHDGSPLTAA DVAFSLNILK AKGHPSIRQA LRDLDAAEDE ADDIVMVRLK SQRSREAPLI VAGQPIFSAA YYKTRDFDQT TLEPPLGSGG YKVGRFDQGH FISFERVADY WGKDLPVNIG QSNFDRVRFE YFGDRKVAFE AFKAGVFSFR EEFTSAVWAT GYDFAAVKDG RVQRATLPDE SPMGTQGWFL NMRRDKFRDP RIREAIGLAF DFEWTNRNIM YGVYSRTVSF FQNSPMAAQG KPSPEELALL EPCRGELSPD VFGEVYTPPV SDGSGQDRAL LRRANDLFVS AGCKRQGSAL ILPDGKPFEI EFLDFDGALE PHTAPFIKNL KLLGIEARYR VVDAAQYKRR TDDFDYDIVT SRFGLGLTPG EGMRATFGSE AADMAGSRNV SGIKNKAVDA LIEKALVAET REELTFICRS IDRILRAMHP WVPMWNKPNH LVAYWDLFSR PERSGRYEIG VLNSWWYDEE KAKRINFAGR
|
| |
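The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11, where TTG can act as an initiation codon. The sketch below illustrates this on short excerpts copied from the two sequences above. It is a minimal illustration, not the database's own pipeline: `translate_cds` is a hypothetical helper, the amino acid assignments are those of the standard code (which table 11 shares), and `START_CODONS` is the table 11 initiator set as assumed here.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in TCAG codon order (shared by tables 1 and 11)
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed table 11 initiation codons; each is read as M at position 1
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a plus-strand CDS, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3          # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")               # initiator codon codes for Met
            continue
        aa = CODON_TABLE.get(codon, "X")      # X for ambiguous bases
        if aa == "*":                         # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence
print(translate_cds("TTGCGCGATC CATTGGCCAT CTCCCGCCGT"))     # MRDPLAISRR
# Last in-frame 33 bases of the gene sequence, including the TGA stop
print(translate_cds("AAGGCCAAAC GCATTAATTT CGCCGGCCGC TGA"))  # KAKRINFAGR
```

Both excerpts reproduce the first and last ten residues of the protein sequence listed above.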