Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3826 |
Symbol | |
ID | 7090754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 4189189 |
End bp | 4191021 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643467111 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002364070 |
Protein GI | 217979923 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0497538 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACG CCCCCCGCAA CAGCCGCTTC GCGGCGCTCG CCAGCGCTGT CCTTTGCTTC GCTTTGGCGC TGGCCGCGCC GGCGCTGGCG CAGCCCGCGC CGCGCGGGGC TATCGCCATG CACGGGGAGC CGCAGCTTCC GGAGGATTTC GACCATCTGC CCTATGCCGA TCCGGCGGCG CGCAAGGGCG GCAAGCTGTC GATCGGCTTT CCCGGCGCCT ATGACAGCCT CAATCCCTTC AATCTCAAGG CCGGATCGAC GGCGCAGGGC CTCAACGGCA ATGTCTTCGA GACGCTGATG ACGCGCTCGC TCGACGAACC TTTCACGCTC TACGGACTCA TCGCCGAAAG CGTCGAGACC GACGCGGACC GCACCTATGT GACCTTCCGG CTCAATCCGG CCGCGCATTT TTCGGACGGA ACGCCGATCG CGTCGGAGGA CGTGCGCTGG ACCTTCGAGC TTTTGAAGAC GCGCGGGCGG CCGCAGCACC GCGCCGCCTA TTCCCTCGTC AAATCGGTCG AGACGCCCGA TCCGCTGACC ATCCGCTATG AGCTCGGTTC GGGCGCCGAC CGCGAAATGC CCTTGACGCT TGCGCTGATG CCCGTGCTGC CGCGCCATGC CGTCGATCTT TCGAAATTCG ACGACGCGAG CCTGACCGTT CCGATCGGAT CGGGCCCCTA TAAGATCGCC GAGGTCAAGC CCGGCGAGCG GCTCGTGTTG AAGCGCGATC CGAATTATTG GGCCAAGGAT CTGCCGGTCC GCCGCGGCCT TTATAATTTC GACGAGATCG CCATCGACTA TTTTCGCGAC GCCAACAGCC TGTTCGAGTC CTTCGCCGCG GGACTCCTCG ACTACCGCGA GGAGACGAGC CCGTCGCGCT GGACCAGCGC CTATGATTTT CCGGCGATGC GCGAGGGGCG CGACCGCCGG GAGGCGCTGC CGGCCGGCGG CCCGAAAGGC ATGGAAGGCT TCGTCTTCAA CCTGCGCCGG CCGCTCTTTG ACGACATCAG GGTGCGCGAG GCGCTCGGCA TGATGTTCGA TTTCGAGTGG ATCAACGCCA ATCTCTACAG CGGACTCTAC AAGCGCACGA AAAGCTTCTT CGACGAATCC GAACTCGCCT CGACGGGCCG GCCGGCCAGC GCCGCCGAGC GCGCTTTGCT GGCGCCCTTT CCGGGCGCCG TGCGGGAGGA CATTCTCGAG GGAAAATGGC GGCCGCCCGA AACCGACGGG TCGGGGCGCG ACCGGACCAT GCCAAAGCGC GCGCTCGCCC TGCTCGAGCA GGCGGGCTAC CAGCTCGAGA ATGGCAAGCT CGTCAAGGAC GGCGCGCCGC TCGCTTTCGA GATCATGGTG AAAGACCGCA ATCAGGAGCG GCTGGCGCTG AACTATGCCG ATTCGCTCGG CCGGATCGGC GTTTCCGCCA AAGTGCGGCT GGTCGATGAA GTGCAATATC AGCGCCGCCG TCAGAAATTC GACTTCGACA TGATGATCGG CAGCTGGCTC GCCTCGGCCT CGCCCGGCAA TGAGCAGCGC TCGCGCTGGG GCTCGAAGAG CGCCGACCAG GAGGCCTCGT TCAATCTCGC CGGCGTCAAA TCCCCCGCCG TCGACGCGCT GATCGCCGCC ATGCTCGCCG CCCGCAGCCG CGAGGATTTC GTGACCGCGG TGCGCGCCTA TGACCGCGTA CTGCTGTCCG GCTTCTATAT CGTGCCGCTG TTTCATTCGT CCGATCTGTG GACGGCGTCC TCGACCGCGC TGGCGCGCCC GGCGGCGCTG CCCCGCTACG GCTCCCCGAC CGCGAGCTCG ACCCTCGACA ATTGGTGGCG CAAGCAGCCT TGA
|
Protein sequence | MTNAPRNSRF AALASAVLCF ALALAAPALA QPAPRGAIAM HGEPQLPEDF DHLPYADPAA RKGGKLSIGF PGAYDSLNPF NLKAGSTAQG LNGNVFETLM TRSLDEPFTL YGLIAESVET DADRTYVTFR LNPAAHFSDG TPIASEDVRW TFELLKTRGR PQHRAAYSLV KSVETPDPLT IRYELGSGAD REMPLTLALM PVLPRHAVDL SKFDDASLTV PIGSGPYKIA EVKPGERLVL KRDPNYWAKD LPVRRGLYNF DEIAIDYFRD ANSLFESFAA GLLDYREETS PSRWTSAYDF PAMREGRDRR EALPAGGPKG MEGFVFNLRR PLFDDIRVRE ALGMMFDFEW INANLYSGLY KRTKSFFDES ELASTGRPAS AAERALLAPF PGAVREDILE GKWRPPETDG SGRDRTMPKR ALALLEQAGY QLENGKLVKD GAPLAFEIMV KDRNQERLAL NYADSLGRIG VSAKVRLVDE VQYQRRRQKF DFDMMIGSWL ASASPGNEQR SRWGSKSADQ EASFNLAGVK SPAVDALIAA MLAARSREDF VTAVRAYDRV LLSGFYIVPL FHSSDLWTAS STALARPAAL PRYGSPTASS TLDNWWRKQP
|
| |