Gene Msil_3826 details

Gene Information

Locus tag: Msil_3826
Symbol:
ID: 7090754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 4189189
End bp: 4191021
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 66%
IMG OID: 643467111
Product: extracellular solute-binding protein family 5
Protein accession: YP_002364070
Protein GI: 217979923
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
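
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4189189
end_bp = 4191021
gene_length_bp = 1833
protein_length_aa = 610


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 4191021 - 4189189 + 1 gives 1833 bp, and 1833 / 3 - 1 gives 610 aa, matching the reported Gene Length and Protein Length.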


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.0497538
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACG CCCCCCGCAA CAGCCGCTTC GCGGCGCTCG CCAGCGCTGT CCTTTGCTTC 
GCTTTGGCGC TGGCCGCGCC GGCGCTGGCG CAGCCCGCGC CGCGCGGGGC TATCGCCATG
CACGGGGAGC CGCAGCTTCC GGAGGATTTC GACCATCTGC CCTATGCCGA TCCGGCGGCG
CGCAAGGGCG GCAAGCTGTC GATCGGCTTT CCCGGCGCCT ATGACAGCCT CAATCCCTTC
AATCTCAAGG CCGGATCGAC GGCGCAGGGC CTCAACGGCA ATGTCTTCGA GACGCTGATG
ACGCGCTCGC TCGACGAACC TTTCACGCTC TACGGACTCA TCGCCGAAAG CGTCGAGACC
GACGCGGACC GCACCTATGT GACCTTCCGG CTCAATCCGG CCGCGCATTT TTCGGACGGA
ACGCCGATCG CGTCGGAGGA CGTGCGCTGG ACCTTCGAGC TTTTGAAGAC GCGCGGGCGG
CCGCAGCACC GCGCCGCCTA TTCCCTCGTC AAATCGGTCG AGACGCCCGA TCCGCTGACC
ATCCGCTATG AGCTCGGTTC GGGCGCCGAC CGCGAAATGC CCTTGACGCT TGCGCTGATG
CCCGTGCTGC CGCGCCATGC CGTCGATCTT TCGAAATTCG ACGACGCGAG CCTGACCGTT
CCGATCGGAT CGGGCCCCTA TAAGATCGCC GAGGTCAAGC CCGGCGAGCG GCTCGTGTTG
AAGCGCGATC CGAATTATTG GGCCAAGGAT CTGCCGGTCC GCCGCGGCCT TTATAATTTC
GACGAGATCG CCATCGACTA TTTTCGCGAC GCCAACAGCC TGTTCGAGTC CTTCGCCGCG
GGACTCCTCG ACTACCGCGA GGAGACGAGC CCGTCGCGCT GGACCAGCGC CTATGATTTT
CCGGCGATGC GCGAGGGGCG CGACCGCCGG GAGGCGCTGC CGGCCGGCGG CCCGAAAGGC
ATGGAAGGCT TCGTCTTCAA CCTGCGCCGG CCGCTCTTTG ACGACATCAG GGTGCGCGAG
GCGCTCGGCA TGATGTTCGA TTTCGAGTGG ATCAACGCCA ATCTCTACAG CGGACTCTAC
AAGCGCACGA AAAGCTTCTT CGACGAATCC GAACTCGCCT CGACGGGCCG GCCGGCCAGC
GCCGCCGAGC GCGCTTTGCT GGCGCCCTTT CCGGGCGCCG TGCGGGAGGA CATTCTCGAG
GGAAAATGGC GGCCGCCCGA AACCGACGGG TCGGGGCGCG ACCGGACCAT GCCAAAGCGC
GCGCTCGCCC TGCTCGAGCA GGCGGGCTAC CAGCTCGAGA ATGGCAAGCT CGTCAAGGAC
GGCGCGCCGC TCGCTTTCGA GATCATGGTG AAAGACCGCA ATCAGGAGCG GCTGGCGCTG
AACTATGCCG ATTCGCTCGG CCGGATCGGC GTTTCCGCCA AAGTGCGGCT GGTCGATGAA
GTGCAATATC AGCGCCGCCG TCAGAAATTC GACTTCGACA TGATGATCGG CAGCTGGCTC
GCCTCGGCCT CGCCCGGCAA TGAGCAGCGC TCGCGCTGGG GCTCGAAGAG CGCCGACCAG
GAGGCCTCGT TCAATCTCGC CGGCGTCAAA TCCCCCGCCG TCGACGCGCT GATCGCCGCC
ATGCTCGCCG CCCGCAGCCG CGAGGATTTC GTGACCGCGG TGCGCGCCTA TGACCGCGTA
CTGCTGTCCG GCTTCTATAT CGTGCCGCTG TTTCATTCGT CCGATCTGTG GACGGCGTCC
TCGACCGCGC TGGCGCGCCC GGCGGCGCTG CCCCGCTACG GCTCCCCGAC CGCGAGCTCG
ACCCTCGACA ATTGGTGGCG CAAGCAGCCT TGA
 
Protein sequence
MTNAPRNSRF AALASAVLCF ALALAAPALA QPAPRGAIAM HGEPQLPEDF DHLPYADPAA 
RKGGKLSIGF PGAYDSLNPF NLKAGSTAQG LNGNVFETLM TRSLDEPFTL YGLIAESVET
DADRTYVTFR LNPAAHFSDG TPIASEDVRW TFELLKTRGR PQHRAAYSLV KSVETPDPLT
IRYELGSGAD REMPLTLALM PVLPRHAVDL SKFDDASLTV PIGSGPYKIA EVKPGERLVL
KRDPNYWAKD LPVRRGLYNF DEIAIDYFRD ANSLFESFAA GLLDYREETS PSRWTSAYDF
PAMREGRDRR EALPAGGPKG MEGFVFNLRR PLFDDIRVRE ALGMMFDFEW INANLYSGLY
KRTKSFFDES ELASTGRPAS AAERALLAPF PGAVREDILE GKWRPPETDG SGRDRTMPKR
ALALLEQAGY QLENGKLVKD GAPLAFEIMV KDRNQERLAL NYADSLGRIG VSAKVRLVDE
VQYQRRRQKF DFDMMIGSWL ASASPGNEQR SRWGSKSADQ EASFNLAGVK SPAVDALIAA
MLAARSREDF VTAVRAYDRV LLSGFYIVPL FHSSDLWTAS STALARPAAL PRYGSPTASS
TLDNWWRKQP
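
The sequence blocks above are laid out in groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of how that might be done; the helper names are hypothetical, and the start-codon test only looks for ATG, which suits this record but is an assumption, since translation table 11 also permits other start codons.

```python
# Stop codons used by translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(block: str) -> str:
    """Join a space- and newline-separated sequence block into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, stop codon at the end."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First line of the gene sequence from this record, used as a short demo.
first_line = "ATGACGAACG CCCCCCGCAA CAGCCGCTTC GCGGCGCTCG CCAGCGCTGT CCTTTGCTTC"
print(len(clean_sequence(first_line)))
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_percent` and `looks_like_complete_cds` would allow a comparison against the Gene Length and GC content fields reported in the record.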