Gene Msil_3812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3812 
Symbol 
ID7090740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4173556 
End bp4175142 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content63% 
IMG OID643467097 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002364056 
Protein GI217979909 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0629305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTTC CGTCGCGGCT TCTTTGCGCG CTGGCGCTCT GTCTCGGCCT TGTCCAGCCC 
GCCTGCGCCG AGATGGTCTG GCGGCGCGGC GCGCTCGGCG ATCCCGGCTC GCTCGATCCG
CACAAGGCGA CGACGCTCAT CGAAAGCAAT GTGCTCGGCG AGCTCTTCGA GGGGCTGCTT
TCGCGCAACG CCGCGGGCGC GCTTATTCCC GGCGTCGCCG AAAGCTGGAG CGTCGCGCCG
GATGGGCGCG TCTATAAATT CAAGCTACGC GAGGACGCCA AATGGTCGAA CGGCGATCCG
GTCACGGCGG AGGATTTCGT TTTCGCCTTC CGTCGCCTGA TGGACCCGCG CACCGGCGCG
CCCTACGCCA ATATTCTCTA CGCGCTGAAG AATGGCGAAC AGGTCAATTC CGGGGCCTTG
CCGCCAGATG CGCTGGGCGC CCGGGCGTTG GGCGAGCGCG AGCTTGAGCT GACCCTCGAA
CAACCCGTGC CCTATTTTCT GGAGCAGTTG GCGCATTTCA CCGCAAAGCC GCTGCATCGC
AAATCCATCG AGGCGTTCGG CTCCGATTTC GCCCATCCCG AGCATGTCGT CGCCAACGGT
CCGTTCCGGC TCAAAAAATT CATTCCCAAT GATGCGATCG TGCTGGAGAA AAACCCGCGC
TTTTGGGACG CCGGCAAGAT TGCGCTCGAC CGCGAGATCT TCATTCCGCT CGAGGATCGC
TCGGCGGCGC TGCGTCGCTT CATGGCCGGC GAGATCGATT CCTATGATGA AGTCCCGGTT
GAGGAGATCG GCTTCGTGCG CAAAACGCTG TCGGGGGCGC TGCATCTTTC GCCGAGCCTT
GGCGGCTATT ATTACGCGCT CGATACGCGC CGCCCGCCCT TCGACGACGC GCGGGTGCGT
CAGGCGCTCG CGATGGCGAT CGATCGGGAG TTTTTGGCCG AAAAGATCTG GGGCGGCTCG
ATGGCGCCCG GATACAGCTT CGTTCCGCCC GGCGTCGCAA GTTATGGCGC GCCCGCCGAG
GTCGCATGGA AAGATTTGAG CTTTCCCGAA CGGCAGGAGC AGGCGCGGCG TCTGCTCAAG
GAGGCCGGGT TCGGCGAGGG CGGCAAGACG CTCGAGGTCG AGATCCGCTT CAACAATTCA
GGCAGCCACC GAACGACTGC GGTCGCCATC GCCGATATGT GGATGAGGCT CGGGGTGAAG
GCGAGCCTGA TCGGCACGGA CGCCTCCACC CACTATGCTC TATTGCGCGA GAAGCCGCCG
TTCGACGCCG CGCGGATGAG CTGGTACGCC GATTATCCTG ATGCGCAGAA TTTTCTGTTT
CTCGCCGAAA GCGCCAATAA GGGCTTGAAT ACGCCGAGCT TTTCCAACCC CGAATTCGAT
GCGCTGATGC GGCGGGCGGC CGAGGAGCAA AATTCCGATG CGCGCAAGAC GCGGCTTCAC
GAAGCCGAAG CGCTGCTCCT CAGAGAGCAG CCCTTTATCG TCTTGATGAA TTATCGGTCG
AGCCACCTCG TCTCGCCGAA GCTCAAAGGT TTTGAGCCGA ATGCGCTCGA CATTCATCCG
GGGCGGTACG TCTCGATCGC GCGATGA
 
Protein sequence
MTLPSRLLCA LALCLGLVQP ACAEMVWRRG ALGDPGSLDP HKATTLIESN VLGELFEGLL 
SRNAAGALIP GVAESWSVAP DGRVYKFKLR EDAKWSNGDP VTAEDFVFAF RRLMDPRTGA
PYANILYALK NGEQVNSGAL PPDALGARAL GERELELTLE QPVPYFLEQL AHFTAKPLHR
KSIEAFGSDF AHPEHVVANG PFRLKKFIPN DAIVLEKNPR FWDAGKIALD REIFIPLEDR
SAALRRFMAG EIDSYDEVPV EEIGFVRKTL SGALHLSPSL GGYYYALDTR RPPFDDARVR
QALAMAIDRE FLAEKIWGGS MAPGYSFVPP GVASYGAPAE VAWKDLSFPE RQEQARRLLK
EAGFGEGGKT LEVEIRFNNS GSHRTTAVAI ADMWMRLGVK ASLIGTDAST HYALLREKPP
FDAARMSWYA DYPDAQNFLF LAESANKGLN TPSFSNPEFD ALMRRAAEEQ NSDARKTRLH
EAEALLLREQ PFIVLMNYRS SHLVSPKLKG FEPNALDIHP GRYVSIAR