Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3812 |
Symbol | |
ID | 7090740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 4173556 |
End bp | 4175142 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643467097 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002364056 |
Protein GI | 217979909 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0629305 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTTC CGTCGCGGCT TCTTTGCGCG CTGGCGCTCT GTCTCGGCCT TGTCCAGCCC GCCTGCGCCG AGATGGTCTG GCGGCGCGGC GCGCTCGGCG ATCCCGGCTC GCTCGATCCG CACAAGGCGA CGACGCTCAT CGAAAGCAAT GTGCTCGGCG AGCTCTTCGA GGGGCTGCTT TCGCGCAACG CCGCGGGCGC GCTTATTCCC GGCGTCGCCG AAAGCTGGAG CGTCGCGCCG GATGGGCGCG TCTATAAATT CAAGCTACGC GAGGACGCCA AATGGTCGAA CGGCGATCCG GTCACGGCGG AGGATTTCGT TTTCGCCTTC CGTCGCCTGA TGGACCCGCG CACCGGCGCG CCCTACGCCA ATATTCTCTA CGCGCTGAAG AATGGCGAAC AGGTCAATTC CGGGGCCTTG CCGCCAGATG CGCTGGGCGC CCGGGCGTTG GGCGAGCGCG AGCTTGAGCT GACCCTCGAA CAACCCGTGC CCTATTTTCT GGAGCAGTTG GCGCATTTCA CCGCAAAGCC GCTGCATCGC AAATCCATCG AGGCGTTCGG CTCCGATTTC GCCCATCCCG AGCATGTCGT CGCCAACGGT CCGTTCCGGC TCAAAAAATT CATTCCCAAT GATGCGATCG TGCTGGAGAA AAACCCGCGC TTTTGGGACG CCGGCAAGAT TGCGCTCGAC CGCGAGATCT TCATTCCGCT CGAGGATCGC TCGGCGGCGC TGCGTCGCTT CATGGCCGGC GAGATCGATT CCTATGATGA AGTCCCGGTT GAGGAGATCG GCTTCGTGCG CAAAACGCTG TCGGGGGCGC TGCATCTTTC GCCGAGCCTT GGCGGCTATT ATTACGCGCT CGATACGCGC CGCCCGCCCT TCGACGACGC GCGGGTGCGT CAGGCGCTCG CGATGGCGAT CGATCGGGAG TTTTTGGCCG AAAAGATCTG GGGCGGCTCG ATGGCGCCCG GATACAGCTT CGTTCCGCCC GGCGTCGCAA GTTATGGCGC GCCCGCCGAG GTCGCATGGA AAGATTTGAG CTTTCCCGAA CGGCAGGAGC AGGCGCGGCG TCTGCTCAAG GAGGCCGGGT TCGGCGAGGG CGGCAAGACG CTCGAGGTCG AGATCCGCTT CAACAATTCA GGCAGCCACC GAACGACTGC GGTCGCCATC GCCGATATGT GGATGAGGCT CGGGGTGAAG GCGAGCCTGA TCGGCACGGA CGCCTCCACC CACTATGCTC TATTGCGCGA GAAGCCGCCG TTCGACGCCG CGCGGATGAG CTGGTACGCC GATTATCCTG ATGCGCAGAA TTTTCTGTTT CTCGCCGAAA GCGCCAATAA GGGCTTGAAT ACGCCGAGCT TTTCCAACCC CGAATTCGAT GCGCTGATGC GGCGGGCGGC CGAGGAGCAA AATTCCGATG CGCGCAAGAC GCGGCTTCAC GAAGCCGAAG CGCTGCTCCT CAGAGAGCAG CCCTTTATCG TCTTGATGAA TTATCGGTCG AGCCACCTCG TCTCGCCGAA GCTCAAAGGT TTTGAGCCGA ATGCGCTCGA CATTCATCCG GGGCGGTACG TCTCGATCGC GCGATGA
|
Protein sequence | MTLPSRLLCA LALCLGLVQP ACAEMVWRRG ALGDPGSLDP HKATTLIESN VLGELFEGLL SRNAAGALIP GVAESWSVAP DGRVYKFKLR EDAKWSNGDP VTAEDFVFAF RRLMDPRTGA PYANILYALK NGEQVNSGAL PPDALGARAL GERELELTLE QPVPYFLEQL AHFTAKPLHR KSIEAFGSDF AHPEHVVANG PFRLKKFIPN DAIVLEKNPR FWDAGKIALD REIFIPLEDR SAALRRFMAG EIDSYDEVPV EEIGFVRKTL SGALHLSPSL GGYYYALDTR RPPFDDARVR QALAMAIDRE FLAEKIWGGS MAPGYSFVPP GVASYGAPAE VAWKDLSFPE RQEQARRLLK EAGFGEGGKT LEVEIRFNNS GSHRTTAVAI ADMWMRLGVK ASLIGTDAST HYALLREKPP FDAARMSWYA DYPDAQNFLF LAESANKGLN TPSFSNPEFD ALMRRAAEEQ NSDARKTRLH EAEALLLREQ PFIVLMNYRS SHLVSPKLKG FEPNALDIHP GRYVSIAR
|
| |