Gene Msil_0067 details


Gene Information

Locus tag: Msil_0067
Symbol:
ID: 7090382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 59454
End bp: 60584
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 64%
IMG OID: 643463400
Product: Extracellular ligand-binding receptor
Protein accession: YP_002360412
Protein GI: 217976265
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
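
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields above.
start_bp = 59454
end_bp = 60584
gene_length_bp = 1131
protein_length_aa = 376


def span_length(start, end):
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one stop codon (assumes a single terminal stop)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)       # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)   # CDS length agrees with Protein Length
```

Under these assumptions, the listed gene length and protein length are consistent with the listed coordinates.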


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.188079
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCTGT TCCGCCCAAT CCTCCGCTGC CTGCTCGCCG CCGCATTCCT CGCCTCGGCG 
ACTTTTGCGC GCGCCGCCGA TCCGCTCACG ATCGGCGCGA TCGAAATTTT GAGTGGACCG
AACGCCAAAT ATGGCGTCGC CATCAAGCAG GGCTTCGACC TTGGCCTCGA TGAAATCAAC
CGCGGCGGCG GCGTGCGCGG GGTTCCGCTG GCCATCGCTT ACGAGGATTC CGCCGGCAGC
AAGGAGCAGG CGATCAACGC CGCGCGCCAG CTGATCGGCC GCGCCAAGGT TCCGCTGCTG
CTCGGACCGA CGCTCTCGAC CGAAATGTTC GCGGTCGGTC CGATCGCCAA TCAGCGCCAG
ACGCCGATCA TCGGCACATC GACGACCGCG ATTGGGGTCA CCGACATCGG TCCTTTCGTG
TTCCGCACCT CTTTGCCCGA AGCCGATGTG ATCCCCGTGA CGCTGCGCGC CGCTCGCGAC
AAGCTGGGCG TCAAGAAAGT CGCAGTGCTG TACGGCCGTG ACGACGCCTT CACCAAATCG
GCCTATGACG TCATGAAGGC GGCGCTCGCC GAGCTGGGCT TTGAGGTTCT GACGACGGAG
ACATTCGGCG CCAAGGACAC GGATTTTTCC GCGCAGTTGA CCAAGATCGC CAGCCTCAAT
CCCGACGCGA TCGTCATCTC CGCGCTGGCC GACGCCGGGG CCGGCATTCT GCTCGCAAAG
CAAGCGCTCG GCCTGCCGCA AACGGTGCGT GCGATCGGCG GCAATGGCAT GAATTCACCG
AAGGTGCTCG AGATCGCTGG GCCCGCCGCG GATGGGCTTC TCGTCGGCAG CCCGTGGTTC
GTCGGCAAGA GCGATCCACT CAATGCAAAG TTCATCGAAG CCTATCGCGC GAAATATGGC
TCCGATCCCG ACCAATTTGC GGCGCAGGCC TATGACACGA TCAAGATCGC GGCGAGCGCG
CTGAACGAGG CCAAGGATCT CAGCGGCGAA GCCATCCGCG ACGCGCTCCT CAAAACCAAG
ATCGAGGGCG TCATGGGCCA ATTCGCCTTC ACGCCGGACC GGAGCCCGGC CGTCGTCTCG
GGCGTCCTTG TCCTCGAAGC CGCCGGCGGC AAATTCACGA TTTTGAAATA G
 
Protein sequence
MILFRPILRC LLAAAFLASA TFARAADPLT IGAIEILSGP NAKYGVAIKQ GFDLGLDEIN 
RGGGVRGVPL AIAYEDSAGS KEQAINAARQ LIGRAKVPLL LGPTLSTEMF AVGPIANQRQ
TPIIGTSTTA IGVTDIGPFV FRTSLPEADV IPVTLRAARD KLGVKKVAVL YGRDDAFTKS
AYDVMKAALA ELGFEVLTTE TFGAKDTDFS AQLTKIASLN PDAIVISALA DAGAGILLAK
QALGLPQTVR AIGGNGMNSP KVLEIAGPAA DGLLVGSPWF VGKSDPLNAK FIEAYRAKYG
SDPDQFAAQA YDTIKIAASA LNEAKDLSGE AIRDALLKTK IEGVMGQFAF TPDRSPAVVS
GVLVLEAAGG KFTILK
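
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that step together with a GC-fraction calculation; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence, copied as printed above.
first_line = "ATGATCCTGT TCCGCCCAAT CCTCCGCTGC CTGCTCGCCG CCGCATTCCT CGCCTCGGCG"
first_60 = join_blocks(first_line)

print(len(first_60))                 # number of bases on the first line
print(first_60[:3])                  # start codon
print(round(gc_fraction(first_60), 2))
```

Passing the full gene sequence through the same two helpers gives a value that can be compared with the GC content field, assuming that field is computed over the gene sequence alone.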