Gene Msil_2086 details


Gene Information

Locus tag: Msil_2086
Symbol:
ID: 7091452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2259447
End bp: 2260565
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 63%
IMG OID: 643465410
Product: Extracellular ligand-binding receptor
Protein accession: YP_002362387
Protein GI: 217978240
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
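
The listed lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that adds no residue; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 2259447
end_bp = 2260565

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1119
print(protein_length)  # 372
```

Both values agree with the Gene Length and Protein Length fields.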


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCCA GCTTCGCTCG CCTCAGCCTG ATGACGGCTT TTTTCCTCGC CGCCGTCGCG 
CCCCCCGCGA TCGCTGAAGT GCGCTTTGGC GTCGGCGCGC CGATCACCGG CCCCGACGCG
TCTTTCGGCG CACAATTGCG CAACGGCGCC GAGCAGGCCG TCGCCGACAT CAATGCGGCC
GGCGGCATTC TCGGCGAGAA AGTCACGCTG CGCGTCGGCG ACGATGGCGC GGACCCAAAG
CAGGGCGTCT CCGTCGCCAA TAAATTCGTC GGCGATCAGG TGTCTGTCGT GATCGGCCAT
TTCAACTCCG GCGTGAGCCT GCCCGCCTCG GACGTCTACG CCGAAGCCAA TATTTTGCAG
ATCACGCCGG GATCGACCAA TCCCAAAATC ACCGATCGCG GCATCGAGAC GCTTTTTCGC
ACCTGCGGCC GCGACGACCA GCAGGGGGCG GTCGCCGCCA AATTCCTCGC CGGGCGGGGC
TTTAAGAAGA TCGCCATCAT CCACGACAAG ACGACCTATG GCAAAGGACT CGCCGACGAG
ACGCGCAAGA GCCTCGAGGC GCTCGGCGTC AAGGACGTGC TCTATGAGGG GATCAACAAG
GGCGAGAAGG ATTATTCGGC GATCGTCTCC AAGATCAAGC AATCCGGGGC TGACGTCATC
TATTGGGGCG GCGTCCACAC CGAGGGCGGC CTGCTGCTGC GCCAGATGCG CGATCAAGGC
GTCGAAACGC CGATGATGGG CGGCGACGGC ATCGCCTCCG ACGAATTCGC CGCGATCGCC
GGCCCCGGCG TCGAGGGAAC CTTCATGACC TTCCCGCCCG ACCCGCGCGA GCGGCCGGAA
GCGGCGAAAG TAGTGGCGGA ATTCAAGGCG AAGAATTTTA ATCCCGAAAC CTACACGCTC
TATTCCTACG CGGCGGTGGA GGTGGTGAAG CAGGCGGCGG AGGCGGCCAA ATCGCTCGAC
GCCGCCGAGA TCGCCAAGAC GATCCATTCC GGCATGGTCT TCAATACGGT GATCGGCCCG
ATCAGCTTCG ACAAGAAAGG CGACGTGACG CGCGCCGATT ATGTCGTCTT CCTCTGGAAA
AAGGGGCCCG ACGGCAAGAT CAGCTATTAC CAGATGTGA
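
The sequence is printed in space-separated blocks of ten bases, so the spacing has to be removed before any analysis. The following is a minimal sketch of that clean-up together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the record are used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAACCCA GCTTCGCTCG CCTCAGCCTG ATGACGGCTT TTTTCCTCGC CGCCGTCGCG"
last_line = "AAGGGGCCCG ACGGCAAGAT CAGCTATTAC CAGATGTGA"

fragment = clean_sequence(first_line)
print(len(fragment))                    # 60 bases in a full line
print(fragment[:3])                     # ATG, the start codon
print(clean_sequence(last_line)[-3:])   # TGA, the stop codon
print(f"{gc_fraction(fragment):.0%}")   # GC content of this one line only
```

Running `gc_fraction` on the full cleaned sequence gives a figure that can be compared with the GC content field in the Gene Information block.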
 
Protein sequence
MKPSFARLSL MTAFFLAAVA PPAIAEVRFG VGAPITGPDA SFGAQLRNGA EQAVADINAA 
GGILGEKVTL RVGDDGADPK QGVSVANKFV GDQVSVVIGH FNSGVSLPAS DVYAEANILQ
ITPGSTNPKI TDRGIETLFR TCGRDDQQGA VAAKFLAGRG FKKIAIIHDK TTYGKGLADE
TRKSLEALGV KDVLYEGINK GEKDYSAIVS KIKQSGADVI YWGGVHTEGG LLLRQMRDQG
VETPMMGGDG IASDEFAAIA GPGVEGTFMT FPPDPRERPE AAKVVAEFKA KNFNPETYTL
YSYAAVEVVK QAAEAAKSLD AAEIAKTIHS GMVFNTVIGP ISFDKKGDVT RADYVVFLWK
KGPDGKISYY QM
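
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the compact NCBI layout (bases in T, C, A, G order). Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, so the same mapping is used here. The `translate` helper is hypothetical, and only the first line of the gene sequence is translated.

```python
from itertools import product

# Amino acids for all 64 codons in NCBI order (each position cycles T, C, A, G).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record, with spacing removed.
first_line = "".join(
    "ATGAAACCCA GCTTCGCTCG CCTCAGCCTG ATGACGGCTT TTTTCCTCGC CGCCGTCGCG".split()
)
print(translate(first_line))  # MKPSFARLSLMTAFFLAAVA
```

The output matches the first 20 residues of the protein sequence above.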