Gene M446_2186 details

Gene Information

Locus tag: M446_2186
Symbol:
ID: 6134738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 2435914
End bp: 2436888
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 72%
IMG OID: 641642413
Product: aliphatic sulfonate ABC transporter periplasmic ligand-binding protein
Protein accession: YP_001769081
Protein GI: 170740426
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
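
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute a residue.
    return gene_len // 3 - 1


length = gene_length(2435914, 2436888)
print(length)                  # 975
print(protein_length(length))  # 324
```

Both values agree with the Gene Length and Protein Length fields of this record.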


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.589842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.269946
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATCC ATCGCCGCGC GCTGATCGGC GCCGCGTTCG GCCTCGCCGG CCTGTTCTCT 
TTACAGAACC CGGCCGCGGC CCAGGCCGCC AAGGAGGTCC GGCTCGACTG GGCGACCTAC
AACCCGGTGA GCCTGCTCCT GAAGGAGAAG GGGCTCGTCG AGAAGGCGCT GGCGGCCGAC
GGCGTCAGCG TGCGCTGGGT GCAGTCGCTC GGCTCCAACA AGGCCCTGGA ATTCCTCAAC
GCAGGCTCGC TCGATTTCGG CTCGACGGCG GGGGCGGCGG CCCTGCTCGG GCGGATCAAC
GGCAACCCGA TCAAGTCCGT CTACGTCTAT TCCCGACCGG AATGGACCGC CCTCGTCACG
CGCCCGAATA CCGGCATCGC GGCGGTGAAG GACCTGAAGG GCAGGCGCGT CGCGGTCACC
CGCGGCACCG ACCCGCACAT CTTCCTGATC CGCGCCCTGC AGGGGGCCGG GCTGACCGAG
CGGGACGTGA AGCTCGTGCT GCTCCAGCAC CCGGACGGGC GCACGGCCCT CGACCGCGGC
GACGTCGATG CCTGGGCGGG CCTCGACCCG ATCATGGCGG CGGCCGAGAT CGAGACCGGC
GACGTGCTGT TCCACCGCGA TCCGGCCGCC AATACCTGGG GCGTGCTGAA CGTGCGGGAG
GATTTCGCCA AGGCGAACCC GGACCTGACC CGCAAGGTGC TGGCGGCCTA CGAGGAGGCG
CGCGCCCTCG CGGTGAGCCG GCCCGAGGAA CTGCGGCGCG CGCTCGTGGC GGCGACGAAG
CTGCCCGAGC CGGTGGTCGC CCGCCAGCTG GAGCGCACCG ACGTGTCCCA GCCGAATATC
GGGCCGGCCC AGGCCGAGTC GATCCTGGCG GCCGGCAAGG CCCTGCGCGA GGCCGGCGTG
ATCCCGGCCG GCACCGACGT CGAGGCGGCC GTCGACGCCC TGATCGACCG GCGCTTCAAC
ACCGCCGCGC GCTGA
 
Protein sequence
MRIHRRALIG AAFGLAGLFS LQNPAAAQAA KEVRLDWATY NPVSLLLKEK GLVEKALAAD 
GVSVRWVQSL GSNKALEFLN AGSLDFGSTA GAAALLGRIN GNPIKSVYVY SRPEWTALVT
RPNTGIAAVK DLKGRRVAVT RGTDPHIFLI RALQGAGLTE RDVKLVLLQH PDGRTALDRG
DVDAWAGLDP IMAAAEIETG DVLFHRDPAA NTWGVLNVRE DFAKANPDLT RKVLAAYEEA
RALAVSRPEE LRRALVAATK LPEPVVARQL ERTDVSQPNI GPAQAESILA AGKALREAGV
IPAGTDVEAA VDALIDRRFN TAAR
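
The sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be analysed. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its reading frame translated. It assumes the sense-codon assignments of translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons (this gene starts with ATG, so that does not matter here). The names `clean`, `gc_content` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown above.
first_line = "ATGCGCATCC ATCGCCGCGC GCTGATCGGC GCCGCGTTCG GCCTCGCCGG CCTGTTCTCT"
fragment = clean(first_line)
print(translate(fragment))  # MRIHRRALIGAAFGLAGLFS
print(round(gc_content(fragment), 2))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence through the same functions would allow the GC content and protein length fields to be checked against the record.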