Gene Mchl_1332 details


Gene Information

Locus tag: Mchl_1332
Symbol:
ID: 7116305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 1370831
End bp: 1371772
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 73%
IMG OID: 643524109
Product: aliphatic sulfonates family ABC transporter, periplasmic ligand-binding protein
Protein accession: YP_002420144
Protein GI: 218529328
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.449207
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.661146
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCGCT GGCTCGTCCT GATCGCCCTG GCGCTCGCCA GCCTCGCGCC CGCCCGCGCG 
GAGGAGGTCT TGCGCGTCGG CGACCAGCGC GGCAACGCCC GCGCCCTGAT GGAGGCCACG
GGCGTGCTCG ACGGCCTCCC CTACCGGCTG GAATGGAGCG AGTTTCCGGC GGCCGCCCCG
CTGCTGGAAG CGCTGAATGC CGGCGTCATC GATGCGGGCG GCGTGGGCGA TGGCCCCTTC
ACCTTCGCCG CCGCCGCGGG GGTTCCGGTC AAGGCCTTTC TGGCCTTCCG CAACCGGCAG
GACGGGCTCG CCATCCTCGT GCAGCCCGAT TCCGCCATCC GCACCGTGGC GGATCTCCAG
GGCAAGCGGA TCGCCACCAA CCGCGGCTCG ATCGGCCACC AGGTCGTCCT CGCCGCCCTC
GAAGAAGCGG GGCAGCCCGC CGACAGCGTG CAGTTTCGCT TCCTGCCGCC GGCCGACGCC
AAGTTGGCGC TGACTTCCGG CGCGGTCGAT GCGTGGTCGA CCTGGGAGCC CTACACCTCC
GCGGCCGAAC TCGCCGGCCT CGTGCGGGTG CTCCGCGACG GCAACGGCAT CACCCCGGGC
CTGAGCTACG CGGTGGCGAG CGACGCCGCG CTGAAATCCA AGCGCGCCCT GCTCGCCGAC
TACGCCGCCC GCCTTGCCAG GGCCCGAGCC CGGGCGCTGA CCGATCCGGC GCCCTACGCT
GCCGCGTGGT CGCGGCTGAT CGGCCTGCCC GAGGCGGTGC CGCTGCGCTG GTTCGGGCGC
GCGCGCTACC GCACCGTGCC GATCGATGAC GCCGTGATCG CCGACGAGCA GCGCATCATC
GACCTCTATG TGCGGGCCGG ACTGATCCCG GCGGCGCGAG CCCCGCGCGC CGAGGCGATC
CTCGATACCG GGTTTTCGGA CGCGCTTGCC GCCGTGCGAT GA
 
Protein sequence
MARWLVLIAL ALASLAPARA EEVLRVGDQR GNARALMEAT GVLDGLPYRL EWSEFPAAAP 
LLEALNAGVI DAGGVGDGPF TFAAAAGVPV KAFLAFRNRQ DGLAILVQPD SAIRTVADLQ
GKRIATNRGS IGHQVVLAAL EEAGQPADSV QFRFLPPADA KLALTSGAVD AWSTWEPYTS
AAELAGLVRV LRDGNGITPG LSYAVASDAA LKSKRALLAD YAARLARARA RALTDPAPYA
AAWSRLIGLP EAVPLRWFGR ARYRTVPIDD AVIADEQRII DLYVRAGLIP AARAPRAEAI
LDTGFSDALA AVR
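
The record's fields can be cross-checked against the sequences: 942 bp is 314 codons, that is, 313 amino acids plus one stop codon, and the protein should be the translation of the gene under translation table 11. The following is a minimal Python sketch of such a check, not the database's own procedure. The function names (clean_sequence, gc_content, translate) are hypothetical helpers introduced here for illustration. The codon-to-amino-acid string is the standard mapping that table 11 shares with table 1; the alternative start codons allowed by table 11 are not handled.

```python
# Illustrative helpers (hypothetical names, not part of the source record).

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
# This mapping is shared by NCBI translation tables 1 and 11.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = clean_sequence(
        "ATGGCCCGCT GGCTCGTCCT GATCGCCCTG GCGCTCGCCA GCCTCGCGCC CGCCCGCGCG"
    )
    print(len(first_line), translate(first_line))
```

Pasting the full gene sequence into clean_sequence allows the same comparison against the reported gene length, GC content, and protein sequence.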