Gene Mchl_2142 details


Gene Information

Locus tag: Mchl_2142
Symbol:
ID: 7116088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 2241842
End bp: 2242837
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 72%
IMG OID: 643524892
Product: putative ABC transporter periplasmic solute-binding protein
Protein accession: YP_002420917
Protein GI: 218530101
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
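
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2241842
end_bp = 2242837
gene_length_bp = 996
protein_length_aa = 331


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS that ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```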


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.297347
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.343222
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTCAC GGCGCGGAAT GCTCGCGACG GCGGCTCTGT CGCTGGCCGC GATGCTGCCG 
ATGGCGCGCC GCGCTCAGGC GGCGGGCGAG ACGTTCCGGC TCGGCGTCCT GCCCTTCGGC
ACCGCCTCCT GGGAGGCTGC CGTTATCAAG GCGCGGGGCT TTGATACGGC CAACGGCTTC
ACCCTCGACA TCGTCAAGCT CGCCGGCAAC GATGCCGCCC GCATCGCCTT CCTCGGCGGT
CAGGTCGATG CCATCGTCGG CGACCTGATC TTCGCCGCCC GCCTCGGCAA CGAGGGACGG
GGCGTGCGCT TCTCGCCCTA CTCCACCACC GAGGGGGCGC TGATGGTGCC CGCCGGAAGC
CCGATCACGG ATCTGAAGGG GCTCGCGGGC AAGCGGCTCG GGGTGGCGGG CGGTGCGCTC
GACAAGAACT GGATCCTGTT GAGGGCGCAG GCCCGCGAGA CGGCGGGGCT CGAGCTCGAG
AACGTCGCGC AGATCGCCTA CGGTGCGCCA CCGCTGCTGG CGCAAAAGCT GGAGACCGGC
GAACTCGACG CGGCTCTGCT CTACTGGCAG TTCTGCGCCC GCCTCGAAGC CAAGGGCTTC
AAGCGGCTGA TCTCGGCCGA CGACGTGATG CGGGCCTTCG GCGCCAAGGG TGCGGTCTCG
CTGATCGGCT ATCTCTACGA GGGCCACACC GTGGCCGACC GGGGCGAGGT GGTGCGCGGC
TTCGCCCGTG CCTCGGCCGC CGCCAAGGAC GCGCTGGCGA ACGAGCCGGC CCTGTGGGAG
ACGGTCCGCC CGCTGATGGC GGCGGAGGAC GACGCCACCT TCGCCACGCT CAAGCGCGAT
TTCCTCGCGG GCATCCCGCG CCGGCCCATC GCCGCCGAGC GTGCCGACGG CGAGCGCATC
TACGCGGCCC TGGACCGGCT CGCGGGCGCG CAACTCCTCG GCGTGGGCAA GAGCCTGCCG
CCGGACCTCT ATCTCGACGC CTCGGGCAAC GGCTGA
 
Protein sequence
MLSRRGMLAT AALSLAAMLP MARRAQAAGE TFRLGVLPFG TASWEAAVIK ARGFDTANGF 
TLDIVKLAGN DAARIAFLGG QVDAIVGDLI FAARLGNEGR GVRFSPYSTT EGALMVPAGS
PITDLKGLAG KRLGVAGGAL DKNWILLRAQ ARETAGLELE NVAQIAYGAP PLLAQKLETG
ELDAALLYWQ FCARLEAKGF KRLISADDVM RAFGAKGAVS LIGYLYEGHT VADRGEVVRG
FARASAAAKD ALANEPALWE TVRPLMAAED DATFATLKRD FLAGIPRRPI AAERADGERI
YAALDRLAGA QLLGVGKSLP PDLYLDASGN G
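
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the tool that generated this record: it assumes the sequence is given 5' to 3' in the coding frame, and it uses the standard amino-acid assignments, which translation table 11 shares for sense and stop codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 36 bases of the gene sequence in this record.
print(translate("ATGCTGTCAC GGCGCGGAAT GCTCGCGACG"))         # MLSRRGMLAT
print(translate("CCGGACCTCT ATCTCGACGC CTCGGGCAAC GGCTGA"))  # PDLYLDASGNG
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein sequence and the GC content field to be compared in the same way.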