Gene Mchl_1439 details

Gene Information

Locus tag: Mchl_1439
Symbol:
ID: 7116657
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 1492672
End bp: 1494294
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 66%
IMG OID: 643524209
Product: extracellular solute-binding protein family 5
Protein accession: YP_002420244
Protein GI: 218529428
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
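
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and assuming the gene length includes one terminal stop codon; neither convention is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1492672, 1494294)
print(length, protein_length(length))
```

Under these assumptions the sketch reproduces the listed 1623 bp and 540 aa.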


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGTTCA ATCTGAAGAG AAACGGCGCC GGTGCGGCGC TCGCCGGCCT GCTGCTGCTT 
GGGCTCAGTC CGGCCGCCCT GGCGCAGGGC GTTCTGCGCA TCGGCATGAC AGCGTCCGAC
ATTCCCCTGA CCACCGGCCA GGCCGACAAC GGCGGCGAGG GCATGCGGTT CATGGGCTAC
ACAGTCTATG ACGGGCTCAT CAATTGGGAC CTGACGAGCG CGGAACTCGC CTCCGACCTC
ACGCCCGGCC TCGCCACGAG TTGGACGGTG GACCTGAACG ACCAGACCAA GTGGACCTTC
AAGCTGCGCC CGGGCGTCAC GTTCCACGAC GGCTCGGACT TCACCGCCGA TGCGGTGGTG
TGGAACCTCG ACAAGCTCCT GAAGAGCGAT GCGCCCCAGT ACGACCCGCG CCAGTCCGCC
CAGGGCAAGA CCCGTATCCC GGCGGTGGCG AGCTACCGCG CGGTCGATCC GCTGACCGTC
GAGATCACCA CCAAGATCCC CGACGCGACG CTGCCCTACC AGATCGCCTG GATCATGATG
TCCTCCCCCG CCCAGTGGGA GAAACTCGGC AAGTCCTGGG ACGCCTTCGC CAAGCAGCCC
TCGGGCACCG GCCCGTGGAA GCTGACGCTG TTCGCCCCCC GCGAGCGGGC CGAGATGGCC
CCGAACCCGG CCTACTGGGA CAAGAAGCGG ATCCCGAAGC TCGACAAGCT GGTGCTGGTG
CCGCTGCCGG AGGCCAACGC CCGGGTCGCG GCCCTGCGCG CCGGGCAGGT CGATTGGATC
GAGGCCCCGG CGCCGGACGC CATCGCCTCG TTGAAGGGCG CGGGCTTCAC CATCGTCACC
AACGCCTACC CGCACAACTG GACGTGGCAT CTCTCCCGGG GCGAGGGCTC GCCGTGGAAC
GACGTCCGCG TGCGCAGGGC GGTCAACCTC GCCATCGACC GCGAGGGCCT GAAAGAGCTG
CTGGGCGGGG TGGCGATCCC GGCCAAGGGC TTCTACCCGC CGAACCACCA GTGGTTCGGC
CGCACCACCT TCGACGTGAA GTACGATCCC GAGGCGGCCA AGAAGCTGCT TGCCGAGGCC
GGCTACGGCA AGGCGAAGCC GTTGAAGTTC AAGGTGGCAA TCTCGGCCTC GGGCTCGGGC
CAGATGCAGC CGCTGCCGAT GAACGAGTTC GTACAGCAGA ACCTCGCCGA TGTCGGCGTC
CAGGTCGACT ACGAGGTCGT CGAGTGGAAC ACGCTGATCA ACGTCTGGCG CGCGGGCGCC
AAGGCCGACA TCTCCCGCGG CGTATCGGCG ATCAATTACT CCTACTTCAT CCAAGACCCG
TTCACCGGCT TCATCCGCCA CCTGCAGTGC AACCTCGCGC CGCCGAACGG CACCAACTGG
GGCTATTACT GCGATCCTGA GATGGACCAG CTGTTCGATC AAGTGCGCAC CACCTTCGAC
AAGGAGACGC AGAACAAGGT CCTCCAGAAG GTTCACGAGA AGTTCGTCGA CGACGCGCTG
TTCGTGATGA TCACCCACGA CGTCAATCCG CGGGCGATGA GCCCGAAGGT GAAGGGCTTC
GTCCAGGCGC GCAACTGGTT CCAGGACTTC TCAACGATCA CCATCGCCAC CGCCGGGCGG
TGA
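
The gene sequence is printed in blocks of 10 bases separated by spaces, so it needs to be cleaned before any computation such as GC content. The following Python sketch shows one possible way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGGTGTTCA ATCTGAAGAG AAACGGCGCC GGTGCGGCGC TCGCCGGCCT GCTGCTGCTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applying `gc_percent` to the full cleaned sequence would give a value to compare against the GC content listed in the gene information.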
 
Protein sequence
MVFNLKRNGA GAALAGLLLL GLSPAALAQG VLRIGMTASD IPLTTGQADN GGEGMRFMGY 
TVYDGLINWD LTSAELASDL TPGLATSWTV DLNDQTKWTF KLRPGVTFHD GSDFTADAVV
WNLDKLLKSD APQYDPRQSA QGKTRIPAVA SYRAVDPLTV EITTKIPDAT LPYQIAWIMM
SSPAQWEKLG KSWDAFAKQP SGTGPWKLTL FAPRERAEMA PNPAYWDKKR IPKLDKLVLV
PLPEANARVA ALRAGQVDWI EAPAPDAIAS LKGAGFTIVT NAYPHNWTWH LSRGEGSPWN
DVRVRRAVNL AIDREGLKEL LGGVAIPAKG FYPPNHQWFG RTTFDVKYDP EAAKKLLAEA
GYGKAKPLKF KVAISASGSG QMQPLPMNEF VQQNLADVGV QVDYEVVEWN TLINVWRAGA
KADISRGVSA INYSYFIQDP FTGFIRHLQC NLAPPNGTNW GYYCDPEMDQ LFDQVRTTFD
KETQNKVLQK VHEKFVDDAL FVMITHDVNP RAMSPKVKGF VQARNWFQDF STITIATAGR
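
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following Python sketch illustrates that relationship on the first 30 bases. It assumes the amino acid assignments of table 11 match the standard code, and it does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(raw: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGTGTTCA ATCTGAAGAG AAACGGCGCC"))  # MVFNLKRNGA
```

The output matches the first ten residues of the protein sequence listed above.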