Gene Mext_3756 details

Gene Information

Locus tag: Mext_3756
Symbol:
ID: 5833256
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4163581
End bp: 4165203
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 65%
IMG OID: 641369546
Product: extracellular solute-binding protein
Protein accession: YP_001641201
Protein GI: 163853158
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
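
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 4163581
END_BP = 4165203
GENE_LENGTH_BP = 1623
PROTEIN_LENGTH_AA = 540


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumes an intact, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1623
    print(expected_protein_length(GENE_LENGTH_BP))  # 540
```

Under these assumptions the record is internally consistent: the span covers 1623 bp, which is 541 codons, or 540 amino acids plus a stop codon.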


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0607711
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGTTTT TGACCACTTT GATGGATGCG TGGGAACTCG GGAGCCTCAG CATGGCCGGT 
ACAATGGACC GCAGACGGTT TCTTCAGGCG AGCGCCGCCG CTGTGGGTTT TGCCCAGATC
AATCCCGACT TCCTGGTCTC CTCGGCCTTC GCCCAATCGG GCAAGCCGCT GGTCTTCCTC
TCGGCCGAGA ACATCACCGG CAACTGGGAC CCGACCGCCC ACACCACGCT CTCGCAGAAG
AACATCGAGG GCTTCGTGAT GGGCTTCCTG ACCCGCACGC CGATGACCCT CGATGACCCC
GGCAAGGTGG TCTACGAGCT CGCCACCGAC ATCCGGTTGC TCGATCCGCA CCGCCTGCAG
ATCAAGCTGC GCAAGGGCGT GCAGTTTCAC GACGGCAAGC CGTTCGGGCC CGAGGACGTC
AAAGCGACCT TCGAGTACGG CGCGGGCAAG GACCGGCCGG CGCAGTGGTA TCCCGGCCCG
ACCGAGACGC TGACGATCAC CACGCCCGAC GACGAGACCG TGATCGTCGA CACCTCGAAG
GGCGGCTACC CCGCCCACCT CTTCATCTTC CTGGCCTCGT TCCTCCCGAT CCTCTCGGCC
AAGGACGTGG CCGAGGGGCC GGGCGGCGCC CTCACCCGGC GCCTGAACGG CACCGGCCCG
TTCCGCTTCG TCGAGCAGCG CGGCAACGAC ACCGTGCTCA AGGCCCATGA CGGCTATTTC
CGCGGCAAGC CGGGGATTCC CGGCATCAAC TTCACCTTCA CCGGCGATTC GACCACGCGA
ATGCTGTCGC TGATGAACGG CCAGGCCTCG ATCGTCGAGC GGCTCGAACC CGAGCAGGTC
GAGACGGTCA AGAACAACCC AAAGATCGCG ATCAACGAGG TCGTCTCGGT CGAGAACAAG
TATCTCTGGT TCCGCTGCTC CAAGCCGCCC TTCAACGACG TGCGGGTGCG CATGGCGGCC
TGTCACTCGA TCGACCGGGC GATGCTCCTG GAGATCCTCG GCGCGGCGGG CCACGCCTCG
GCCAATTTCA TCTCGCCGGT GAAGTTCGGC TACGTCGATC TGAAGAACTA CCCGGCCTAC
GACCCGGCCA AGGCCCAGGC GCTGCTGGCC GAGGCGGGCT TCCCCAAGGG CAAGGGGCTG
CCGCCGCTCG AATACATCAC CTCGGTCGGA TTCTACCCGA AGACGAAAGA ATACGGCGAG
GTCATCACCG CGATGCTCAA TGAGCAGGGC TTTCCGGTGA GCCTCACGGT GCTGGAGCCG
GCGGCTTGGA ACGAGCGGCT CTATCACCGC CCCGGCGGCG GGCCCGGCCA CATGGTCGAT
TGCGGCTGGT CCACCGCCTC GCCCGAGCCG GATCTGGTGC TGCGCACCCA CTTCCACTCC
TCCTCGCATC GCATCACCGG CATCGAGGAT GCGCAGATCG ATGCGAGCCT CGACAAGGAG
CGCGCGGCGC CGACGCTGGA GGAGCGCAAG GCCATCCTGC AGAACGAGAC GATGCCGCTC
CTGGCCGCCA AGATGCCGGC GCTGTCGCTG TTCACCTCGG TGATGATCCA CGCGATGCAG
CAGGAGCTGA AGGGCCTCTA CATCTACCCG GACGGCTCGA TCGACGCCTC GAAAACCGCC
TGA
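
A GC content figure such as the one reported in the Gene Information fields can be recomputed from a block-formatted sequence like the one above. The following is a minimal sketch, assuming the sequence is pasted as text in 10-base groups. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first line of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


if __name__ == "__main__":
    # First line of the gene sequence above, used as a short demonstration input.
    first_line = "ATGTGGTTTT TGACCACTTT GATGGATGCG TGGGAACTCG GGAGCCTCAG CATGGCCGGT"
    seq = clean_sequence(first_line)
    print(len(seq))
    print(round(gc_percent(seq), 1))
```

Passing the full gene sequence instead of `first_line` gives the whole-gene value to compare against the reported figure.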
 
Protein sequence
MWFLTTLMDA WELGSLSMAG TMDRRRFLQA SAAAVGFAQI NPDFLVSSAF AQSGKPLVFL 
SAENITGNWD PTAHTTLSQK NIEGFVMGFL TRTPMTLDDP GKVVYELATD IRLLDPHRLQ
IKLRKGVQFH DGKPFGPEDV KATFEYGAGK DRPAQWYPGP TETLTITTPD DETVIVDTSK
GGYPAHLFIF LASFLPILSA KDVAEGPGGA LTRRLNGTGP FRFVEQRGND TVLKAHDGYF
RGKPGIPGIN FTFTGDSTTR MLSLMNGQAS IVERLEPEQV ETVKNNPKIA INEVVSVENK
YLWFRCSKPP FNDVRVRMAA CHSIDRAMLL EILGAAGHAS ANFISPVKFG YVDLKNYPAY
DPAKAQALLA EAGFPKGKGL PPLEYITSVG FYPKTKEYGE VITAMLNEQG FPVSLTVLEP
AAWNERLYHR PGGGPGHMVD CGWSTASPEP DLVLRTHFHS SSHRITGIED AQIDASLDKE
RAAPTLEERK AILQNETMPL LAAKMPALSL FTSVMIHAMQ QELKGLYIYP DGSIDASKTA
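
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, not the database's own method. It uses the standard amino-acid assignments, which translation table 11 shares with table 1. The two tables differ in which start codons are permitted, and this sketch does not model that. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: alternative start codons allowed by table 11 are not handled specially.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (10-base grouping, line breaks) is ignored; unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence; should match the first 10 residues of the protein.
    print(translate("ATGTGGTTTT TGACCACTTT GATGGATGCG"))  # MWFLTTLMDA
```

Running `translate` on the full gene sequence and comparing the result with the protein sequence, after removing spaces from both, checks that the two listings agree.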