Gene Mext_3731 details

Gene Information

Locus tag: Mext_3731
Symbol:
ID: 5832632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4132360
End bp: 4133688
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 68%
IMG OID: 641369521
Product: glycine betaine/L-proline ABC transporter, ATPase subunit
Protein accession: YP_001641176
Protein GI: 163853133
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4175] ABC-type proline/glycine betaine transport system, ATPase component
TIGRFAM ID: [TIGR01186] glycine betaine/L-proline transport ATP binding subunit
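
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4132360
end_bp = 4133688

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1329 bp
print(protein_length, "aa")  # 442 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.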


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.131666
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCGGA TCAATCTTGA AGGTATCAGC AAGATATTCG GCTCGAACCC CTCCAAGGCG 
CTCGATCTGA TCGGCCAGGG GAAGCGCAAG GGCGACATCG CCGCTGCCTG CGGTGCGGTC
GTCGGATTGC GCGACATCTC GTTCGACATC GAAGAGGGCG AGATCCTCGT CCTGATGGGC
CTGTCCGGCT CCGGGAAGTC GACGCTCCTG CGCTGCATGA ACCGTCTGGT CGAGCCGTCC
TGCGGCCGGA TCGTCGTGGA CGGAGTGGAC GTGACCCGGC TCGGCCGCAA GGATCTGCTC
GCCTTCCGCC AGAAGACCTT CGGCATGGTC TTCCAGCACT TCGCGCTGCT GCCCAACCGG
ACCATCCTCG GGAATGTCGG GTTCGGCCTC GAGATCAAGC AGGTCCCGGC CAAGGAGCGG
ATCGAGCGGT CGATGCAGGC CATCGAACTC GTCGGCCTGA AGGGCTGGGA GACGAAGTAT
CCCAATGAAT TGTCGGGCGG CATGCAGCAG CGGGCGGGCC TCGCGCGGGC GCTCGCCGCC
GATGCCGACA TCCTGCTCAT GGACGAGGCC TTCAGCGCCC TCGACCCCCT GATCCGCCGC
GACATGCAGG CGGAGTTGCG CGACCTCCAG CGCAAGCTCA AGAAGACCAT CGTCTTCGTC
TCGCACGATC TCGACGAGGC CATCGCGCTC GGCGGCCGCA TCGTCCTGAT GAAGGACGGC
GAGGTGGTGC AGATCGGGCA GCCCGAGGAC ATCGTGGCTC GCCCCGCGAC CGACTATGTC
GAGCGCTTCG TCGAGCATAT CGATCTCGCC GCCGTGCTGC GGGCGGAGCA GGTCGCGGAT
CGCTCCGCCC CCGTGCTCGC CCCCACGCAG ACAGTGGCCG AGGCGCGGAC CGCACTCGGC
GGGGCAGGTG GCCGCACGAG CGGCCGGGCT TGGCTCGTCG CCGACGGGGA CGGACGGCTG
GTCGGCCGCA TCTTCGCCGA GAGGCTCGCC TCCGCCCGGC CGGCCGAGAC CCTCTCCAGC
CTGCTCGACC TCGGACAATC CGTCGTCGAG GCGGACAGCC GGCTGGACAC CATCCTCGCG
ACGGTCGCCG CCGAGGAATC CGTCGCGGTC GTGGGCCGGA ACGGACGCCT GATCGGCTCC
ATCACCAGCC GCGACGTCGT TCAGGCGCTC GCCGCGCGGC CCGGCACGCA CGCGCAGCCG
CATGCCGGTG CCCCGATCCT CTCAAAGCCG TCAGGAGCCC CGACATGGAG TGGAACGTCC
CCAAATTCCC CCTCGACACG CTCAGTGACA ACGGCCTCGA CTGGCTCACC GAGCATGGCA
GTTGGCTGA
 
Protein sequence
MGRINLEGIS KIFGSNPSKA LDLIGQGKRK GDIAAACGAV VGLRDISFDI EEGEILVLMG 
LSGSGKSTLL RCMNRLVEPS CGRIVVDGVD VTRLGRKDLL AFRQKTFGMV FQHFALLPNR
TILGNVGFGL EIKQVPAKER IERSMQAIEL VGLKGWETKY PNELSGGMQQ RAGLARALAA
DADILLMDEA FSALDPLIRR DMQAELRDLQ RKLKKTIVFV SHDLDEAIAL GGRIVLMKDG
EVVQIGQPED IVARPATDYV ERFVEHIDLA AVLRAEQVAD RSAPVLAPTQ TVAEARTALG
GAGGRTSGRA WLVADGDGRL VGRIFAERLA SARPAETLSS LLDLGQSVVE ADSRLDTILA
TVAAEESVAV VGRNGRLIGS ITSRDVVQAL AARPGTHAQP HAGAPILSKP SGAPTWSGTS
PNSPSTRSVT TASTGSPSMA VG
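
The sequence blocks above are laid out in groups of 10 characters, 60 per line. The sketch below shows one way to strip that layout, compute GC content, and translate the gene into protein. It is an illustration only: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which start codons it permits, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line and last fragment of the gene sequence, copied from the record above.
first_line = clean("ATGGGCCGGA TCAATCTTGA AGGTATCAGC AAGATATTCG GCTCGAACCC CTCCAAGGCG")
last_fragment = clean("GTTGGCTGA")

print(translate(first_line))     # MGRINLEGISKIFGSNPSKA
print(translate(last_fragment))  # VG
```

Passing the full cleaned gene sequence to `gc_content` and `translate` would allow a comparison against the GC content field and the protein sequence listed in the record.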