Gene Mext_1744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1744 
Symbol 
ID5833052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1965922 
End bp1967334 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content71% 
IMG OID641367543 
Producturate catabolism protein 
Protein accessionYP_001639214 
Protein GI163851171 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0726] Predicted xylanase/chitin deacetylase
[COG3195] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03164] OHCU decarboxylase
[TIGR03212] putative urate catabolism protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACACCT CCCGCGACCT GATCGGATAC GGCCGCACCG TTCCGCAGGC GGATTGGCCC 
GGCGGCGCCC GGATCGCCGT GCAGATCGTG CTCAACTACG AGGAGGGGGG CGAGAACTGC
ATCCTGCACG GGGATGCGGC CTCCGAGGCG TTCCTCTCCG AAATCGTCGG CGCGGCGCCC
TGGCCGGGCC TACGCCACAT GAACATGGAA TCGCTCTACG AGTACGGCGC CCGCGCCGGG
TTCTGGCGGC TGTGGCGGCT GTTCACGCAA CGGGGCGTGC CGGTGACCGT ATTCGGCGTC
GCCACCGCAC TCGCGCGCAA CCCGGAGGTC GTGGCCGCGA TGCGGGAGGC GGATTGGGAG
ATCGCCAGCC ACGGCCTGAA ATGGATCGAT TACCGCGACA TGAAACGGGC GGAGGAGGCC
GCGCAGATGG ATGCGGCGAT CCGGCTGCAC GAGGAGGTGA CGGGCGAGCG CCCCCTCGGC
TGGTACACCG GCCGCTCCTC CGTCAACACG CTCGAACTCG GCCTGGAACG GGGCTTTTCC
TATCTCGCCG ATTCCTACGC CGACGACCTG CCCTACTGGC TGTACGGGCG GGCCGGCACC
GGCCTCGTGG TGCCCTACAC CCTCGACGCG AACGACATGC GCTTCGCCAC GCCGCAGGGC
TTCAACACCG GCGAGCACTT CTTCACCTAC CTGCGCGACA GCTTCGACGC GCTCTACGCG
GAGGGTGCCA CCACGCCGAA GATGATGTCG GTGGGGCTGC ACTGCCGTCT GGTCGGCCGG
CCCGGCCGCA TCGCCGCCCT CGCGCGCTTC CTCGACCACG TCGCCGCCCA TGACGGCGTC
TGGCTGGCGC GCCGCATCGA CATCGCCCGG CACTGGACGG CGCGGCACCC GGCCGAAGCC
TTACGCCCGA GCACCATGAG CGCGGCGCAG TTCCTCACCC GGTTCGGCGA CATCTTCGAG
GATACGCCGG AGATCGCGCT CCGGGCGTGG CAGGCGGGCC TCACCGCCCG CGAGGACAGC
GCGGAGGGGC TCCATGCCGC CCTCGTCGGG GCCCTGCGCG GCCTGCCCGC CGAGCAACAG
CGCGCCCTCA TCCGCGCCCA TCCCGAACTC GCCGGACGGC TCGCCCAGGC GGGACAGTTG
ACGCAAGCCT CCACCACCGA GCAGGGCAGC GCCGGCCTCG GCGCGCTCTC GGCCGAGGAG
CTGGCGCGAT TCGAGCGGCT GAACGCGGCC TACCGCGCAC GCTTCGACCT GCCCTTCATC
ATGGCCATCA AGGGCAGCAG CCGCGAGGCG ATCCTGGCTG CGTTCGAGGC GCGGCTGCGC
AACGATCCCG AGCAGGAGTT TCAGGAGGCT TTGCGCCAAA TCGAGCGGAT CGCGTGGCTG
CGCCTGAAGG ACCGGCTGCC CTCGGAGAGT TGA
 
Protein sequence
MHTSRDLIGY GRTVPQADWP GGARIAVQIV LNYEEGGENC ILHGDAASEA FLSEIVGAAP 
WPGLRHMNME SLYEYGARAG FWRLWRLFTQ RGVPVTVFGV ATALARNPEV VAAMREADWE
IASHGLKWID YRDMKRAEEA AQMDAAIRLH EEVTGERPLG WYTGRSSVNT LELGLERGFS
YLADSYADDL PYWLYGRAGT GLVVPYTLDA NDMRFATPQG FNTGEHFFTY LRDSFDALYA
EGATTPKMMS VGLHCRLVGR PGRIAALARF LDHVAAHDGV WLARRIDIAR HWTARHPAEA
LRPSTMSAAQ FLTRFGDIFE DTPEIALRAW QAGLTAREDS AEGLHAALVG ALRGLPAEQQ
RALIRAHPEL AGRLAQAGQL TQASTTEQGS AGLGALSAEE LARFERLNAA YRARFDLPFI
MAIKGSSREA ILAAFEARLR NDPEQEFQEA LRQIERIAWL RLKDRLPSES