Gene Mext_4441 details


Gene Information

Locus tag: Mext_4441
Symbol:
ID: 5834104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4945940
End bp: 4947226
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 68%
IMG OID: 641370234
Product: glucose sorbosone dehydrogenase
Protein accession: YP_001641880
Protein GI: 163853837
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
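
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The short Python sketch below does that arithmetic. It assumes the coordinates are 1-based and inclusive at both ends, which the record does not state explicitly but which matches the listed 1287 bp. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4945940
end_bp = 4947226

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# A CDS should span a whole number of codons; the final codon is the
# stop codon and contributes no amino acid to the protein.
assert gene_length_bp % 3 == 0
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1287 428
```

Under that assumption the result agrees with the Gene Length and Protein Length fields.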


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.672706
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGCGCA CCCTTTCCCT CTTGCTCGCC ACCGTTGCGG CCCTCGGCCT CGATGCCCCA 
GCCTTCGCTC AGCAGCCAGC CGGACAGGAA TCGCGCTCGC CCCGCATCGA GGGTGAATCG
TTCCAGGCGC CGGCCGCGCC GAGTGCCGGC ACATCGATCG CCGAGAAGGA GGCGCCGAAC
ACCGAGTACA AGCCGCTCCT GCCCAACCAG ACCCGTGCGC CGGAGCCCGC GCAGAAGACC
GAGTTCGAGA CCGCCGTCGT GGCCAAGGGC CTGGAAAGCC CCTGGGCGAT GGAGTTCCTG
CCCGATGGCC GCATGATCGT GACCGAGAAG GCTGGAAAGA TCCGCCTGAT CGCCAAGGAC
GGCACGGTGG GCCAGCCGGT GGCGGGCGTG CCGAAGGTTG ATTCCAAGGG GCAGGGCGGT
CTTCTCGACG TCGCGCTGAG CCCGAGCTTC GCCGCGGATC GCACGATCTA CTTCAGCTAC
AGCGAGCCGC GCGACAAGGG CAACGGCACC ACGGTCGCCA AGGCCAAGCT GGTCGAGAGC
GATGGCAAGG CGAAGCTCGA CGACGTCAAG GTCATCTTCC GCCAGATGCC GACCTATGAC
GGCGACAAGC ATTTCGGCTC GCGCCTCGTC TTCGCGCCGG ACGGCAAGCT GTTCGTCACC
GTGGGCGAGC GCTCCGACAA GCAGACCCGC GGGCAGGCGC AGGATCTGAC GAGCGGGCTC
GGCAAGGTCT TCCGCATCGA CACCGACGGC AATGCCCCGA AGGACAACCC CTTCACCGGC
GGCGAGAAGG CCAAGCCCGA GATCTGGTCC TACGGCCACC GCAACGTCCA GGCCGCCGCC
CTGGACAACC AGGGCCGGCT CTGGACCGTG GAGCACGGGC CGCGCGGCGG CGACGAGCTG
AACCGCCCGC GCCCCGGCCT CAACTACGGC TGGCCGGTGG TCACCTACGG CATCGAGTAT
TCCGGCGAGA AGATCGGCGA CGGCCAGACC CAGGCCGGCG GCACGGTGCA GCCGGTCTAT
TACTGGGATC CGGTGATCGG CCCCTCGGGC ATGGCGCTCT ACGACAAGGA CGCGTTCCCG
GCCTGGAAGA ACCAGTTCCT CATCGGCGGC CTCGTCAGCA CCGGGATCGT CGCGCTCAAG
CTCGACGGCG ACAAGGTCGT CACCGAGGAG CGCATCCCGC TGGAACACCG CGTCCGCGAC
GTGCGGGTCG GCCCCGATGG TGCCGTCTAC GCGGTCACCG AGGATGACGG CCAGATCGTC
AAGCTGACGC CGAAGAAGGG CAGCTGA
 
Protein sequence
MTRTLSLLLA TVAALGLDAP AFAQQPAGQE SRSPRIEGES FQAPAAPSAG TSIAEKEAPN 
TEYKPLLPNQ TRAPEPAQKT EFETAVVAKG LESPWAMEFL PDGRMIVTEK AGKIRLIAKD
GTVGQPVAGV PKVDSKGQGG LLDVALSPSF AADRTIYFSY SEPRDKGNGT TVAKAKLVES
DGKAKLDDVK VIFRQMPTYD GDKHFGSRLV FAPDGKLFVT VGERSDKQTR GQAQDLTSGL
GKVFRIDTDG NAPKDNPFTG GEKAKPEIWS YGHRNVQAAA LDNQGRLWTV EHGPRGGDEL
NRPRPGLNYG WPVVTYGIEY SGEKIGDGQT QAGGTVQPVY YWDPVIGPSG MALYDKDAFP
AWKNQFLIGG LVSTGIVALK LDGDKVVTEE RIPLEHRVRD VRVGPDGAVY AVTEDDGQIV
KLTPKKGS
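
The gene begins with GTG rather than ATG, while the protein begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which alternative initiation codons such as GTG are read as methionine at the start of a CDS. The sketch below is a minimal illustration, not the database's own pipeline: it builds the standard codon table (the amino-acid assignments in table 11 are the same as in the standard code) and translates the first and last blocks of the gene sequence shown above. The function name `translate` and its `is_cds_start` flag are hypothetical, and forcing the first codon to M is a simplification that does not validate the start codon.

```python
# Codon table in the conventional TCAG ordering; table 11 shares these
# amino-acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, is_cds_start=True):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Simplification (assumption): when is_cds_start is True, the first codon
    is always emitted as M without checking that it is a valid start codon.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        if pos == 0 and is_cds_start:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases and last 27 bases of the gene sequence listed above.
first_60 = "GTGACGCGCA CCCTTTCCCT CTTGCTCGCC ACCGTTGCGG CCCTCGGCCT CGATGCCCCA"
last_27 = "AAGCTGACGC CGAAGAAGGG CAGCTGA"

print(translate(first_60))                     # MTRTLSLLLATVAALGLDAP
print(translate(last_27, is_cds_start=False))  # KLTPKKGS
```

Both fragments reproduce the corresponding ends of the listed protein sequence. The last fragment is in frame because it starts at base 1261, immediately after 420 complete codons.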