Gene Mext_0914 details


Gene Information

Locus tag: Mext_0914
Symbol:
ID: 5835804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 987178
End bp: 989373
Gene Length: 2196 bp
Protein Length: 731 aa
Translation table: 11
GC content: 71%
IMG OID: 641366696
Product: aldehyde oxidase and xanthine dehydrogenase molybdopterin binding
Protein accession: YP_001638390
Protein GI: 163850347
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
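
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(987178, 989373)
print(length, protein_length(length))  # prints "2196 731"
```

Under these assumptions the computed values match the Gene Length (2196 bp) and Protein Length (731 aa) fields of the record.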


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.237021
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCAAGC GAGACCTGAC TGATGCGCCC ATCCCCTCGC GGCGCGCCTT CCTCGGCGGC 
GCGGCGGCGG GTGCCATGCT GCTCGCCTTC CGCCTCGACC TGAAGGGTGC CCGCGCCGCG
GAAGCCTCGG ACGCGCTGTC GAAGGCCCCG GCCCAGCCCA ACGCCTTCGT GCGGATCGCC
GCCGACGACA CCGTCACGGT GATGATCAAG CATCTCGATA TGGGCCAGGG CAACACGACC
GGGCTCACCA CCATCCTCGC CGACGAACTC GACGCGGATT GGAGCCAGAT GCGCGTCGCC
TTCGCCCCGG CCGACGCCAA GCTCTACGCC AACCTGCTGA TGGGCCCGGT CCAGGGCACC
GGCGGCTCGA CGGCGATCGC CAATTCCTGG TTCCAGCTCC GTAAGGCGGG TGCGGCCGCC
CGCGCCATGA TGGTCGCCGC CGCGGCCGAC AAGTGGGGCG TGCCGGCGGG TGAGATCACC
GTCGCCAAGG GCGTCATCAG CCACAAGTCG GGCAAGCAGG CCCGCTTCGG CGAGTTCGCG
GAGGCCGCCG CCGCCAAGCC GGTGCCGCAG GAGCCCCGCC TCAAGACGCC GGCCGAGTGG
ACGCTGATCG GCCAGCGCGT GCCGCGCATC GATTCCGCGG CGAAGACCGA CGGCACCGCG
ATCTACTCCC TCGACATCCG CCGCCCCGGT CAGGTCACGG CGCTCGTCGC CCACCCGCCG
CGCTTCGGCG CCACGGTGAA GTCGGTGGAT GCGGAGGCCG CGCAAGGCAT GCCCGGCGTC
GTCGGCATCG TCACGATCCC GACCGGCGTG GCGGTGATCG CCCGCGACAC CTGGACCGCG
ATGAAGGCCC GCGAGGCCCT GAAGATCACC TGGGACGATT CCGCCGCCGA GACCCGCTCC
TCGGACGCGA TCCTCGCCGA GTACCGCGAG ACGGCCAAGA CCTCCGGCCT CGTCGCCTCG
CAGGCGGGCG ACGCCGACGG CGCCATCAAG GGCTCCGCCA AGGTGCTGGA GGCGGAGTTC
TCCTTCCCCT ACCTGGCCCA TGCCGCGATG GAGCCCCTGA ACGCCACGAT CGAGCGGGCG
GCGGACGGCA GCTTCGACGT CTATGCCGGC TGCCAGATCC AGACGATCGA GCAGGCTGTG
GTGGCAGCGA CGCTGGGCGT CACGACCGAC CGGGTCCGGC TGCACACGCA ATGGGCCGGC
GGCTCGTTCG GCCGCCGGGC GACGCCGGGC GCCGACTACT TCGCCGAAGC CGCCGCGATC
GTGAAGGCCT GGGACGGCAA GGCGCCGGTC CACCTCGTTT GGACCCGCGA GGACGACATG
GCGGCCGGCT ATTACCGCCC GCAGGTCCAT CACACGGTTA AGGCGGGCCT GAACGAGAAG
GGCCAGATCA CCGGCTGGCG CCACACCATG GTCGGCAAGT CGATCATGAT CGGCTCGCCC
TTCGAGGCGA TGCTCGTCAA GAACGGGATC GACTCCACCA CCGTCGAGGG CGCCTCCGAC
ACGCCCTACG CGCTCCCGGC CTACCGCTTC GAGGTGCACA ATGCCCGCGA GGGCGTGCCG
GTGCTGTGGT GGCGCTCGGT CGGCCACACC CACACCGCCC ACGTCATGGA GGTGTTTATC
GACGAGCTGG CCCATGCCGC GGGGTCCGAT CCGGTCGCCT ACCGGCTGTC GCTGCTGACC
CGCGCGCCGC GCTTGTCGGG CGTGCTGAAG CTCGCCGCCG AGCAGGCCGG CTGGGGCGGC
AAACCCTCCG AAAAGGGCCG CGGCCTCGGC GTGGCGGTGC ACGAATCCTT CGGTTCCTAC
GTCGCAATGG TGGCCGACGT CACGGCGGGC GAGTCCGGCG TGAAGGTCAA CCGGATCGTG
GCGGCGGTCG ATGTCGGCAT CGCCGTGAAC CCGGACGTGG TGCGCGCGCA GGTCGAGGGT
GCGGTGGGCT TCGCCCTGTC GGCGGTGCTG CGCAACCGCA TCACCCTCAA GGACGGCGTT
GTGCAGGAGA GGAACTTCGA CAGCTACCAG CCGACGCGCA TCTCCGAGAT GCCGCGGGTC
GATGTTCACA TCGTGCCCTC CGAGGTCGCT CCCACCGGGA TCGGCGAACC CGGCGTGCCG
GTGCTGGCGC CGGCCATCGC CAACGCGGTC TTCGCCGCGA CGGGCCAGCG CCTGCGCTCG
CTGCCGCTCG ACCTGTCGAG CCTGCGCGGC GCGTAA
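
The gene sequence is laid out in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation such as a GC-content estimate. The following is a minimal sketch, not the method the database used to derive its GC content field; `gc_fraction` is a hypothetical helper name, and only the first row of the sequence is used here for brevity.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring all whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First row of the gene sequence, copied with its block spacing intact.
first_row = "ATGCTCAAGC GAGACCTGAC TGATGCGCCC ATCCCCTCGC GGCGCGCCTT CCTCGGCGGC"
print(f"{gc_fraction(first_row):.2f}")
```

Passing the complete sequence, with line breaks and spaces left in, to the same function would give the GC fraction for the whole gene.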
 
Protein sequence
MLKRDLTDAP IPSRRAFLGG AAAGAMLLAF RLDLKGARAA EASDALSKAP AQPNAFVRIA 
ADDTVTVMIK HLDMGQGNTT GLTTILADEL DADWSQMRVA FAPADAKLYA NLLMGPVQGT
GGSTAIANSW FQLRKAGAAA RAMMVAAAAD KWGVPAGEIT VAKGVISHKS GKQARFGEFA
EAAAAKPVPQ EPRLKTPAEW TLIGQRVPRI DSAAKTDGTA IYSLDIRRPG QVTALVAHPP
RFGATVKSVD AEAAQGMPGV VGIVTIPTGV AVIARDTWTA MKAREALKIT WDDSAAETRS
SDAILAEYRE TAKTSGLVAS QAGDADGAIK GSAKVLEAEF SFPYLAHAAM EPLNATIERA
ADGSFDVYAG CQIQTIEQAV VAATLGVTTD RVRLHTQWAG GSFGRRATPG ADYFAEAAAI
VKAWDGKAPV HLVWTREDDM AAGYYRPQVH HTVKAGLNEK GQITGWRHTM VGKSIMIGSP
FEAMLVKNGI DSTTVEGASD TPYALPAYRF EVHNAREGVP VLWWRSVGHT HTAHVMEVFI
DELAHAAGSD PVAYRLSLLT RAPRLSGVLK LAAEQAGWGG KPSEKGRGLG VAVHESFGSY
VAMVADVTAG ESGVKVNRIV AAVDVGIAVN PDVVRAQVEG AVGFALSAVL RNRITLKDGV
VQERNFDSYQ PTRISEMPRV DVHIVPSEVA PTGIGEPGVP VLAPAIANAV FAATGQRLRS
LPLDLSSLRG A