Gene Mext_3722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3722 
Symbol 
ID5831028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4120561 
End bp4124091 
Gene Length3531 bp 
Protein Length1176 aa 
Translation table11 
GC content72% 
IMG OID641369512 
Producturea carboxylase 
Protein accessionYP_001641167 
Protein GI163853124 
COG category[E] Amino acid transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG2049] Allophanate hydrolase subunit 1
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain
[TIGR02712] urea carboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.848646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.673189 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGCCA AGGTTCTGGT CGCCAATCGC GGCGAGATCG CCGCCCGCGT CGTGCGCACA 
TTGCGGCGCA TGGGCATCGC CTCGGTCGCC GTCTACTCCG ACGCCGACCG CTTCACCCCC
GGCGTGCTCG CCGCCGACGA GGCCGTGCGC CTCGGCCCCG CGCCGGCCGC GCAGAGCTAT
CTCGAGGTCG AGGCCGTCAT CGCCGCCTGT AAGGTGACCG GCGCCGAGGC CGTCCATCCC
GGCTACGGCT TCCTGTCCGA GAATGTCGGC TTCGCCGAGC GGCTGGCGGC GGAAGGAATC
GTATTTATCG GCCCGCGCCC GGAGCATCTG CGCGCCTTCG GCCTCAAGCA CACGGCCCGC
GAGCTGGCGA AAGCCAGCGG CGTCCCGCTC CTGCCCGGCA CCGACCTCTT GCCCGATCTT
GAGACGGCGC TGAGCGCAGC CGAGGCGATC GGCTATCCGG TGATGCTCAA GAGCACGGCG
GGCGGCGGCG GCATCGGCAT GCAGCTCTGC CATTCGGCGC AAGAGCTGGC CGAGCGCTTC
GCTGCGGTCG AGCGCACGGC ACGGGCCAGC TTCGGTGATG CCCGCGTCTA TCTCGAGCGC
TTCGTGGCGC AGGCCCGCCA TGTCGAGGTG CAGATCTTCG GAGACGGGCG CGGCCGCGTC
GTGGCGCTCG GCGAACGCGA CTGCTCGCTC CAGCGCCGGA ACCAGAAGGT GATCGAGGAG
ACGCCGGCCC CCGGCCTCTC CGACGCCGTG CGCGCCCGGC TCCACGCCGC GGCGGTGGCG
TTGGGGGAGA GCGTCGCCTA CGCCTCGGCC GGCACCGTCG AGTTCATCTA CGATCCGGAC
CGCGAGGACT TCTCGTTCCT CGAAGTGAAC ACCCGGCTCC AAGTGGAGCA TCCGGTGACG
GAGGCGGTGT TCGGCATCGA CCTCGTGGAA TGGATGGTGC GCCAGGCCGC GGGCGAGGAT
CCGATCACGG CGGCCGGGCC GCTCGTGCCG AGGGGCGCGG CGATCGAGGC GCGGCTCTAC
GCCGAGATCC CGCACGCGAA TTTCTCGCCG AGCGCCGGCC TGCTCACCGA GGTGCGCTTC
CCCGAGACCG CTCGCATCGA CGGCTGGATC GCCACCGGCA CCGAGGTCAC CCCGTTCTAC
GACCCGATGC TCGCCAAGAT CATCGTCACC GGGCCCGACC GTCCGGCGGC TTTGGCCGCG
CTCCGGGCAG CGCTCGCCGA CACCGTCATC TCCGGCATCG AGACCAATCT CGACTACCTG
CGCACCATCG CCGCCTCCGA CCTGCTGGCA AGCGGGCGCG TCGCGACCAC GGCCCTGCGC
GACCTCGCCT ATGCTCCCAG AAGCATCGAG GTGCTGCTGC CGGGCGCCCA GTCGAGCCTT
CAGGAGCTAC CCGGCCGGCT CGGCCTGTGG GAGGTCGGCG TGCCGCCGAG CGGGCCGATG
GATGCCCGCT CCTTCGCCCA GGCCAACGCC CTCGTCGGCA ACGGGCAGGA CACCTGCGCC
CTCGAACTCA CCGTCTCGGG TCCGACGCTG CGCTTCCACA CGGATGCCCG GGTAGCGCTC
GCGGGCGCCG CGATGCGGCT CACCCTCGAC GGGGAGGTGC GTCCGCATGG CGAGGCCTTC
TCGGTGAGGG CGGGGCAGAC GCTCGCCATC GGCGCGATCG AGGGGCCGGG CCAGCGCGCC
TATCTGGCGC TGCGCGGCGG CCTCGCGGCG CCCGTGGTGC TCGGCTCGCG CGCGACCTTC
GCGCTGGGCG CCTTCGGCGG CCACGCCACC GGCGTCCTCA AGGCCGGCGA CGTGCTCCAT
CTCGGCGACG CGCCGGAGGT CGCCGCCGCC CCGCCCGAGC CCGCGCCGCT GACGCAGGAT
TGGGAGATCG GCGTCGTCTA CGGTCCGCAC GGCGCGCCGG ACTTCTTCCA GGAGGCCGAC
ATCGCCGATC TGTTCTCGGC GTCCTACGAG GTGCATTTCA ACTCGGCGCG CACGGGCGTG
CGCCTGATCG GCCCGACGCC GCGCTGGGCG CGGACCGACG GCGGCGAGGC GGGCTTGCAC
CCCTCGAACC TGCACGACAA CGCCTACGCC GTCGGCGCCA TTGACTTCAC CGGCGACATG
CCGATCCTGC TCGGCCCCGA TGGGCCGAGC CTCGGCGGCT TCGTCTGCCC CGCGGTGATC
GCCCGTGACG AGCTGTGGAA GATGGGCCAG CTCCGCCCCG GCGACACGGT CCGCTTCCGG
CCCGTGGCCC GGCCCGAGGA CGCGCTCGCC GCCCCGGCCC TGATCGGCGG CGCGGGCAGC
CTGTCCGGCT CACCCATCCT CGCCCGCGAC GAGACCGGCC CGGTCCCCGT CGCCTATCGC
CGCCAGGGCG ACGACAACCT CCTCGTCGAG TACGGCCCGA TGGCCCTCGA TATCGGCTTG
CGGCTGCGGG TCCACCTGCT CGCCGAGGCG GTGCGCGCCG CGCGGCTCCC CGGTCTGACC
GACCTGACGC CGGGCATCCG CTCGCTCCAG ATCCACTACG ACGGCACGGC GCTGCCGCGC
CACCGCCTGC TCGGCAGCCT ACGCGAGATC GAGGCCGGAT TGCCGGCGGC GGAGGCCGTC
AGCGTCCCGA GCCGCACCGT GCACCTGCCG CTCTCGTGGA ACGACCCGCA GGCGGAGCTC
GCCATGCGCA AGTACCAGGA GCTGGTGCGC CCGAACGCGC CCTGGTGCCC CTCGAACATC
GAGTTCATCC GCCGCATCAA CGGTCTGCCC GACGAGGCGG CGGTGCGCAG CATCATCTTC
GATGCGAGCT ACCTCGTGAT GGGGCTCGGC GATGTCTATC TCGGTGCGCC GGTCGCCACC
CCGGTCGATC CGCGCCACCG CCTCGTGACC ACCAAGTACA ACCCGGCCCG CACCTGGACG
CCGGAGAACG CGGTCGGCAT CGGCGGCGCC TATCTATGCA TCTACGGCAT GGAGGGGCCG
GGCGGGTACC AGCTGTTCGG CCGCACGATC CAGGTCTGGA ACACGTGGCG GACCACCCGC
GAATTCGTGC CCGGCCATCC CTGGCTGTTG CGGCCCTTCG ACCGCATCCG CTTCTTCCCC
GTCAGCCACG CCGAACTCAC CGAAGCCCGC GCCGCCTTCC CGCACGGCGC CTATCCGATC
CGGATCGAGG ACGGCACGTT CTCCTACGCC GAGCACCGAA GGATGCTCGC GGGCAACGCC
GCCGAGATCG AGCAGGCGGG GGCGCGCCAG AGGGCAGCCT TCGCCGCCGA GCGCGAGCGC
TGGAAGGCGG AAGGCCTCGA CAGCTTCGTC GCCGACGAGG CGGTGGCGAC GGAGGCCGCC
GAGATCCCGC CGGGCTGCGA GGGCGTGCCG ACCACCGTGC CCGGCAATGT CTGGAAGGTT
CTCGTCGGCG CCGGCGAGAC CGTGGCCGCC GGCCAGACGG TGGCGATCCT CGAATCAATG
AAGATGGAGG TGGCGGTCAC CGCTCCCGTC GGCGGCCGGG TCCGCGAGAT CCGGGCCCAG
CCCGGCCGCA CCCTGCGCGG CGGCGACCTC GTCGCCATCC TCGAGACCTA A
 
Protein sequence
MFAKVLVANR GEIAARVVRT LRRMGIASVA VYSDADRFTP GVLAADEAVR LGPAPAAQSY 
LEVEAVIAAC KVTGAEAVHP GYGFLSENVG FAERLAAEGI VFIGPRPEHL RAFGLKHTAR
ELAKASGVPL LPGTDLLPDL ETALSAAEAI GYPVMLKSTA GGGGIGMQLC HSAQELAERF
AAVERTARAS FGDARVYLER FVAQARHVEV QIFGDGRGRV VALGERDCSL QRRNQKVIEE
TPAPGLSDAV RARLHAAAVA LGESVAYASA GTVEFIYDPD REDFSFLEVN TRLQVEHPVT
EAVFGIDLVE WMVRQAAGED PITAAGPLVP RGAAIEARLY AEIPHANFSP SAGLLTEVRF
PETARIDGWI ATGTEVTPFY DPMLAKIIVT GPDRPAALAA LRAALADTVI SGIETNLDYL
RTIAASDLLA SGRVATTALR DLAYAPRSIE VLLPGAQSSL QELPGRLGLW EVGVPPSGPM
DARSFAQANA LVGNGQDTCA LELTVSGPTL RFHTDARVAL AGAAMRLTLD GEVRPHGEAF
SVRAGQTLAI GAIEGPGQRA YLALRGGLAA PVVLGSRATF ALGAFGGHAT GVLKAGDVLH
LGDAPEVAAA PPEPAPLTQD WEIGVVYGPH GAPDFFQEAD IADLFSASYE VHFNSARTGV
RLIGPTPRWA RTDGGEAGLH PSNLHDNAYA VGAIDFTGDM PILLGPDGPS LGGFVCPAVI
ARDELWKMGQ LRPGDTVRFR PVARPEDALA APALIGGAGS LSGSPILARD ETGPVPVAYR
RQGDDNLLVE YGPMALDIGL RLRVHLLAEA VRAARLPGLT DLTPGIRSLQ IHYDGTALPR
HRLLGSLREI EAGLPAAEAV SVPSRTVHLP LSWNDPQAEL AMRKYQELVR PNAPWCPSNI
EFIRRINGLP DEAAVRSIIF DASYLVMGLG DVYLGAPVAT PVDPRHRLVT TKYNPARTWT
PENAVGIGGA YLCIYGMEGP GGYQLFGRTI QVWNTWRTTR EFVPGHPWLL RPFDRIRFFP
VSHAELTEAR AAFPHGAYPI RIEDGTFSYA EHRRMLAGNA AEIEQAGARQ RAAFAAERER
WKAEGLDSFV ADEAVATEAA EIPPGCEGVP TTVPGNVWKV LVGAGETVAA GQTVAILESM
KMEVAVTAPV GGRVREIRAQ PGRTLRGGDL VAILET