Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3722 |
Symbol | |
ID | 5831028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4120561 |
End bp | 4124091 |
Gene Length | 3531 bp |
Protein Length | 1176 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641369512 |
Product | urea carboxylase |
Protein accession | YP_001641167 |
Protein GI | 163853124 |
COG category | [E] Amino acid transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG2049] Allophanate hydrolase subunit 1 [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain [TIGR02712] urea carboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.848646 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.673189 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGCCA AGGTTCTGGT CGCCAATCGC GGCGAGATCG CCGCCCGCGT CGTGCGCACA TTGCGGCGCA TGGGCATCGC CTCGGTCGCC GTCTACTCCG ACGCCGACCG CTTCACCCCC GGCGTGCTCG CCGCCGACGA GGCCGTGCGC CTCGGCCCCG CGCCGGCCGC GCAGAGCTAT CTCGAGGTCG AGGCCGTCAT CGCCGCCTGT AAGGTGACCG GCGCCGAGGC CGTCCATCCC GGCTACGGCT TCCTGTCCGA GAATGTCGGC TTCGCCGAGC GGCTGGCGGC GGAAGGAATC GTATTTATCG GCCCGCGCCC GGAGCATCTG CGCGCCTTCG GCCTCAAGCA CACGGCCCGC GAGCTGGCGA AAGCCAGCGG CGTCCCGCTC CTGCCCGGCA CCGACCTCTT GCCCGATCTT GAGACGGCGC TGAGCGCAGC CGAGGCGATC GGCTATCCGG TGATGCTCAA GAGCACGGCG GGCGGCGGCG GCATCGGCAT GCAGCTCTGC CATTCGGCGC AAGAGCTGGC CGAGCGCTTC GCTGCGGTCG AGCGCACGGC ACGGGCCAGC TTCGGTGATG CCCGCGTCTA TCTCGAGCGC TTCGTGGCGC AGGCCCGCCA TGTCGAGGTG CAGATCTTCG GAGACGGGCG CGGCCGCGTC GTGGCGCTCG GCGAACGCGA CTGCTCGCTC CAGCGCCGGA ACCAGAAGGT GATCGAGGAG ACGCCGGCCC CCGGCCTCTC CGACGCCGTG CGCGCCCGGC TCCACGCCGC GGCGGTGGCG TTGGGGGAGA GCGTCGCCTA CGCCTCGGCC GGCACCGTCG AGTTCATCTA CGATCCGGAC CGCGAGGACT TCTCGTTCCT CGAAGTGAAC ACCCGGCTCC AAGTGGAGCA TCCGGTGACG GAGGCGGTGT TCGGCATCGA CCTCGTGGAA TGGATGGTGC GCCAGGCCGC GGGCGAGGAT CCGATCACGG CGGCCGGGCC GCTCGTGCCG AGGGGCGCGG CGATCGAGGC GCGGCTCTAC GCCGAGATCC CGCACGCGAA TTTCTCGCCG AGCGCCGGCC TGCTCACCGA GGTGCGCTTC CCCGAGACCG CTCGCATCGA CGGCTGGATC GCCACCGGCA CCGAGGTCAC CCCGTTCTAC GACCCGATGC TCGCCAAGAT CATCGTCACC GGGCCCGACC GTCCGGCGGC TTTGGCCGCG CTCCGGGCAG CGCTCGCCGA CACCGTCATC TCCGGCATCG AGACCAATCT CGACTACCTG CGCACCATCG CCGCCTCCGA CCTGCTGGCA AGCGGGCGCG TCGCGACCAC GGCCCTGCGC GACCTCGCCT ATGCTCCCAG AAGCATCGAG GTGCTGCTGC CGGGCGCCCA GTCGAGCCTT CAGGAGCTAC CCGGCCGGCT CGGCCTGTGG GAGGTCGGCG TGCCGCCGAG CGGGCCGATG GATGCCCGCT CCTTCGCCCA GGCCAACGCC CTCGTCGGCA ACGGGCAGGA CACCTGCGCC CTCGAACTCA CCGTCTCGGG TCCGACGCTG CGCTTCCACA CGGATGCCCG GGTAGCGCTC GCGGGCGCCG CGATGCGGCT CACCCTCGAC GGGGAGGTGC GTCCGCATGG CGAGGCCTTC TCGGTGAGGG CGGGGCAGAC GCTCGCCATC GGCGCGATCG AGGGGCCGGG CCAGCGCGCC TATCTGGCGC TGCGCGGCGG CCTCGCGGCG CCCGTGGTGC TCGGCTCGCG CGCGACCTTC GCGCTGGGCG CCTTCGGCGG CCACGCCACC GGCGTCCTCA AGGCCGGCGA CGTGCTCCAT CTCGGCGACG CGCCGGAGGT CGCCGCCGCC CCGCCCGAGC CCGCGCCGCT GACGCAGGAT TGGGAGATCG GCGTCGTCTA CGGTCCGCAC GGCGCGCCGG ACTTCTTCCA GGAGGCCGAC ATCGCCGATC TGTTCTCGGC GTCCTACGAG GTGCATTTCA ACTCGGCGCG CACGGGCGTG CGCCTGATCG GCCCGACGCC GCGCTGGGCG CGGACCGACG GCGGCGAGGC GGGCTTGCAC CCCTCGAACC TGCACGACAA CGCCTACGCC GTCGGCGCCA TTGACTTCAC CGGCGACATG CCGATCCTGC TCGGCCCCGA TGGGCCGAGC CTCGGCGGCT TCGTCTGCCC CGCGGTGATC GCCCGTGACG AGCTGTGGAA GATGGGCCAG CTCCGCCCCG GCGACACGGT CCGCTTCCGG CCCGTGGCCC GGCCCGAGGA CGCGCTCGCC GCCCCGGCCC TGATCGGCGG CGCGGGCAGC CTGTCCGGCT CACCCATCCT CGCCCGCGAC GAGACCGGCC CGGTCCCCGT CGCCTATCGC CGCCAGGGCG ACGACAACCT CCTCGTCGAG TACGGCCCGA TGGCCCTCGA TATCGGCTTG CGGCTGCGGG TCCACCTGCT CGCCGAGGCG GTGCGCGCCG CGCGGCTCCC CGGTCTGACC GACCTGACGC CGGGCATCCG CTCGCTCCAG ATCCACTACG ACGGCACGGC GCTGCCGCGC CACCGCCTGC TCGGCAGCCT ACGCGAGATC GAGGCCGGAT TGCCGGCGGC GGAGGCCGTC AGCGTCCCGA GCCGCACCGT GCACCTGCCG CTCTCGTGGA ACGACCCGCA GGCGGAGCTC GCCATGCGCA AGTACCAGGA GCTGGTGCGC CCGAACGCGC CCTGGTGCCC CTCGAACATC GAGTTCATCC GCCGCATCAA CGGTCTGCCC GACGAGGCGG CGGTGCGCAG CATCATCTTC GATGCGAGCT ACCTCGTGAT GGGGCTCGGC GATGTCTATC TCGGTGCGCC GGTCGCCACC CCGGTCGATC CGCGCCACCG CCTCGTGACC ACCAAGTACA ACCCGGCCCG CACCTGGACG CCGGAGAACG CGGTCGGCAT CGGCGGCGCC TATCTATGCA TCTACGGCAT GGAGGGGCCG GGCGGGTACC AGCTGTTCGG CCGCACGATC CAGGTCTGGA ACACGTGGCG GACCACCCGC GAATTCGTGC CCGGCCATCC CTGGCTGTTG CGGCCCTTCG ACCGCATCCG CTTCTTCCCC GTCAGCCACG CCGAACTCAC CGAAGCCCGC GCCGCCTTCC CGCACGGCGC CTATCCGATC CGGATCGAGG ACGGCACGTT CTCCTACGCC GAGCACCGAA GGATGCTCGC GGGCAACGCC GCCGAGATCG AGCAGGCGGG GGCGCGCCAG AGGGCAGCCT TCGCCGCCGA GCGCGAGCGC TGGAAGGCGG AAGGCCTCGA CAGCTTCGTC GCCGACGAGG CGGTGGCGAC GGAGGCCGCC GAGATCCCGC CGGGCTGCGA GGGCGTGCCG ACCACCGTGC CCGGCAATGT CTGGAAGGTT CTCGTCGGCG CCGGCGAGAC CGTGGCCGCC GGCCAGACGG TGGCGATCCT CGAATCAATG AAGATGGAGG TGGCGGTCAC CGCTCCCGTC GGCGGCCGGG TCCGCGAGAT CCGGGCCCAG CCCGGCCGCA CCCTGCGCGG CGGCGACCTC GTCGCCATCC TCGAGACCTA A
|
Protein sequence | MFAKVLVANR GEIAARVVRT LRRMGIASVA VYSDADRFTP GVLAADEAVR LGPAPAAQSY LEVEAVIAAC KVTGAEAVHP GYGFLSENVG FAERLAAEGI VFIGPRPEHL RAFGLKHTAR ELAKASGVPL LPGTDLLPDL ETALSAAEAI GYPVMLKSTA GGGGIGMQLC HSAQELAERF AAVERTARAS FGDARVYLER FVAQARHVEV QIFGDGRGRV VALGERDCSL QRRNQKVIEE TPAPGLSDAV RARLHAAAVA LGESVAYASA GTVEFIYDPD REDFSFLEVN TRLQVEHPVT EAVFGIDLVE WMVRQAAGED PITAAGPLVP RGAAIEARLY AEIPHANFSP SAGLLTEVRF PETARIDGWI ATGTEVTPFY DPMLAKIIVT GPDRPAALAA LRAALADTVI SGIETNLDYL RTIAASDLLA SGRVATTALR DLAYAPRSIE VLLPGAQSSL QELPGRLGLW EVGVPPSGPM DARSFAQANA LVGNGQDTCA LELTVSGPTL RFHTDARVAL AGAAMRLTLD GEVRPHGEAF SVRAGQTLAI GAIEGPGQRA YLALRGGLAA PVVLGSRATF ALGAFGGHAT GVLKAGDVLH LGDAPEVAAA PPEPAPLTQD WEIGVVYGPH GAPDFFQEAD IADLFSASYE VHFNSARTGV RLIGPTPRWA RTDGGEAGLH PSNLHDNAYA VGAIDFTGDM PILLGPDGPS LGGFVCPAVI ARDELWKMGQ LRPGDTVRFR PVARPEDALA APALIGGAGS LSGSPILARD ETGPVPVAYR RQGDDNLLVE YGPMALDIGL RLRVHLLAEA VRAARLPGLT DLTPGIRSLQ IHYDGTALPR HRLLGSLREI EAGLPAAEAV SVPSRTVHLP LSWNDPQAEL AMRKYQELVR PNAPWCPSNI EFIRRINGLP DEAAVRSIIF DASYLVMGLG DVYLGAPVAT PVDPRHRLVT TKYNPARTWT PENAVGIGGA YLCIYGMEGP GGYQLFGRTI QVWNTWRTTR EFVPGHPWLL RPFDRIRFFP VSHAELTEAR AAFPHGAYPI RIEDGTFSYA EHRRMLAGNA AEIEQAGARQ RAAFAAERER WKAEGLDSFV ADEAVATEAA EIPPGCEGVP TTVPGNVWKV LVGAGETVAA GQTVAILESM KMEVAVTAPV GGRVREIRAQ PGRTLRGGDL VAILET
|
| |