Gene Information |
Locus tag | Mext_3016 |
Symbol | ureC |
ID | 5835420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3363508 |
End bp | 3365220 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641368816 |
Product | urease subunit alpha |
Protein accession | YP_001640476 |
Protein GI | 163852433 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
|
|
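The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 3363508
end_bp = 3365220

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon per three bases; the last codon is assumed to be the stop codon,
# which is not translated into an amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1713 0 570
```

Under these assumptions the result agrees with the listed gene length of 1713 bp and protein length of 570 aa.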
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.290532 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCC GCCTGCCCCG CGCCGCCTAT GCCGCCATGT TCGGGCCGAC CACCGGCGAC CGTATTCGGC TGGCCGATAC CAGCCTCGAG ATCGAGGTCG AGCGTGATCT GACGACCTAC GGCGAGGAGG TGAAGTTCGG CGGCGGCAAG GTGATCCGCG ACGGGATGGG CCAGTCGCAG GCGACCAACG CCGAGGGCGC CGTCGATACC GTCATCACCA ACGCGGTGGT GCTCGACCAT TGGGGCATCG TGAAATGCGA TGTCGGCATC CGCGCCGGTC GGATCTTCAA GCTCGGAAAG GCCGGCAACC CGGACGTTCA GGCCAATGTC GACATCGTCG TCGGCCCCGG CACCGAGGTG ATCGCCGGCG AGGGCAAGAT CCTCACGGCC GGCGGCTTCG ACAGCCACAT CCACTTCATC TGCCCGCAGC AGATCGAGGA AGCGCTCTGC TCCGGCGTGA CAACGATGCT GGGTGGCGGC ACCGGGCCGG CGCACGGCAC CTTCGCCACC ACCTGCACCC CCGGCCCCTG GCACATCGCC CGGATGATCG AGGCGGCCGA TTCCTTCCCG ATGAACCTCG CTTTTGCCGG CAAGGGCAAC GCCTCGGTGC CCGGCGCGCT GGAGGAGATG ATCCGCGCCG GCGCCTGCGC GATGAAGCTG CACGAGGACT GGGGCACCAC GCCCGCCGCG ATCGATACCT GCCTCTCGGT GGCGGACGCC TTCGACGTGC AGGTCATGAT CCACACCGAC ACGCTCAACG AATCGGGCTT CGTCGAGGAC ACGATCGCCG CCTTCAAGGG CCGCACCATC CACGCCTTCC ACACCGAGGG TGCCGGTGGC GGCCATGCGC CGGACATCAT GCGGGTCGCC GGCCTCGCGA ACGTGCTCCC GTCGTCCACG AACCCGACGC GGCCCTTCAC GAAGAACACC ATCGACGAGC ATCTCGACAT GCTGATGGTG TGCCACCACC TCGACCCGTC GATTCCTGAA GACCTCGCCT TCGCCGAGAG CCGCATCCGC AAGGAAACCA TCGCGGCGGA AGACATCCTG CACGACATCG GTGCCCTCTC GATGATGTCG TCCGACAGTC AGGCCATGGG CCGCGTCGGC GAAGTCGTGA TCCGCACCTG GCAGACCGCC GACAAGATGA AGCGCCAGCG CGGCGCCCTG CCCGGAGATG CGTCGGGCAA CGACAACCTG CGCGCCCGGC GCTACGTGGC GAAATACACG ATCAACCCGG CCATCGCGCA CGGCGTCTCG CGCCATATCG GCTCGATCGA GCCGGGCAAG CTCGCCGACC TCGTGCTGTG GACGCCCGCC TTCTTCGGCG TGAAGCCGGA CCTTGTGCTC AAGGGCGGCA TGATCGCCAC CGCGCCGATG GGCGACCCCA ACGCCTCGAT CCCGACGCCG CAGCCGGTGC ATTACCGGCC GATGTTCGGC GCCTACGGCC GCGCGCCCTA TGCCACTGCG CTCACCTTCG TCTCGCGGGC GGCGCTCGAG GACGGCGTGA AGGAGAAGCT GCGCGTCTCC AAGGAGCTGG TGGCGGTGGA AAACGTGCGC GGCGGCATTT CGAAGAAGAG CATGATCCTC AACGACGCGA CCCCGAACAT CGAGATCGAC CCCGAGACCT ACGACGTGCG CGCCGACGGC GAATTGCTGG TCTGCGAGCC GGCGGAGGTC TTGCCGATGG CGCAGCGCTA CTTCCTGTTC TGA
|
Protein sequence | MSARLPRAAY AAMFGPTTGD RIRLADTSLE IEVERDLTTY GEEVKFGGGK VIRDGMGQSQ ATNAEGAVDT VITNAVVLDH WGIVKCDVGI RAGRIFKLGK AGNPDVQANV DIVVGPGTEV IAGEGKILTA GGFDSHIHFI CPQQIEEALC SGVTTMLGGG TGPAHGTFAT TCTPGPWHIA RMIEAADSFP MNLAFAGKGN ASVPGALEEM IRAGACAMKL HEDWGTTPAA IDTCLSVADA FDVQVMIHTD TLNESGFVED TIAAFKGRTI HAFHTEGAGG GHAPDIMRVA GLANVLPSST NPTRPFTKNT IDEHLDMLMV CHHLDPSIPE DLAFAESRIR KETIAAEDIL HDIGALSMMS SDSQAMGRVG EVVIRTWQTA DKMKRQRGAL PGDASGNDNL RARRYVAKYT INPAIAHGVS RHIGSIEPGK LADLVLWTPA FFGVKPDLVL KGGMIATAPM GDPNASIPTP QPVHYRPMFG AYGRAPYATA LTFVSRAALE DGVKEKLRVS KELVAVENVR GGISKKSMIL NDATPNIEID PETYDVRADG ELLVCEPAEV LPMAQRYFLF
|
| |
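As a minimal sketch of how the gene and protein sequences above relate, the following translates short excerpts of the gene sequence and computes a GC fraction. It assumes the sequence as listed is already the coding strand (the record reports the gene on the minus strand, so no reverse complement is applied here), and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering. Translation
# table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# Excerpts copied from the gene sequence above: the first 30 bases and the
# last 12 bases (both in frame, since the gene length is a multiple of 3).
first_bases = "ATGTCCGCCC GCCTGCCCCG CGCCGCCTAT"
last_bases = "TTCCTGTTC TGA"

print(translate(first_bases))  # MSARLPRAAY
print(translate(last_bases))   # FLF
```

The two printed strings match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.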