Gene Mext_3016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3016 
SymbolureC 
ID5835420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3363508 
End bp3365220 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content67% 
IMG OID641368816 
Producturease subunit alpha 
Protein accessionYP_001640476 
Protein GI163852433 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0804] Urea amidohydrolase (urease) alpha subunit 
TIGRFAM ID[TIGR01792] urease, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.290532 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCCC GCCTGCCCCG CGCCGCCTAT GCCGCCATGT TCGGGCCGAC CACCGGCGAC 
CGTATTCGGC TGGCCGATAC CAGCCTCGAG ATCGAGGTCG AGCGTGATCT GACGACCTAC
GGCGAGGAGG TGAAGTTCGG CGGCGGCAAG GTGATCCGCG ACGGGATGGG CCAGTCGCAG
GCGACCAACG CCGAGGGCGC CGTCGATACC GTCATCACCA ACGCGGTGGT GCTCGACCAT
TGGGGCATCG TGAAATGCGA TGTCGGCATC CGCGCCGGTC GGATCTTCAA GCTCGGAAAG
GCCGGCAACC CGGACGTTCA GGCCAATGTC GACATCGTCG TCGGCCCCGG CACCGAGGTG
ATCGCCGGCG AGGGCAAGAT CCTCACGGCC GGCGGCTTCG ACAGCCACAT CCACTTCATC
TGCCCGCAGC AGATCGAGGA AGCGCTCTGC TCCGGCGTGA CAACGATGCT GGGTGGCGGC
ACCGGGCCGG CGCACGGCAC CTTCGCCACC ACCTGCACCC CCGGCCCCTG GCACATCGCC
CGGATGATCG AGGCGGCCGA TTCCTTCCCG ATGAACCTCG CTTTTGCCGG CAAGGGCAAC
GCCTCGGTGC CCGGCGCGCT GGAGGAGATG ATCCGCGCCG GCGCCTGCGC GATGAAGCTG
CACGAGGACT GGGGCACCAC GCCCGCCGCG ATCGATACCT GCCTCTCGGT GGCGGACGCC
TTCGACGTGC AGGTCATGAT CCACACCGAC ACGCTCAACG AATCGGGCTT CGTCGAGGAC
ACGATCGCCG CCTTCAAGGG CCGCACCATC CACGCCTTCC ACACCGAGGG TGCCGGTGGC
GGCCATGCGC CGGACATCAT GCGGGTCGCC GGCCTCGCGA ACGTGCTCCC GTCGTCCACG
AACCCGACGC GGCCCTTCAC GAAGAACACC ATCGACGAGC ATCTCGACAT GCTGATGGTG
TGCCACCACC TCGACCCGTC GATTCCTGAA GACCTCGCCT TCGCCGAGAG CCGCATCCGC
AAGGAAACCA TCGCGGCGGA AGACATCCTG CACGACATCG GTGCCCTCTC GATGATGTCG
TCCGACAGTC AGGCCATGGG CCGCGTCGGC GAAGTCGTGA TCCGCACCTG GCAGACCGCC
GACAAGATGA AGCGCCAGCG CGGCGCCCTG CCCGGAGATG CGTCGGGCAA CGACAACCTG
CGCGCCCGGC GCTACGTGGC GAAATACACG ATCAACCCGG CCATCGCGCA CGGCGTCTCG
CGCCATATCG GCTCGATCGA GCCGGGCAAG CTCGCCGACC TCGTGCTGTG GACGCCCGCC
TTCTTCGGCG TGAAGCCGGA CCTTGTGCTC AAGGGCGGCA TGATCGCCAC CGCGCCGATG
GGCGACCCCA ACGCCTCGAT CCCGACGCCG CAGCCGGTGC ATTACCGGCC GATGTTCGGC
GCCTACGGCC GCGCGCCCTA TGCCACTGCG CTCACCTTCG TCTCGCGGGC GGCGCTCGAG
GACGGCGTGA AGGAGAAGCT GCGCGTCTCC AAGGAGCTGG TGGCGGTGGA AAACGTGCGC
GGCGGCATTT CGAAGAAGAG CATGATCCTC AACGACGCGA CCCCGAACAT CGAGATCGAC
CCCGAGACCT ACGACGTGCG CGCCGACGGC GAATTGCTGG TCTGCGAGCC GGCGGAGGTC
TTGCCGATGG CGCAGCGCTA CTTCCTGTTC TGA
 
Protein sequence
MSARLPRAAY AAMFGPTTGD RIRLADTSLE IEVERDLTTY GEEVKFGGGK VIRDGMGQSQ 
ATNAEGAVDT VITNAVVLDH WGIVKCDVGI RAGRIFKLGK AGNPDVQANV DIVVGPGTEV
IAGEGKILTA GGFDSHIHFI CPQQIEEALC SGVTTMLGGG TGPAHGTFAT TCTPGPWHIA
RMIEAADSFP MNLAFAGKGN ASVPGALEEM IRAGACAMKL HEDWGTTPAA IDTCLSVADA
FDVQVMIHTD TLNESGFVED TIAAFKGRTI HAFHTEGAGG GHAPDIMRVA GLANVLPSST
NPTRPFTKNT IDEHLDMLMV CHHLDPSIPE DLAFAESRIR KETIAAEDIL HDIGALSMMS
SDSQAMGRVG EVVIRTWQTA DKMKRQRGAL PGDASGNDNL RARRYVAKYT INPAIAHGVS
RHIGSIEPGK LADLVLWTPA FFGVKPDLVL KGGMIATAPM GDPNASIPTP QPVHYRPMFG
AYGRAPYATA LTFVSRAALE DGVKEKLRVS KELVAVENVR GGISKKSMIL NDATPNIEID
PETYDVRADG ELLVCEPAEV LPMAQRYFLF