Gene Mext_1197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1197 
SymbolureC 
ID5832043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1320454 
End bp1322166 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content67% 
IMG OID641366990 
Producturease subunit alpha 
Protein accessionYP_001638670 
Protein GI163850627 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0804] Urea amidohydrolase (urease) alpha subunit 
TIGRFAM ID[TIGR01792] urease, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.770488 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.207486 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACGA TCCCAAGGCG CGATTACGCC GCCCTCTACG GCCCCACCAC CGGCGACGGC 
GTGCGGCTCG CCGACACCTC TCTCGTCGCG GTGGTCGAGC ACGACCACGC GGTCTACGGC
GACGAGTGCC TGCATGGCGG CGGCAAGACT CTTCGGGATG GCATCGGGCT CGCGCCCGGC
GTCACGGCGG CGCAGGGCGC GCTCGATTTC CTGCTGTGCA ACGTACTCGT GATTGATCCC
GTCCTCGGCA TCGTGAAGGG CGATCTCGGC ATCAAGGACG GGCGCATCGT CGGCCTCGGC
AAGGCCGGCA ACCCGGCAAT CATGGACGGC GTCGATCCGC GCCTGATCGT ATCGGCCGGC
ACCACGGTGC GCGATTGCGA GGGGCTGATC GCGACGCCCG GCGCCATCGA CGTTCACGTC
CATTTCGATT CCGCCGGCCT GCCCGATCAC GCCATCGCCT CAGGGATCAC GACGCTGCTC
GGCGGCTCGC TCGGGCCGAT CACCGTCGGC ATCGATTCCG GCGGGCCGTT CAACACGGGG
AAAATGCTCC AGGCGGCGGA AGCCTGGCCC GTCAATTTCG GTTTTCTGGG AAGGGGCAAC
ACCCACAAGC CGGCGGCCTT GAAGGAACAG CTTGAAACCG GCGTGCTCGG CCTCAAAATC
CACGAGGATT GGGGAGCGAT GCCGGCGGCG ATCGATGCCT GCCTCGGCTT TGCCGACGAA
TACGACTTCC AAGTCCAACT CCACACCGAC ACGCTCAACG AGTCGGGCTT CGTGGAGGAC
ACGCTGGCGG CGATCGGCGG CCGCACGATC CACATGTACC ACACCGAGGG TGCGGGCGGC
GGGCACGCGC CCGACATCAT CCGCGTGGCG GGGCTGCCCC ACTGCCTGCC CTCCTCGACG
AACCCGACCA ATCCCTACAC GGTCAACACG TTCGACGAGC ACCTCGACAT GACGATGGTG
TGCCACCACC TCAACCCGGC TTTGCCGGAG GACGTGGCCT TCGCCGAGAG CCGCATCCGC
GCGCAGACCA TCGCGGCCGA GGACGTGCTG CACGATATCG GCGCGATCTC GATGCTCGGC
TCCGACAGCC AGGGCATGGG CCGCATCCAC GAGGTGATCT GCCGGACATG GCAGCTCGCC
TCCAAGATGA AGGACCAGCG CGGGAGCCTG CCGGAGGAAC GGCCGGGCTT CGGCGACAAT
GCCCGGATCA AGCGCTACAT CGCCAAGTAC ACCATCAACG CCGCGCGCAC CTTCGGCATC
CAGGAGCACA TCGGCTCGCT GGAGCCCGGC AAGATGGCCG ACATCGTGAT CTGGCGCCCG
GCCTTCTTCG GCATCAAGCC GGAACTCGTG ATCAAGGGCG GCTTCATCGC CTGGGGCGCC
ATGGGCGATT CCGCGGCCTC CCTGATGACC TGCGAGCCGA TGCTGATGCG CCCGCAATGG
GGCGCGTTCG GGCTGGCCAA GCAGGGCCTG TCGGCCTGCT TCGTCCACCC GCTCGCCATC
GAGGGGGGGC TGCGCGAGAG CCTTGGCCTG CGAAAGAACC TGCTGCCGGC CCGCGGCACC
CGCACGCTGA CCAAAGCCGA CATGCTCTGG AACGACGCCT GCCCGGACAT CCGGGTCGAT
CCGCAGACCT TCGAGGTCTT CGTGGACGGG GAACTCGCCA CCTGCGAGCC CGCCACCGTC
CTCCCGCTCG CCCAACGCTA CATGCTGCGA TGA
 
Protein sequence
MVTIPRRDYA ALYGPTTGDG VRLADTSLVA VVEHDHAVYG DECLHGGGKT LRDGIGLAPG 
VTAAQGALDF LLCNVLVIDP VLGIVKGDLG IKDGRIVGLG KAGNPAIMDG VDPRLIVSAG
TTVRDCEGLI ATPGAIDVHV HFDSAGLPDH AIASGITTLL GGSLGPITVG IDSGGPFNTG
KMLQAAEAWP VNFGFLGRGN THKPAALKEQ LETGVLGLKI HEDWGAMPAA IDACLGFADE
YDFQVQLHTD TLNESGFVED TLAAIGGRTI HMYHTEGAGG GHAPDIIRVA GLPHCLPSST
NPTNPYTVNT FDEHLDMTMV CHHLNPALPE DVAFAESRIR AQTIAAEDVL HDIGAISMLG
SDSQGMGRIH EVICRTWQLA SKMKDQRGSL PEERPGFGDN ARIKRYIAKY TINAARTFGI
QEHIGSLEPG KMADIVIWRP AFFGIKPELV IKGGFIAWGA MGDSAASLMT CEPMLMRPQW
GAFGLAKQGL SACFVHPLAI EGGLRESLGL RKNLLPARGT RTLTKADMLW NDACPDIRVD
PQTFEVFVDG ELATCEPATV LPLAQRYMLR