Gene Mext_3020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3020 
Symbol 
ID5835424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3367542 
End bp3368558 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content74% 
IMG OID641368820 
Producturease accessory protein UreD 
Protein accessionYP_001640480 
Protein GI163852437 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0829] Urease accessory protein UreH 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.187317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCT CACGGTCAAG AACGGCCGTC GCTGCCCGCA TCCTGTGCCA TCGCTGCCGA 
AGATCGGTGC GGGATGGCAC CCGACCGGTG GGGCCCGCCA CCTTGCGGAC CGGGGCTTCG
GTGGCCGATG AAGCCGGCAC CCGCTCCGCC GGAGGCCGTC CGATCCCTGC CGCCGAACCG
CTCCGCCCCG CCTTGTCCCG CCAGCGCTCG CAGGGCGCGG TGCATCTGCG CGTCGCCCCG
GCCGGAACGG CCGCGGACGC GCCGACGCGG ATCGTCGATC TCGCCGAGAG TGGCCCCTTG
CGCCTGCGCT GTCCCCGCCA GGGGGCCGAG CGGATGCTGG AGGGCGTGCT GGTCAATACC
GGCGGCGGCA TCGCCTGCGG CGATGTGTTC ACGGTGTCGG TGACGGTCGA GCCGGGTGGG
GCCTGCGTGC TGACCACCAC CGCGGCGGAG AAGATCTACC GCTCGGACGG ACCCTGCGCG
GAGATCGTCA ACCGGGCGAG CGTCGGCGCG GGCGGGCGGC TCGATTGGCT GCCGCAGGAG
ACGATCCTGT TCGACCGCGC CCGGCTGGTG CGCCGCTTCG AAGCGGATCT TGCCCCCGAC
GCGTCGCTGC TCGTGGCCGA GATCGCGGTG CTCGGCCGTG CCGCTCGCGG CGAAAGCCTG
GAGCAGGCCC TGTTCGAGGA TCGCTGGCGC ATCCGCCGCG ACGGCCGCCT TGTCTACGCC
GACAGCCTGC GCCTCGACGG CGCGGTCACG GCCCTCATGA ACCGCCGGGC GATCGGCGGC
GGGGCCCGCG CATTGGCGAC GATCCTCGAC CTTTCGCTGC GTGCGGAAGG CCGGCTCGAC
GAGGCCCGTG CCCTTCTCGA CGCCCTGCCG GCGCAGGTCG AGGCCGGGGC GAGCGCCTGG
AACGGTCACC TCGCCGTGCG GATGCTGGCC CCCACCGTCG CTCCCCTGCG CGACGCCGCC
GCCCGCTTCC TTGCTGCATG GCGCGGGCAG CCGATGCCGC GCGTGTGGCA GACCTGA
 
Protein sequence
MTTSRSRTAV AARILCHRCR RSVRDGTRPV GPATLRTGAS VADEAGTRSA GGRPIPAAEP 
LRPALSRQRS QGAVHLRVAP AGTAADAPTR IVDLAESGPL RLRCPRQGAE RMLEGVLVNT
GGGIACGDVF TVSVTVEPGG ACVLTTTAAE KIYRSDGPCA EIVNRASVGA GGRLDWLPQE
TILFDRARLV RRFEADLAPD ASLLVAEIAV LGRAARGESL EQALFEDRWR IRRDGRLVYA
DSLRLDGAVT ALMNRRAIGG GARALATILD LSLRAEGRLD EARALLDALP AQVEAGASAW
NGHLAVRMLA PTVAPLRDAA ARFLAAWRGQ PMPRVWQT