Gene Mext_3924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3924 
Symbol 
ID5834821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4362546 
End bp4363655 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content73% 
IMG OID641369715 
Productagmatinase 
Protein accessionYP_001641366 
Protein GI163853323 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0571349 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAGTG CGGGTGAGAT CGCACCATCG CTGCACGTGG CTGTGGAGAC GCCGCGCCGG 
GGGAGGATCG TCATGGCGGA TGAGAGCGAA TCGGCGGGCG CGCGGGCGGA ACGGCTGGCG
CGGTTCCAGC CGGCCTCGGG GATGGTGACG CCACGCTTCT CGGGGCTGGC GAGCTTCATG
CGGCTGCCGG TGCTCGATCC CGCCGAGGCG GTGGGAGACA GGGCCGGCGA AGGGGCTGGC
GAAGGGACTG GACTGGTCGA GATCGGTCTG ATCGGCATCC CCTTCGACGG CACCACCACC
AACCGCCCCG GTGCCCGGCT CGGACCGCGG GCCGTGCGCG AAGCCTCCAC CGGCACGCGG
GCGCTCAACC ACGCCACGGG GGTGGCGCCC TACGCCCTGG CCGCCTGCGC CGATCTCGGC
GACGTGCCGG TCAACCCGGT GGACGCCGCC GAGACCGCCC GGCGGATCGA GGCGTTCTAC
CGGCCGCTCG CCGAGGCCGG GATCGTGCCG CTCACGGTCG GCGGCGACCA TTTCATCACC
TATCCGGTGC TGCGGGCGCT CGGGGCCGCC CGGCCGCTCG GGCTGATCCA TATCGACGCC
CACAGCGACA CCGACGACAC TCAGTATGGC GGGGCGCGGC TCACCCACGG CACGCCGTTC
CGGCGCGCGA TCGAGGACGG GGTGCTCGAT CCGCGGCGCT GCATCCAGAT CGGCATCCGC
GGCAGCATGG ATGCGGCCGA CGAGCGCGAC TGGGCCCTGG CGCAGGGCAT GCGCATCCTC
ACGATGGAGG AGGTCTGCGC CCGCGGCCTG CCGGAGGTGG CCGCGGAAGC CCGCGCCGTG
ACCGGCGACG GCCCGACCTA TCTCAGCTTC GACATCGACG CCCTCGATCC CGCCTTCGCC
CCCGGCACCG GCACGCCGGA GATCGGCGGC TTCACCACTC GCGAGGCGCT GCACCTGCTG
CGGGCCCTGC GCGGCCTCGA TCTCGTCGGG GCGGATGTGG TGGAGGTCGC TCCTCCGCTC
GATTCCGCCG GCATCACGGG TTTGGCGGGC GCCGGCATCG CCTTCGAGAT CCTGTGCCTG
CTGGCCGAGC GGGTCGCCGC ACGGCGCTGA
 
Protein sequence
MRSAGEIAPS LHVAVETPRR GRIVMADESE SAGARAERLA RFQPASGMVT PRFSGLASFM 
RLPVLDPAEA VGDRAGEGAG EGTGLVEIGL IGIPFDGTTT NRPGARLGPR AVREASTGTR
ALNHATGVAP YALAACADLG DVPVNPVDAA ETARRIEAFY RPLAEAGIVP LTVGGDHFIT
YPVLRALGAA RPLGLIHIDA HSDTDDTQYG GARLTHGTPF RRAIEDGVLD PRRCIQIGIR
GSMDAADERD WALAQGMRIL TMEEVCARGL PEVAAEARAV TGDGPTYLSF DIDALDPAFA
PGTGTPEIGG FTTREALHLL RALRGLDLVG ADVVEVAPPL DSAGITGLAG AGIAFEILCL
LAERVAARR