Gene Mext_2306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2306 
Symbol 
ID5835695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2555645 
End bp2557102 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content71% 
IMG OID641368105 
Producturacil-DNA glycosylase superfamily protein 
Protein accessionYP_001639772 
Protein GI163851729 
COG category[L] Replication, recombination and repair 
COG ID[COG1573] Uracil-DNA glycosylase 
TIGRFAM ID[TIGR00758] uracil-DNA glycosylase, family 4 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.555277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0113153 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGAGC CGTTGGAAAA CCTCTCCCCC GCAGAGGGGG GAGGGCGTTT TGGCGGCGGC 
ATCCATGTCG TCAGCCTCGC CCCCGGCGCA GATCTGTCGG GGTTTCGCAC CGCGGCGCGT
CGCCTGATCG CGGCCGAGAT CCCGCCGAAA AACATCGTCT GGCAGACCGA GGCTCCGAGC
CTGTTCGGCG CCGAGTCCGG CTCCGTGGAC GGCCCGCCGC TGCGGCTTCC CCGCGCGGTG
ACCGAACTGA TCCCGATGGT GGTGCCCCAC CGCGATCCCG AGCGCTACGG TCTGCTCTAC
GCCCTGCTCT GGCGCGTCCT GCACGGCGAG CGGGCGCTGA TGGATGTCCT GAGCGATCCG
CTCGTCCACC GCCTCCACCG GATGCGGAAG GCGATCGGCC GCGACCTGCA CAAGATGCAC
GCCTTCCTGC GCTTCCGCCG GGTGCCGGGG GAGGGGGCTG AGCGCTTCGT GGCGTGGTTC
GAGCCCGACC ACCACATCCT GGGGGCCGCC GCGCCCTTCT TCGTCGATCG CTTCGGCGGG
CTGACATGGT CGATCCTGAC GCCCGAGGGC TCAGCGCATT GGGACGGCAC GCTCCGCTTC
GGTCCGCCCG GCCGCCGCGA GGATGTGCCG GAAGGGGACG GCTTCGAGGC CGGCTGGCGC
GACTATTACG AGAGCACCTT CAATCCGGCC CGACTCAACC TCGATGCCAT GCGCGCCGAG
ATGCCCCGCA AGTACTGGCG GAACATGCCG GAGACGGCGG CGATTCCCGG TCTCGTGCGG
GCCGCGAGCG CCCGCGCGCA GGCGATGATC GAGAAGGAGC CGACGATGCC GGTCAAGCGT
GACCCCGTCC GCGCCGTGGC GAAGATGGCC CAGGATGAGC CGGATTCGCT GGAAGCCCTC
AACGCGATCA TCGCTCGCTC CGAACCGCTG GTGCCCGGCG CCACACAAGC CGTGCTCGGC
GAAGGGCCGG TCGGCGCGCG GATCGCCTTC GTCGGCGAGC AGCCGGGCGA TCAGGAGGAT
CGCCAGGGCA AACCCTTCGT CGGGCCGGCG GGGCAGCTTC TCTCCCGCGC GCTGGAAGAG
GCGGGGATCG ACCGGGGGGA GGCCTACCTC ACGAATGCGG TCAAGCACTT CAAATTCACG
CTGCGCGGCA AGCGCCGCAT TCACGAGAAG CCGACGGCCG GCGAGGTGAG CCACTATCGC
TGGTGGCTCG AAAAGGAGCT GGACTTCGTC GCCCCCAAGC TCGTCGTGGC GCTGGGGGCC
ACCGCGGTGC TGTCGCTGAC GGGCAAGCAG ATCCCGATCA CCCGCGCCCG CGGCCCCGCC
GAGTTCGGGC GGCCGTTCGC GGGCTTTATC ACGGTCCACC CCTCCTACTT GCTGCGCCTG
CCCGACGAGG CGGCGAAGGC GGCGGCCTAT CAGGCCTTCG TCGATGACCT GCGGCGGGCC
AACGCCCTCG CGGCGTGA
 
Protein sequence
MGEPLENLSP AEGGGRFGGG IHVVSLAPGA DLSGFRTAAR RLIAAEIPPK NIVWQTEAPS 
LFGAESGSVD GPPLRLPRAV TELIPMVVPH RDPERYGLLY ALLWRVLHGE RALMDVLSDP
LVHRLHRMRK AIGRDLHKMH AFLRFRRVPG EGAERFVAWF EPDHHILGAA APFFVDRFGG
LTWSILTPEG SAHWDGTLRF GPPGRREDVP EGDGFEAGWR DYYESTFNPA RLNLDAMRAE
MPRKYWRNMP ETAAIPGLVR AASARAQAMI EKEPTMPVKR DPVRAVAKMA QDEPDSLEAL
NAIIARSEPL VPGATQAVLG EGPVGARIAF VGEQPGDQED RQGKPFVGPA GQLLSRALEE
AGIDRGEAYL TNAVKHFKFT LRGKRRIHEK PTAGEVSHYR WWLEKELDFV APKLVVALGA
TAVLSLTGKQ IPITRARGPA EFGRPFAGFI TVHPSYLLRL PDEAAKAAAY QAFVDDLRRA
NALAA