Gene Information |
Locus tag | Mext_2306 |
Symbol | |
ID | 5835695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2555645 |
End bp | 2557102 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641368105 |
Product | uracil-DNA glycosylase superfamily protein |
Protein accession | YP_001639772 |
Protein GI | 163851729 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
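
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence in this record ends in TGA). Neither convention is stated in the record. The helper names `cds_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final (stop) codon encodes no residue.
    return gene_length_bp // 3 - 1


# Values copied from the record for Mext_2306.
gene_len = cds_length(2555645, 2557102)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1458 485
```

Under these assumptions the computed values agree with the Gene Length (1458 bp) and Protein Length (485 aa) fields.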
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.555277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0113153 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGAGC CGTTGGAAAA CCTCTCCCCC GCAGAGGGGG GAGGGCGTTT TGGCGGCGGC ATCCATGTCG TCAGCCTCGC CCCCGGCGCA GATCTGTCGG GGTTTCGCAC CGCGGCGCGT CGCCTGATCG CGGCCGAGAT CCCGCCGAAA AACATCGTCT GGCAGACCGA GGCTCCGAGC CTGTTCGGCG CCGAGTCCGG CTCCGTGGAC GGCCCGCCGC TGCGGCTTCC CCGCGCGGTG ACCGAACTGA TCCCGATGGT GGTGCCCCAC CGCGATCCCG AGCGCTACGG TCTGCTCTAC GCCCTGCTCT GGCGCGTCCT GCACGGCGAG CGGGCGCTGA TGGATGTCCT GAGCGATCCG CTCGTCCACC GCCTCCACCG GATGCGGAAG GCGATCGGCC GCGACCTGCA CAAGATGCAC GCCTTCCTGC GCTTCCGCCG GGTGCCGGGG GAGGGGGCTG AGCGCTTCGT GGCGTGGTTC GAGCCCGACC ACCACATCCT GGGGGCCGCC GCGCCCTTCT TCGTCGATCG CTTCGGCGGG CTGACATGGT CGATCCTGAC GCCCGAGGGC TCAGCGCATT GGGACGGCAC GCTCCGCTTC GGTCCGCCCG GCCGCCGCGA GGATGTGCCG GAAGGGGACG GCTTCGAGGC CGGCTGGCGC GACTATTACG AGAGCACCTT CAATCCGGCC CGACTCAACC TCGATGCCAT GCGCGCCGAG ATGCCCCGCA AGTACTGGCG GAACATGCCG GAGACGGCGG CGATTCCCGG TCTCGTGCGG GCCGCGAGCG CCCGCGCGCA GGCGATGATC GAGAAGGAGC CGACGATGCC GGTCAAGCGT GACCCCGTCC GCGCCGTGGC GAAGATGGCC CAGGATGAGC CGGATTCGCT GGAAGCCCTC AACGCGATCA TCGCTCGCTC CGAACCGCTG GTGCCCGGCG CCACACAAGC CGTGCTCGGC GAAGGGCCGG TCGGCGCGCG GATCGCCTTC GTCGGCGAGC AGCCGGGCGA TCAGGAGGAT CGCCAGGGCA AACCCTTCGT CGGGCCGGCG GGGCAGCTTC TCTCCCGCGC GCTGGAAGAG GCGGGGATCG ACCGGGGGGA GGCCTACCTC ACGAATGCGG TCAAGCACTT CAAATTCACG CTGCGCGGCA AGCGCCGCAT TCACGAGAAG CCGACGGCCG GCGAGGTGAG CCACTATCGC TGGTGGCTCG AAAAGGAGCT GGACTTCGTC GCCCCCAAGC TCGTCGTGGC GCTGGGGGCC ACCGCGGTGC TGTCGCTGAC GGGCAAGCAG ATCCCGATCA CCCGCGCCCG CGGCCCCGCC GAGTTCGGGC GGCCGTTCGC GGGCTTTATC ACGGTCCACC CCTCCTACTT GCTGCGCCTG CCCGACGAGG CGGCGAAGGC GGCGGCCTAT CAGGCCTTCG TCGATGACCT GCGGCGGGCC AACGCCCTCG CGGCGTGA
|
Protein sequence | MGEPLENLSP AEGGGRFGGG IHVVSLAPGA DLSGFRTAAR RLIAAEIPPK NIVWQTEAPS LFGAESGSVD GPPLRLPRAV TELIPMVVPH RDPERYGLLY ALLWRVLHGE RALMDVLSDP LVHRLHRMRK AIGRDLHKMH AFLRFRRVPG EGAERFVAWF EPDHHILGAA APFFVDRFGG LTWSILTPEG SAHWDGTLRF GPPGRREDVP EGDGFEAGWR DYYESTFNPA RLNLDAMRAE MPRKYWRNMP ETAAIPGLVR AASARAQAMI EKEPTMPVKR DPVRAVAKMA QDEPDSLEAL NAIIARSEPL VPGATQAVLG EGPVGARIAF VGEQPGDQED RQGKPFVGPA GQLLSRALEE AGIDRGEAYL TNAVKHFKFT LRGKRRIHEK PTAGEVSHYR WWLEKELDFV APKLVVALGA TAVLSLTGKQ IPITRARGPA EFGRPFAGFI TVHPSYLLRL PDEAAKAAAY QAFVDDLRRA NALAA