Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_2581 |
Symbol | |
ID | 7117328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 2713737 |
End bp | 2715194 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643525328 |
Product | Uracil-DNA glycosylase superfamily |
Protein accession | YP_002421350 |
Protein GI | 218530534 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.148154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.520534 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGAGC CGTTGGAAAA CCCCTCCCCC GCAGAGGGGG GAGGGCGTTT TGGCGGCGGC ATCCATGTCG TCAGTCTCGC CCCCGGCGCA GATCTTTCGG GGTTTCGCAC CGCGGCGCGG CGCCTGATCG CGGCCGAGAT CCCGCCGAAA AACATCGTCT GGCAGACCGA GGCTCCGAGC CTGTTCGGCG CCGAGTCCGG CTCCGTGGAC GGCCCGCCGT TGCGGCTTCC CCGCGCGGTG ACCGAGCTGA TCCCGATGGT GGTGCCGCAC CGCGATCCCG AGCGCTACGG CCTGCTCTAC GCCCTGCTCT GGCGCGTCTT GCACGGTGAG CGGGCGCTGA TGGATGTCCT GAGCGATCCG CTCGTCCACC GCCTCCACCG GATGCGGAAG GCCATCGGCC GCGACCTGCA CAAGATGCAC GCCTTCCTGC GCTTCCGCCG GGTACCGGGG GAGGGGGGCG AGCGCTTCGT GGCGTGGTTC GAGCCCGATC ACCACATCCT GGGGGCCGCG GCACCCTTCT TCGTCGATCG CTTCCGCGGG CTGACATGGT CGATCCTGAC GCCGGAAGGG TCGGCGCATT GGGACGGTAC GCTCCGCTTC GGTCCGCCCG GCCGCCGCGA GGATGTGCCG GAGGGCGACG GCTTCGAGGC CGGTTGGCGC GACTATTACG AGAGCACCTT CAATCCGGCC CGGCTCAACC TCAATGCCAT GCGTGCCGAG ATGCCCCGCA AGTACTGGCG GAACATGCCG GAGACGGCGG CGATTCCTGC CCTCGTGCGG GCCGCGAGCG CCCGCGCGCA GGCGATGATC GAGAAGGAGC CGACGATGCC GGTCAAGCGC GACCCCGTCC GCGCCGTGGC GAAGATGGCC CAGGATGAGC CGGATTCGCT GGAAGCCCTC AACGCCATCA TCGCCCGCTC CGAACCGCTG GTGCCCGGCG CCACCCAGGC CGTGCTCGGC GAAGGACCGG TCGGCGCGCG GATCGCTTTC GTCGGCGAGC AGCCGGGCGA TCAGGAGGAT CGCCAGGGCC GGCCCTTCGT CGGGCCGGCG GGGCAGCTTC TCTCCCGCGC GCTGGAAGAG GCGGGGATCG ACCGGGGGGA GGCCTACCTC ACGAATGCGG TCAAGCACTT CAAATTCACG CTGCGCGGCA AGCGCCGCAT CCACGAGAAG CCAACCGCGG GCGAGGTGAG CCATTACCGC TGGTGGCTCG AGAAGGAGCT GGACTTCGTC GCCCCCAAGC TCGTCGTGGC GCTGGGGGCC ACCGCGGTGC TGTCGCTGAC GGGCAAGCAG ATCCCGATCA CCCGCGCCCG CGGACCCGCC GAGTTCGGGC GGCCGTTCGC GGGCTTCATC ACGGTCCACC CCTCCTACCT GCTGCGCCTG CCCGACGAGG CGGCGAAGGC GGCGGCCTAT CAGGCCTTCG TCGATGACCT GCGGCGGGCC AACGCCCTCG CGGCGTGA
|
Protein sequence | MGEPLENPSP AEGGGRFGGG IHVVSLAPGA DLSGFRTAAR RLIAAEIPPK NIVWQTEAPS LFGAESGSVD GPPLRLPRAV TELIPMVVPH RDPERYGLLY ALLWRVLHGE RALMDVLSDP LVHRLHRMRK AIGRDLHKMH AFLRFRRVPG EGGERFVAWF EPDHHILGAA APFFVDRFRG LTWSILTPEG SAHWDGTLRF GPPGRREDVP EGDGFEAGWR DYYESTFNPA RLNLNAMRAE MPRKYWRNMP ETAAIPALVR AASARAQAMI EKEPTMPVKR DPVRAVAKMA QDEPDSLEAL NAIIARSEPL VPGATQAVLG EGPVGARIAF VGEQPGDQED RQGRPFVGPA GQLLSRALEE AGIDRGEAYL TNAVKHFKFT LRGKRRIHEK PTAGEVSHYR WWLEKELDFV APKLVVALGA TAVLSLTGKQ IPITRARGPA EFGRPFAGFI TVHPSYLLRL PDEAAKAAAY QAFVDDLRRA NALAA
|
| |