Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_6621 |
Symbol | |
ID | 6130884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 7287543 |
End bp | 7289024 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641646710 |
Product | uracil-DNA glycosylase superfamily protein |
Protein accession | YP_001773309 |
Protein GI | 170744654 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.647751 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.384505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCAGCA TCCCGCTGAG GCCCGGCGCC GACCTCGAAG GGTTCCGGGC CGCGGTGCGC CGCCTCGCGG CGGAGGAGTG CCCTCCCGAG GCGGTGACCT TCACGCAGGG CGGGGCCCCC GGCCTGTTCG GGCCCGAACC CGGCCTGTTC GGGGCCGAGC CCGGCCTGTT CGGGGCCGAG GCGGCGCGCG CGTCCGAGGA GGCCGGGCCG GCGCCCCCGC TCGCCCTGCC GCGGGCCGTG GCGGCGCTGG TGCCCGCGAT CGTGCCCCAC CGGGACGAGG CGCGCTACGC GCTGCTCTAC CAGCTGATCT GGCGGGTGCG GCGGGGCGAG CGCCACCTCC CCGAGGTGGC GAGCGATCCC CTGGTCCACC GGCTGACGCG CATGCGCCGA GCGGTCGCGC GCGACCTGCA CAAGATGCAC GCCTTCCTGC GCTTCCGGCG CGTGCCCGGC GAGGGAGAGC GGGAGCGCTT CGCCGCGTGG TTCGAGCCCG ACCACCACAT CCTGGAGGCG GCCGCGCCGT TCTTCGTGGC GCGTTTCCCG GGCTTCGACT GGTTGATCCT GACGCCCGAG GGCTCGGCCC ACTGGGACGG CGCGCGCCTG CGCTTCGGCC CACCCGAGCG GCGCGAGGCG CTGCCCGCCG GCGACGCCTT CGAGGCCGGC TGGAGCGCCT ATTACGCCAG CACCTTCAAC CCGGCCCGGA CCAACCTCGC GGCGATGCGG GCGGAGATGC CCAAGAAGTA CTGGCGCAAC CTGCCCGAGG CGGCGGCGAT TCCCGACCTC GTGCGCAACG CGCGCCGCCG CGTCGCCGCG ATGATCGAGA GGGAGCCCGC CATGCCGCGG AAACGCACGC CCACCCGCGC CCTCGACGCC ATGGCGGCGC AGGGACCGGA GGATCTCGAG GCCCTGAACG CCCTGATCCG GCGCTCCGAA CCGCTGGTGC CGGGGGCGAC GCAGGCGGTG CTGGGCGAGG GACCGGTCGG CGCCGCGATC GCCTTCGTGG GCGAGCAGCC CGGGGACCAG GAGGACCGGC TGGGGCGGCC CTTCGTCGGC CCGGCGGGGC AGCTCCTCAC CCGGGCGATG GAGGAGGCCG GCCTCGCGCG TGGTTCCTGC TATCTCACGA ATGCGGTCAA GCACTTCAAG TTCGAGGAGC GCGGCAAGCG GCGCATCCAC CAGAAGCCGA CCGCCGGGGA GGTCGCGCAT GGCCGCTGGT GGCTCGACCG GGAGCTCGGC TTCGTGCATC CGCGCCTCGT CGTCGCGCTC GGGGCGACGG CGGTGCTGGC GCTGACCGGC AAGGCGATCC CGATCACCCG GGCCCGCGGC CCGGCCCGGT TCGACGGCAA ACCCTATGCG GGCTTCGTCA CCGTGCACCC CTCCTACCTG CTGCGCCTGC CCGAGGAGGC GAAGGCCGAG GCCTATGCGG GTTTCGTGGA CGATCTGCGC CGGGTGCGAA TGCTGGCGCA GGAACTCGCC GGGGCGGCGT AG
|
Protein sequence | MRSIPLRPGA DLEGFRAAVR RLAAEECPPE AVTFTQGGAP GLFGPEPGLF GAEPGLFGAE AARASEEAGP APPLALPRAV AALVPAIVPH RDEARYALLY QLIWRVRRGE RHLPEVASDP LVHRLTRMRR AVARDLHKMH AFLRFRRVPG EGERERFAAW FEPDHHILEA AAPFFVARFP GFDWLILTPE GSAHWDGARL RFGPPERREA LPAGDAFEAG WSAYYASTFN PARTNLAAMR AEMPKKYWRN LPEAAAIPDL VRNARRRVAA MIEREPAMPR KRTPTRALDA MAAQGPEDLE ALNALIRRSE PLVPGATQAV LGEGPVGAAI AFVGEQPGDQ EDRLGRPFVG PAGQLLTRAM EEAGLARGSC YLTNAVKHFK FEERGKRRIH QKPTAGEVAH GRWWLDRELG FVHPRLVVAL GATAVLALTG KAIPITRARG PARFDGKPYA GFVTVHPSYL LRLPEEAKAE AYAGFVDDLR RVRMLAQELA GAA
|
| |