Gene Information |
Locus tag | Mlg_2649 |
Symbol | |
ID | 4268539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2998948 |
End bp | 2999781 |
Gene Length | 834 bp |
Protein Length | 277 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 638127408 |
Product | formamidopyrimidine-DNA glycosylase / DNA-(apurinic or apyrimidinic site) lyase |
Protein accession | YP_743479 |
Protein GI | 114321796 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
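The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that "Start bp" and "End bp" are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2998948
END_BP = 2999781


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 834 277
```

Both results agree with the "Gene Length" and "Protein Length" fields of the record.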
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.849612 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCTGAGC TGCCCGAAGT GGAGACCACC CGGCGCGGCC TCGCCCCGCT GCTGGAGGGC CGGCGTGTAA CCGGAATGAC CGTGCGCCAG GCCCGGCTGC GCTGGCCGGT ACCGGCGGGC CTGCCCGATG CCATCACCGG CCAGACGATC CGGGCCGTGG ACCGGCGGGC CAAATACCTG CTCTTCCGTA CCCCGGCGGG GACGCTGATC CTGCATCTGG GGATGTCCGG CAGCCTGCGG GTGATCCCCG GCCAGCAGGC CGGCGCGTGC GCCGTCCCCC CCGGGCGCCA CGACCATGTG GACCTGCGCC TGGCCGACGG CAGCTGCCTG CGCTATACCG ATCCGCGCCG GTTCGGCAGC CTCCACTGGT GCACCGGTGA ACCGGAGGCG CACTGGCTGC TCCACCGGCT GGGTCCGGAG CCCTTCGACA CCGCCTTCGA CGGCGATAGG CTCCACCGCC TGAGCCGCGG CCGGCGAACC AGCGTCAAGG CCTTCATCAT GGACAGCGGG ATCGTGGTGG GGGTGGGGAA CATCTACGCC AGCGAGTCGC TGTTCCGGGC CGGCATCCAC CCGGGACGCC CGGCCGGGCG GGTGGGCCTC GCTCGTTACC GGCGCCTGGC CGGCGCCGTG CGCGAGGTGC TGGCCGAGGC CATAGCCGCC GGTGGCACGA CCCTGCGGGA CTTCACCGCC AGCGACGGCC GCCCCGGCTA CTTCGCCCAG ACCCTCAACG TCTATGGCCG GGCGGGCGCG CCCTGCCCCC GCTGCGGCCG GTCCATCCGT CAGCGACGCA TCGCACAGCG CTCCACCTGG TATTGCCCGG GCTGCCAGCG CTGA
Protein sequence | MPELPEVETT RRGLAPLLEG RRVTGMTVRQ ARLRWPVPAG LPDAITGQTI RAVDRRAKYL LFRTPAGTLI LHLGMSGSLR VIPGQQAGAC AVPPGRHDHV DLRLADGSCL RYTDPRRFGS LHWCTGEPEA HWLLHRLGPE PFDTAFDGDR LHRLSRGRRT SVKAFIMDSG IVVGVGNIYA SESLFRAGIH PGRPAGRVGL ARYRRLAGAV REVLAEAIAA GGTTLRDFTA SDGRPGYFAQ TLNVYGRAGA PCPRCGRSIR QRRIAQRSTW YCPGCQR
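The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be stripped before any computation. As a minimal sketch, the helper below removes the whitespace and computes the G+C fraction of a nucleotide string. The function names are hypothetical, and whether the record's "GC content" field was derived in exactly this way is an assumption.

```python
def clean_sequence(raw):
    """Remove the block spacing (any whitespace) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above, used as a small example.
example = "ATGCCTGAGC TGCCCGAAGT"
print(len(clean_sequence(example)), gc_fraction(example))
```

Passing the full "Gene sequence" string to `gc_fraction` gives a value that can be compared with the "GC content" field.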