Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0522 |
Symbol | |
ID | 4027692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 578600 |
End bp | 579661 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637965686 |
Product | A/G-specific DNA-adenine glycosylase |
Protein accession | YP_572583 |
Protein GI | 92112655 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000130755 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGACA TTCCGCTTAC CGCCGAGGCC TTTCGTCAGC GGCTCTTCGC ATGGTTCGAC GAGCATGGAC GCAAGACGCT GCCCTGGCAG TTCGACAAGA CGCCCTACCG TGTCTGGGTC TCGGAAATCA TGCTGCAGCA GACCCAGGTC GCCACGGTAA TCCCATATTA CCAGCGCTTC ATGGATCGCT TTCCCGATGT GTTTGCCCTG GCCGAGGCGC CGCAAGACGA GGTGCTGCAC TTGTGGACGG GCCTGGGCTA CTACGCACGT GCCCGCAACC TCCACAAGGC TGCTCGTGTC GTGGTGGAGG AACATGGCGG CGAATTCCCC GTGGACAGCG TCGAGGCGCT GTCCACGCTT CCCGGCATCG GCCGCTCCAC GGCCGGCGCC ATCATCAGCA TCAGTACCGG CCGGCGCGCG CCGATTCTCG ACGGTAACGT CAAGCGTGTG CTGACTCGCC TGCACGGCGT GGAAGGCTGG CCCGGTCGGC CGGCGGTGGA GCGTGAGCTC TGGGTGCTGG CGGAGCGTTA CACGCCCGAG GAGCGTCTGC CCGATTACAC CCAGGCGATG ATGGATGTGG GAGCGACGCT GTGCACCCGC GGCAAGCCCG CGTGTCTGTT ATGCCCCTTC AACGATGTGT GCGTGGCGCA TGCGCGTGGC GAGGAAACGC GCTTCCCCGA ATCCAAGCCG CGCAAGACAC TGCCCGAACG TACCACGCGG ATGCTGGTAC TGCGTGACCC GGAGGGCCGC GTGTTCCTTC AGCAGCGGCC CGCGAGCGGT CTGTGGGGCG GGCTATGGAG CTTGCCGCAA TTCGATGACG AAGCGGCGTT GCGTGCCTGG CTCGACCAGC GCTTTCCACG GGCTCAACGC GAAGCGGACG GTGCCGCCTT CACGCACACG TTCAGCCACT TCCGGCTGGT GATCACGCCG TCTCCCGCGC GTCTGCACGA GCCATTCAGC AGCGTCGGCG AGACGGGCGA ATTGTGGTAC GACGTGCAGG CCCCGGCTAG TGTAGGATTG GCCGCACCGG TCAAGACGCT GCTCGATCAA GCCGTTACTT GA
|
Protein sequence | MQDIPLTAEA FRQRLFAWFD EHGRKTLPWQ FDKTPYRVWV SEIMLQQTQV ATVIPYYQRF MDRFPDVFAL AEAPQDEVLH LWTGLGYYAR ARNLHKAARV VVEEHGGEFP VDSVEALSTL PGIGRSTAGA IISISTGRRA PILDGNVKRV LTRLHGVEGW PGRPAVEREL WVLAERYTPE ERLPDYTQAM MDVGATLCTR GKPACLLCPF NDVCVAHARG EETRFPESKP RKTLPERTTR MLVLRDPEGR VFLQQRPASG LWGGLWSLPQ FDDEAALRAW LDQRFPRAQR EADGAAFTHT FSHFRLVITP SPARLHEPFS SVGETGELWY DVQAPASVGL AAPVKTLLDQ AVT
|
| |