Gene Information |
Locus tag | TM1040_2645 |
Symbol | |
ID | 4077948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2778782 |
End bp | 2779843 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638007969 |
Product | A/G-specific DNA-adenine glycosylase |
Protein accession | YP_614639 |
Protein GI | 99082485 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
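
The coordinate, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2778782
END_BP = 2779843


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1062
print(protein_length(length_bp))  # 353
```

Both results agree with the Gene Length (1062 bp) and Protein Length (353 aa) fields in the record.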
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGACC ACCCCGTGTC TGCCGAGCAA CTGAGCCAGG ATCTCTTGGT GTGGTACGAT ACCCACGCCC GCGAGATGCC ATGGCGCGTC GGCCCAGCCG CGCGCGCCTC CGGTGTGCGC CCCGATCCCT ACCGGATCTG GCTCAGCGAG GTCATGCTGC AACAAACCAC CGTCGCCGCC GTCAAAGACT ATTTCGAACG GTTCACCCGG CGCTGGCCCC GCGTCGGCGA CCTGGCCGCT GCCGAGGACG GCGATGTTAT GGCAGAGTGG GCGGGGCTCG GCTATTACGC CCGCGCGCGC AACCTGTTGA AATGTGCCCG CGTGGTGGCG GAGGAGTTTG AGGGTGTGTT TCCAGATGCC TATGAGGGGC TGATCGCACT GCCCGGCATC GGGCCTTATA CGGCAGCGGC GATCTCTGCC ATTGCCTTTG ACCGGCCCGA GACGGTGCTT GATGGCAATG TGGAACGGGT GATGGCGCGG CTTCATGACG AACACGAACC GCTTCCGGCA GTGAAACCCG TGCTCAAGGC GCATGCCGCC CATTTGACGC CCAGCGCGCG GCCCGGCGAC TATGCGCAAG CGGTGATGGA TCTCGGGGCC ACCATCTGCA CGCCCAAATC GCCCGCGTGC GGCATCTGCC CCTGGCGCGA TCCCTGCCGC GCCCGCGTCA AGGGCACCGC GCCCGAACTG CCCAAGAAAA CGCCCAAGAA ACCCAAACCC ACCCGCTATG GGTTTGTCTA TCTGGCGCGC AGTGCCGAAG GCGACTGGCT CCTCGAGCGC CGCCCCGACA AGGGGCTTTT GGGGGGGATG CTGGGCTGGC CCGGTTCAGA GTGGAACGAC GCTCCCACCG AGACGCCGCC CTTTGATGCC GACTGGCAGG ATCTGGGCGC CGAGGTCCGT CACACCTTCA CCCATTTCCA CCTGATCCTG CAGGTGCGCA GCGCCGAGCT GCCCGCGGAT TTTGAGCCCC GCGCAGGTCA GGAACTGGTC CGACGCCACG ACTTCCGCCC CTCCAGCCTG CCCACCGTCA TGCGCAAGGC CTTTGATCTG ACGCACCGTT AA
|
Protein sequence | MRDHPVSAEQ LSQDLLVWYD THAREMPWRV GPAARASGVR PDPYRIWLSE VMLQQTTVAA VKDYFERFTR RWPRVGDLAA AEDGDVMAEW AGLGYYARAR NLLKCARVVA EEFEGVFPDA YEGLIALPGI GPYTAAAISA IAFDRPETVL DGNVERVMAR LHDEHEPLPA VKPVLKAHAA HLTPSARPGD YAQAVMDLGA TICTPKSPAC GICPWRDPCR ARVKGTAPEL PKKTPKKPKP TRYGFVYLAR SAEGDWLLER RPDKGLLGGM LGWPGSEWND APTETPPFDA DWQDLGAEVR HTFTHFHLIL QVRSAELPAD FEPRAGQELV RRHDFRPSSL PTVMRKAFDL THR
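
The gene and protein sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate codons. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), so a standard codon table is used here. The helper names are hypothetical, and only short excerpts from the start and end of the gene sequence are used as input.

```python
# Standard codon table built in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and last two blocks of the gene sequence shown above.
head = "ATGCGTGACC ACCCCGTGTC TGCCGAGCAA"
tail = "ACGCACCGTT AA"

print(translate(head))            # MRDHPVSAEQ
print(translate(tail))            # THR
print(gc_fraction("ATGCGTGACC"))  # 0.6
```

The translated excerpts match the first block (MRDHPVSAEQ) and the last block (THR) of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.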