Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2531 |
Symbol | |
ID | 7269377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 3083494 |
End bp | 3084420 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643567357 |
Product | HhH-GPD family protein |
Protein accession | YP_002463838 |
Protein GI | 219849405 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0428971 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGAGC AAATCCGTTC CGATCTGCTC CACTGGTTTC ATTCCTACGC TCGTGACTTG CCTTGGCGAC GAACTCGCGA TCCGTATGCG ATTATGGTTG CCGAGGTGAT GCTGCAACAG ACGCAAGTCG ACCGGGTAAT TCCCAAGTAT CAGGCATTTC TCAGCGCGTT TCCTACCGTA GCCGCCCTGG CCGCCGCGCC GACCGCCGAA GTGATCCGGT TGTGGGCTGG GCTGGGCTAT AATCGGCGTG CGGTGAATCT GCAACGGGCC GCTCAAGTGA TTATGGAGCA GTACGGGGGG CAGGTGCCGT CGGCGGTGGC CGATCTGCGC GCATTGCCCG GTATTGGCCC TTATACTGCC GGTGCGATAG CCTGTTTTGC GTTTGAGCAG GATGTAGCCT TCCTCGATAC CAACATTCGC CGGGTGGTGC GACGATTGTG CGTTGGCCCC GATGACCGGT CCACGCCCTC CGATGGCGAG TTGTTGGCAC ACGCCACGGC TCTGATTCCG CCGGGACAGG GTTGGACGTG GAATCAGGCG ATTATGGAGT TGGGTGCTTT AATCTGTACG TCGACGAACC CGGCGTGTTG GCGCTGCCCA CTCCGTAGCT ATTGCCGGTC CTATGCCACT GCGGTTGCCG ACGATACGGC GCTGGCTGCG ACAATGATGT CACCGCCGCT CAAGCGAGTC GCCGAATCGC GCACTGCTGA GCCATTCGTT GGGTCACGCC GTTGGTACCG CGGCAAGATC GTCGCCGTGT TGCGCGAGCT GCCCGCCGGT GAAGTATTAC CGTTGCCGAT CTTAGGAGAA CGGATACGCG CCGACTTTAC CCCCGATCAC GAACCCTGGT TACAAGGACT GGTAGCCGAT CTGGCCCGCG ATGGGTTGCT GGTGATGACC GAGGAAGGGG TACGCTTGCC GCAGTGA
|
Protein sequence | MIEQIRSDLL HWFHSYARDL PWRRTRDPYA IMVAEVMLQQ TQVDRVIPKY QAFLSAFPTV AALAAAPTAE VIRLWAGLGY NRRAVNLQRA AQVIMEQYGG QVPSAVADLR ALPGIGPYTA GAIACFAFEQ DVAFLDTNIR RVVRRLCVGP DDRSTPSDGE LLAHATALIP PGQGWTWNQA IMELGALICT STNPACWRCP LRSYCRSYAT AVADDTALAA TMMSPPLKRV AESRTAEPFV GSRRWYRGKI VAVLRELPAG EVLPLPILGE RIRADFTPDH EPWLQGLVAD LARDGLLVMT EEGVRLPQ
|
| |