Gene Information |
Locus tag | Cag_1017 |
Symbol | |
ID | 3746745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1361039 |
End bp | 1361977 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637773546 |
Product | HhH-GPD |
Protein accession | YP_379322 |
Protein GI | 78188984 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase |
TIGRFAM ID | [TIGR00588] 8-oxoguanine DNA-glycosylase (ogg) |
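
The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch: it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in Protein Length. Neither convention is stated in the record, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 1361039
end_bp = 1361977

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 939 312
```

Under those assumptions the computed values match the listed Gene Length (939 bp) and Protein Length (312 aa).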
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0765881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAAAG AATTTTTAAT AACACATAAT AAAGAGTTAG AAATCGAAAA ATCTCTCTTT AGTGGTCAAT CTTTTCTCTG GAAAAAGCAT CAGAGTAATC TTGATTCTTT TGTTACTGTA ATGGATAAGA GATTGGTTAT TATAAGCCAA CTATCTCCTT ATACCATAAG GGTTCATTGC GATAGTGAGG TGCTTTATGG GCAAAAAATT TCAGCTTTTA TAAGCCACTA CTTTACGCTT GATGTCCCCT TTCAAAAGAT TTTTTCATCC TCATTTAAAA GTAACTACTC GGAGGTATGG CGCTTATTAG ATGGGTATAA ATCCATCGCA CTGTTACGGC AGCATCCGTT TGAAACCCTT ATCTCATTTA TGTGTGCTCA AGGGATTGGT ATGCGATTAA TTCGCCAGCA AATCAATCGT TTATGTGAAC GGTATGGAGA GTTTTATGAG GCAGAAATGG AGGGTGAAAT GTTGTGCTTT TCGGGCTTTC CTGCGCCCGA GCAACTTGCT TGCTTGAACG CTGAGGAGCT GAGTTACTGT ACCAATAACA ATCGTGAAAG AGCGGCAAAC ATTATTGCGG TTGCGCGTAA GGTTGTGGAA GGTAGATTAG ATTTGTCGAG TTTGTCATAT CCAAACATGG CGTTTGAGGA GGTGCAAGCT CGCTTGACGC AAGAGCGTGG CATTGGGTTA AAAATTGCCG ATTGCGTTGC TTTGTTTGGT TTGGGATATT TTGAGGCATT CCCTATTGAT ACGCATGTGC ATCAATTTAT GGCTCAGTGG TTTAAAGTGC CTGCTGCCTC GCGTTCACTA ACCCCCGCCA CCTATCGGCA GTTAACCCTC GAAGCGCGTG AAATTCTTGG CAGCCATTAT ACGGGCTATG CAGCGCACCT GCTTTTTCAT TGTTGGCGTT GTGAGGTTAA AAAGCTTTGC TGGTTTTAA
|
Protein sequence | MLKEFLITHN KELEIEKSLF SGQSFLWKKH QSNLDSFVTV MDKRLVIISQ LSPYTIRVHC DSEVLYGQKI SAFISHYFTL DVPFQKIFSS SFKSNYSEVW RLLDGYKSIA LLRQHPFETL ISFMCAQGIG MRLIRQQINR LCERYGEFYE AEMEGEMLCF SGFPAPEQLA CLNAEELSYC TNNNRERAAN IIAVARKVVE GRLDLSSLSY PNMAFEEVQA RLTQERGIGL KIADCVALFG LGYFEAFPID THVHQFMAQW FKVPAASRSL TPATYRQLTL EAREILGSHY TGYAAHLLFH CWRCEVKKLC WF
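
To relate the two sequences above, the following sketch strips the display spacing from a nucleotide string, computes its GC fraction, and translates it codon by codon. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in which start codons are permitted). The helper names clean, gc_fraction, and translate are hypothetical, and only the first 30 bases of the gene are embedded here.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses these same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the 10-base display spacing and normalise case."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks of the Gene sequence field.
first_30 = "ATGTTAAAAG AATTTTTAAT AACACATAAT"
print(translate(first_30))  # MLKEFLITHN
```

Applied to the full Gene sequence string, translate should reproduce the Protein sequence field if the record is internally consistent, and gc_fraction can be compared against the listed GC content.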