Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1405 |
Symbol | |
ID | 6375083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1521044 |
End bp | 1521985 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 642683900 |
Product | 8-oxoguanine DNA glycosylase domain protein |
Protein accession | YP_001959814 |
Protein GI | 189500344 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase |
TIGRFAM ID | [TIGR00588] 8-oxoguanine DNA-glycosylase (ogg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00273157 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAAGAT TAACACAGAG CACTACTCAC CTATACGGAT TTGATATTGA GAATACTATT TTTAGCGGGC AGTGCTTCAG GTGGAAAGTA ACTGATAACA CTCCTACAGT ACTGTCAGGG GTTATTGGGT CTGAAATGTT TATTATTGAC AGATCAAACC CTGTGAAATA CACCATATCA AGCACAATAA AATTCCGGAA CATCAACGCA TTCAATGAAT TTAACCGAAA GTATTTCAGC CTTGATGTAG ACGTCAACTC GCTATTCCCT GAAGATTTCA GAAAACGTTA TCCTGAAGTA TGGGACCGTA TACAACCCTA TACAGACATT AAAATCCTGA GACAGGACCC GTTTGAAACC CTTATCACCT TTATGTGTGC CCAGGGCCTT GGCATGCACC TCATCAGAAA GCAGGTAACG TACCTTGCGC AGGAATACGG CACCAGACAT ACCATACGCC TGAACGATGT ACCCTACACC TATTTCTCCT TCCCCACCCC GGAAGCCCTT GCTTCAACCA GTCCGGAGTC TCTCAGGCTC TGTACGAACA ACAATTGTAT CAGGGCGGAT AACATTATAC AAGCCGCTCA GGCCGTTGTT TCAGGGAAAC TTGATCTCCA GGCCCTGAAA GATCCTGCTA TGCCCCTTGA AAACGTCAGA AAAACACTTT GCAGCCAACC GGGAATCGGC TTCAAGATCG CTGACTGTGT CATGCTTTTC GGGCTTCACC GTTTTGCCGC TTTTCCGATA GACAGACATG TTCATCAATA TCTTGCCCAT TGGTTTTCCA TAGGTAACCC GCTCCAATCA TTGTCTCAAA AACACTATCT TTCTCTTCAG GAACAGGCTT ATCGTATACT GAAACCGGAA CTTGCCGGTT TTGCAGGCCA TATACTCTTT CATTGCTGGA GAAAGGAGAT AAAGCACCTG CAGTCCTATT AA
|
Protein sequence | MERLTQSTTH LYGFDIENTI FSGQCFRWKV TDNTPTVLSG VIGSEMFIID RSNPVKYTIS STIKFRNINA FNEFNRKYFS LDVDVNSLFP EDFRKRYPEV WDRIQPYTDI KILRQDPFET LITFMCAQGL GMHLIRKQVT YLAQEYGTRH TIRLNDVPYT YFSFPTPEAL ASTSPESLRL CTNNNCIRAD NIIQAAQAVV SGKLDLQALK DPAMPLENVR KTLCSQPGIG FKIADCVMLF GLHRFAAFPI DRHVHQYLAH WFSIGNPLQS LSQKHYLSLQ EQAYRILKPE LAGFAGHILF HCWRKEIKHL QSY
|
| |