Gene Cag_1017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1017 
Symbol 
ID3746745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1361039 
End bp1361977 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content42% 
IMG OID637773546 
ProductHhH-GPD 
Protein accessionYP_379322 
Protein GI78188984 
COG category[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase 
TIGRFAM ID[TIGR00588] 8-oxoguanine DNA-glycosylase (ogg) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0765881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAG AATTTTTAAT AACACATAAT AAAGAGTTAG AAATCGAAAA ATCTCTCTTT 
AGTGGTCAAT CTTTTCTCTG GAAAAAGCAT CAGAGTAATC TTGATTCTTT TGTTACTGTA
ATGGATAAGA GATTGGTTAT TATAAGCCAA CTATCTCCTT ATACCATAAG GGTTCATTGC
GATAGTGAGG TGCTTTATGG GCAAAAAATT TCAGCTTTTA TAAGCCACTA CTTTACGCTT
GATGTCCCCT TTCAAAAGAT TTTTTCATCC TCATTTAAAA GTAACTACTC GGAGGTATGG
CGCTTATTAG ATGGGTATAA ATCCATCGCA CTGTTACGGC AGCATCCGTT TGAAACCCTT
ATCTCATTTA TGTGTGCTCA AGGGATTGGT ATGCGATTAA TTCGCCAGCA AATCAATCGT
TTATGTGAAC GGTATGGAGA GTTTTATGAG GCAGAAATGG AGGGTGAAAT GTTGTGCTTT
TCGGGCTTTC CTGCGCCCGA GCAACTTGCT TGCTTGAACG CTGAGGAGCT GAGTTACTGT
ACCAATAACA ATCGTGAAAG AGCGGCAAAC ATTATTGCGG TTGCGCGTAA GGTTGTGGAA
GGTAGATTAG ATTTGTCGAG TTTGTCATAT CCAAACATGG CGTTTGAGGA GGTGCAAGCT
CGCTTGACGC AAGAGCGTGG CATTGGGTTA AAAATTGCCG ATTGCGTTGC TTTGTTTGGT
TTGGGATATT TTGAGGCATT CCCTATTGAT ACGCATGTGC ATCAATTTAT GGCTCAGTGG
TTTAAAGTGC CTGCTGCCTC GCGTTCACTA ACCCCCGCCA CCTATCGGCA GTTAACCCTC
GAAGCGCGTG AAATTCTTGG CAGCCATTAT ACGGGCTATG CAGCGCACCT GCTTTTTCAT
TGTTGGCGTT GTGAGGTTAA AAAGCTTTGC TGGTTTTAA
 
Protein sequence
MLKEFLITHN KELEIEKSLF SGQSFLWKKH QSNLDSFVTV MDKRLVIISQ LSPYTIRVHC 
DSEVLYGQKI SAFISHYFTL DVPFQKIFSS SFKSNYSEVW RLLDGYKSIA LLRQHPFETL
ISFMCAQGIG MRLIRQQINR LCERYGEFYE AEMEGEMLCF SGFPAPEQLA CLNAEELSYC
TNNNRERAAN IIAVARKVVE GRLDLSSLSY PNMAFEEVQA RLTQERGIGL KIADCVALFG
LGYFEAFPID THVHQFMAQW FKVPAASRSL TPATYRQLTL EAREILGSHY TGYAAHLLFH
CWRCEVKKLC WF