Gene Clim_1208 details


Gene Information

Locus tag: Clim_1208
Symbol:
ID: 6355309
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1306620
End bp: 1307558
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 45%
IMG OID: 642668824
Product: 8-oxoguanine DNA glycosylase domain protein
Protein accession: YP_001943254
Protein GI: 189346725
COG category: [L] Replication, recombination and repair
COG ID: [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
TIGRFAM ID: [TIGR00588] 8-oxoguanine DNA-glycosylase (ogg)
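
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1306620, 1307558)
print(length_bp)                  # 939
print(protein_length(length_bp))  # 312
```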


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGTTATC AATCATTATT ACTTACTGAT ATTCCTGTTA ATATTGAAGA TTCTCTTTTT 
AGCGGACAAT CTTTTTCATG GAACAGGTTA TCCTTTAAAG ATAATTTTTT TATCTCTGTA
ATCAATAATG TTCCGGTGGT TATAAATCAG ATAAATAATT ATTTTATAAA CATTTACACG
CCAGATAAAT TTATAGGTGG CATACCTGTT TCTGAGGCAC TAAGTGCTTA TTTCACTCTG
GATATTGACA ACGGAAAACT GTTTGATGAT CATTTTATAA AACGGTTTCC AGCAATCGCA
ACACTGTTGC AGGAGTATAT GGGATTGAAG CTGCTCCGTC AGGATCCGTT TGAAACAACC
ATAACCTTCA TGTGTGCTCA GGGAATCGGT ATGGCGCTGA TACGCCGACA GATTGGTATG
CTTTGTGAGA AGTACGGCAC TCCGTGTACC ATCGAGTTGA TGGGGCAAAA ACATCGCATC
TTCCGCTTTC CGAAACCGGA GATGCTTGCT GAAACCTCCG TGTTGTCGCT GCAGGCATGT
ACCAACAACA ATTACCGGAG AGCTCTCAAC ATCAGGCGTG TTGCCGCGGC GGCGGCGGAA
GGAACGCTTG ACTTTACAAT ATCCGGCTCG CAATCGCTCT CCCTTGACAG GATAAGAGCG
ATGCTCTGTG AATATGACGG CATAGGTCCG AAAATCGCTG ATTGCATCGC GCTCTTCAGC
CTCGGTCGTT TCGACGCATT TCCTGTCGAC ACTCATGTAC GTCAGTATCT GGCTGAATGG
TTCGGCATTC GAAGAGCCTC AATGTCGTTG ACTGAAAAAA ACTATCTCAG GCTTCAGGAT
GAGGTACGCA CCATCCTCAG GCCGGAAGTG GCCGGGTATG CCGGTCATCT GCTCTTTCAT
TGCTGGCGCA GAAAAGTCAA GCACCTCAGA ACAGCGTGA
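
A GC content figure such as the one reported for this gene can be recomputed from the nucleotide sequence. The record does not say how the database derives its value, so the sketch below assumes the simplest definition: the share of G and C among all A, C, G, and T bases. The helper name `gc_percent` is hypothetical. Whitespace between the 10-base blocks is ignored.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("sequence contains no nucleotides")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence: 4 of 20 bases are G or C.
print(gc_percent("TTGAGTTATC AATCATTATT"))  # 20.0
```

Passing the full gene sequence to the same function gives a value to compare against the reported GC content.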
 
Protein sequence
MSYQSLLLTD IPVNIEDSLF SGQSFSWNRL SFKDNFFISV INNVPVVINQ INNYFINIYT 
PDKFIGGIPV SEALSAYFTL DIDNGKLFDD HFIKRFPAIA TLLQEYMGLK LLRQDPFETT
ITFMCAQGIG MALIRRQIGM LCEKYGTPCT IELMGQKHRI FRFPKPEMLA ETSVLSLQAC
TNNNYRRALN IRRVAAAAAE GTLDFTISGS QSLSLDRIRA MLCEYDGIGP KIADCIALFS
LGRFDAFPVD THVRQYLAEW FGIRRASMSL TEKNYLRLQD EVRTILRPEV AGYAGHLLFH
CWRRKVKHLR TA
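
The protein sequence corresponds to the gene sequence translated with translation table 11. Note that the gene begins with TTG, which is read as methionine when it serves as the start codon. The sketch below is one way to check this by hand. It assumes the NCBI table 11 codon assignments (the same amino acid mapping as the standard code) and the table 11 set of alternative start codons. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons assumed for table 11; each is read as M in the first position.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene give the first 10 residues of the protein.
print(translate_cds("TTGAGTTATC AATCATTATT ACTTACTGAT"))  # MSYQSLLLTD
# Last 30 bases end in the TGA stop codon.
print(translate_cds("AGAAAAGTCAAGCACCTCAGAACAGCGTGA"))     # RKVKHLRTA
```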