Gene Acid345_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0804 
Symbol 
ID4068683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp996395 
End bp997369 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content63% 
IMG OID637982811 
ProductA/G-specific DNA glycosylase 
Protein accessionYP_589883 
Protein GI94967835 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.591097 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCT CCGCGACGGC CGCTGACTCC GACGCCTCGG ACCTCCAGAA ATCCCTGCTG 
TCCTGGTACC GGCACAGCCG CCGCAATCTG CCATGGCGAC GCACTCGCGA TCCGTACGCG
ATTTGGATCT CGGAGATCAT GCTCCAGCAG ACCCGGGTGG CCGCCGTCCT CGACAAGTAC
GCCCAATTTC TCGCGCAATT TCCGAACGTA AAGGCATTGG CCGACGCCTC GCTCGACGAG
GTTCTGACCG TTTGGAGCGG CCTTGGCTAC TACCGGCGCG CGCGAGCCTT GCACCAGGCA
GCGCAGATGG TGGTCCATCA TCTGCACGGC AAATTTCCGG ATACTGCGGC AGGCTGGCGG
CAACTGCCCG GGATTGGTCG CTACACCAGC GCGGCGATCG CCAGTATCGC GTTTAACGAG
CCGGCGGCAG TGGTGGATGG CAACGTGGAG CGCGTCCTTG AGCGTCTGGA TGGAGAGCGC
CACGAGGGCG AGAGGCTTTG GGAGCGCGCG GAACAATTGC TTGCCAAGCG TGCACCCGGC
GATTGGAACC AGGCGATGAT GGAACTCGGC GCCACGATCT GTTTGCCGCA GAATCCGCAA
TGCCTGGTTT GTCCGGTGAA CGGGCCGTGC AAAACCCGGG GGCCACTCCA GTCTCGACCG
CAACCGAAGC GCAAGCGCGC CGAGCTTTGG TACGCTCTGT ATGCGCGGAA GAACAGCGTG
CTGCTGGTGC AGCGTCCAGC GGACCACTCG TTGATGGCCG GTATGTGGGA GCTTCCTGCG
ATCCGCGCAA ATGGCGTGGA GCCGCTTCAC AAACTGCGCC ACTCGATCAC GGACACTGAT
TACGCGGTCT TCGTGGTCCG CGGCCGTACG GCGAAGAAAC ACGGCAAGTG GGTTACGCAC
GAAGAGGCGC ACCGGATGGC GATCACCGGA TTGACGCGCA AGATTTTGCG GAAACACTTC
GCACGGGAGG CGTGA
 
Protein sequence
MSTSATAADS DASDLQKSLL SWYRHSRRNL PWRRTRDPYA IWISEIMLQQ TRVAAVLDKY 
AQFLAQFPNV KALADASLDE VLTVWSGLGY YRRARALHQA AQMVVHHLHG KFPDTAAGWR
QLPGIGRYTS AAIASIAFNE PAAVVDGNVE RVLERLDGER HEGERLWERA EQLLAKRAPG
DWNQAMMELG ATICLPQNPQ CLVCPVNGPC KTRGPLQSRP QPKRKRAELW YALYARKNSV
LLVQRPADHS LMAGMWELPA IRANGVEPLH KLRHSITDTD YAVFVVRGRT AKKHGKWVTH
EEAHRMAITG LTRKILRKHF AREA