Gene Acid345_0948 details


Gene Information

Locus tag: Acid345_0948
Symbol:
ID: 4070830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1207078
End bp: 1208511
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 62%
IMG OID: 637982955
Product: DNA-3-methyladenine glycosylase II / transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase
Protein accession: YP_590025
Protein GI: 94967977
COG category: [F] Nucleotide transport and metabolism
    [L] Replication, recombination and repair
COG ID: [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
    [COG2169] Adenosine deaminase
TIGRFAM ID:
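
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1207078, 1208511)
print(length, protein_length_aa(length))  # prints: 1434 477
```

Under these assumptions the computed values match the Gene Length (1434 bp) and Protein Length (477 aa) fields.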


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.511809
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCTCG ACCGGCAGAC GTGTTCGCGG GCGCGGTTGT CGCGCGATCC TCGCTTCGAC 
GGCAAGTTTT TCATCGCCGT CTTGACCACG CGGGTGTACT GCCGCTCGAT CTGTTCCGCC
CGCACCTGTC AGGAGAAAAA TGTGCGGTAC TACCCAACCG CCGCCGCGGC GGAAGAGGCA
GGATTTCGTC CGTGCCTGCG CTGCCGGCCG GAGTGCTCGC CCGGAACGCC GGCGTGGCTG
GGAACGCCGA GCACGGTGAC GCGCGGGCTT CGCTTGCTCA ACGAGCGCGC TCTCGATGAC
GGTGGAATCG AATATCTGGC GGAGCGGTTA GGAATTGGCG GACGCCACTT GCGCCGATTG
TTCCTCCAGC ATCTGGGAGC ATCGCCACGC GCGGTGGCTC AAACTCGGCG ATTACATTTC
GCGAAGAAAC TGATTGATGA AACCAGCCTG CCGATGAGCG AGGTCTCGGA GGCATCAGGC
TTCGGCTGCG TACGACGGTT TAATGCAAAC ATCCGCGCGA CCTTCCATCG CACGCCGTCG
CAACTGCGCG CGCTGGCGAG GAATGCGTTT GAAACGCACG AGCACGAATA TGTGTTTCGG
CTGCGGTACC GTCCGCCGTA TCACTGGCTC GGGATGTTGG ATTTTCTGCG TCCGCGCGCG
ACACCCGGCG TGGAGTGCGT AACAGAAGAC GCTTACGCCC GATCGATCTC GTTACACGGT
AAAGAGGGAA GCTTCGAAGT GACCCACGCA CCGGAGCAGC ACTCGCTGGT GTTGCGCGTG
AACTTCGAGG ACTCGTCGGC GTTGTTTCAG ATTGTGGAGC GCGTCCGCGC GATGTTTGAC
CTGAACGCAG ACTGGGGTTC GATTGCCGGG GTGCTCGAGA ACGATCGGCT ACTTCGCGGA
CATCTGAAGG GCGATCCGGG GCGACGTTTG CCTGGGGCTT GGGATGGCTT TGAACTCGCC
GTCCGGGCCG TTCTCGGCCA GCAGATCAGC GTGGCAGCGG CGACCAATCT CGCCGGACAA
ATCGCTCGCA AGTTCGGACG GCCCTTGCGG AAGTCGAATG GGATCTCGCA TCTGTTTCCG
ACTCCCGAAA TTCTGGCAGA CGCTGCTTCC TTGCCGCTGC CGATGAAACG GGCGGAAACC
ATCCGCGCGT TGGCGTGCGC GGTCCGCGAC TGCGAGCTTC AGTTCGATGC AATTACCGAC
GTGCCGCAGT TCTGCGAACA ATTGAAGACC ATCCCGGGAA TCGGCGACTG GACGGCGCAG
TACGTTGCCC TGCGGGCGCT ACGCGAGCCC GACGCGTTTC CCGCAGGCGA CCTCGGCCTG
CAAAAAAGCC TGGGCGTGAA ATCGTCGGCA GAGTTAGAGC GAAGGGCAGA GAACTGGCGG
CCCTGGCGCG GGTACGCTGC CATCTATATG TGGAGCGCTG GACACTCGCT GTAA
 
Protein sequence
MLLDRQTCSR ARLSRDPRFD GKFFIAVLTT RVYCRSICSA RTCQEKNVRY YPTAAAAEEA 
GFRPCLRCRP ECSPGTPAWL GTPSTVTRGL RLLNERALDD GGIEYLAERL GIGGRHLRRL
FLQHLGASPR AVAQTRRLHF AKKLIDETSL PMSEVSEASG FGCVRRFNAN IRATFHRTPS
QLRALARNAF ETHEHEYVFR LRYRPPYHWL GMLDFLRPRA TPGVECVTED AYARSISLHG
KEGSFEVTHA PEQHSLVLRV NFEDSSALFQ IVERVRAMFD LNADWGSIAG VLENDRLLRG
HLKGDPGRRL PGAWDGFELA VRAVLGQQIS VAAATNLAGQ IARKFGRPLR KSNGISHLFP
TPEILADAAS LPLPMKRAET IRALACAVRD CELQFDAITD VPQFCEQLKT IPGIGDWTAQ
YVALRALREP DAFPAGDLGL QKSLGVKSSA ELERRAENWR PWRGYAAIYM WSAGHSL
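
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The sketch below is an illustration only: it strips the spacing, computes GC percentage, and translates a nucleotide string codon by codon. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which start codons it permits). The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical, and alternative start codons are not handled.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, third base each cycle through T, C, A, G.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence in this record.
first_line = "ATGCTCCTCG ACCGGCAGAC GTGTTCGCGG GCGCGGTTGT CGCGCGATCC TCGCTTCGAC"
print(translate(clean_sequence(first_line)))  # prints: MLLDRQTCSRARLSRDPRFD
```

The translated residues match the first twenty residues of the protein sequence listed above. Applying `gc_percent` to the full cleaned gene sequence would give a value to compare against the GC content field.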