Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0804 |
Symbol | |
ID | 4068683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 996395 |
End bp | 997369 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637982811 |
Product | A/G-specific DNA glycosylase |
Protein accession | YP_589883 |
Protein GI | 94967835 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.591097 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCT CCGCGACGGC CGCTGACTCC GACGCCTCGG ACCTCCAGAA ATCCCTGCTG TCCTGGTACC GGCACAGCCG CCGCAATCTG CCATGGCGAC GCACTCGCGA TCCGTACGCG ATTTGGATCT CGGAGATCAT GCTCCAGCAG ACCCGGGTGG CCGCCGTCCT CGACAAGTAC GCCCAATTTC TCGCGCAATT TCCGAACGTA AAGGCATTGG CCGACGCCTC GCTCGACGAG GTTCTGACCG TTTGGAGCGG CCTTGGCTAC TACCGGCGCG CGCGAGCCTT GCACCAGGCA GCGCAGATGG TGGTCCATCA TCTGCACGGC AAATTTCCGG ATACTGCGGC AGGCTGGCGG CAACTGCCCG GGATTGGTCG CTACACCAGC GCGGCGATCG CCAGTATCGC GTTTAACGAG CCGGCGGCAG TGGTGGATGG CAACGTGGAG CGCGTCCTTG AGCGTCTGGA TGGAGAGCGC CACGAGGGCG AGAGGCTTTG GGAGCGCGCG GAACAATTGC TTGCCAAGCG TGCACCCGGC GATTGGAACC AGGCGATGAT GGAACTCGGC GCCACGATCT GTTTGCCGCA GAATCCGCAA TGCCTGGTTT GTCCGGTGAA CGGGCCGTGC AAAACCCGGG GGCCACTCCA GTCTCGACCG CAACCGAAGC GCAAGCGCGC CGAGCTTTGG TACGCTCTGT ATGCGCGGAA GAACAGCGTG CTGCTGGTGC AGCGTCCAGC GGACCACTCG TTGATGGCCG GTATGTGGGA GCTTCCTGCG ATCCGCGCAA ATGGCGTGGA GCCGCTTCAC AAACTGCGCC ACTCGATCAC GGACACTGAT TACGCGGTCT TCGTGGTCCG CGGCCGTACG GCGAAGAAAC ACGGCAAGTG GGTTACGCAC GAAGAGGCGC ACCGGATGGC GATCACCGGA TTGACGCGCA AGATTTTGCG GAAACACTTC GCACGGGAGG CGTGA
|
Protein sequence | MSTSATAADS DASDLQKSLL SWYRHSRRNL PWRRTRDPYA IWISEIMLQQ TRVAAVLDKY AQFLAQFPNV KALADASLDE VLTVWSGLGY YRRARALHQA AQMVVHHLHG KFPDTAAGWR QLPGIGRYTS AAIASIAFNE PAAVVDGNVE RVLERLDGER HEGERLWERA EQLLAKRAPG DWNQAMMELG ATICLPQNPQ CLVCPVNGPC KTRGPLQSRP QPKRKRAELW YALYARKNSV LLVQRPADHS LMAGMWELPA IRANGVEPLH KLRHSITDTD YAVFVVRGRT AKKHGKWVTH EEAHRMAITG LTRKILRKHF AREA
|
| |