Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0985 |
Symbol | |
ID | 4068652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1250915 |
End bp | 1251814 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637982992 |
Product | DNA-(apurinic or apyrimidinic site) lyase / formamidopyrimidine-DNA glycosylase |
Protein accession | YP_590062 |
Protein GI | 94968014 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000818098 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCGGAAC TCCCCGACAT CACCGCGTAC CTCACCGCGC TAGAACCGCG CGTGCTTGGC AAAACGCTCC AGCGCGTCCG CATCACCAGC CCCTTCCTCC TCCGCACCAT CGATCCGCCG CTGGAAGCCG TGGAAAGGAA GCGCGTGCTC GCGTTGCGGC GCATCGGCAA ACGCATCGTC TTTGGATTGG AAGACGATCT CTGGCTGGTC TTGCACCTGA TGATCGCCGG CCGCCTTCAC TGGAAAGCCG CGGGCGTCGC GCTGAAAGGC CGAAATTACC TTGCCGCGCT GGATTTCGAC GACGGATCGC TGGTACTCAC CGAGGCCGGC GCGAAACGAC GCGCGTCCTT GCATCTTTTC CGCGGCGAAG CCGCGCTGCG CACCGTGGAT CCCGGTGGAA TTGACGTCTT CACCGCCGAT CTCGACGCAT TTCGCGCGGC GCTCACCATA GAAAACCGCA CCTTGAAGCG CGCCCTCACC GATCCGCGCT TCCTCAGTGG CATCGGCAAC GCCTATTCCG ACGAAATCCT CTGGGCCGCG CAGCTCTCAC CCATCGCGCA AACCCACAAG CTGAAATCCG ACGAATGGCA GCGCCTCTAC GACGCCACCC GCGCGACGCT GCAAACCTGG ATCGACCGCT TCGCCGCCGA GGCCGCGAAG AAATTCCCGG AGAAAGTCAC CGCGTTCCGT CCGGAGATGG CCGTCCACGG CAAGTACGGC GAGCCCTGCC CGCGCTGCGG CGAGAAAGTC CTACGCATCC GCTACGCCGA CAACGAAACC AACTACTGCG CGCGTTGCCA GACCGGCGGC CGCGTCCTCG CCGACCGCAG CCTATCGCGC CTTCTCGGCT CCGATTGGCC GCGCACTCTC GACGAACTCG AGGCCCTCAA GCACCGCTAG
|
Protein sequence | MPELPDITAY LTALEPRVLG KTLQRVRITS PFLLRTIDPP LEAVERKRVL ALRRIGKRIV FGLEDDLWLV LHLMIAGRLH WKAAGVALKG RNYLAALDFD DGSLVLTEAG AKRRASLHLF RGEAALRTVD PGGIDVFTAD LDAFRAALTI ENRTLKRALT DPRFLSGIGN AYSDEILWAA QLSPIAQTHK LKSDEWQRLY DATRATLQTW IDRFAAEAAK KFPEKVTAFR PEMAVHGKYG EPCPRCGEKV LRIRYADNET NYCARCQTGG RVLADRSLSR LLGSDWPRTL DELEALKHR
|
| |