Gene Information |
Locus tag | Cwoe_3605 |
Symbol | |
ID | 8734057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 3834371 |
End bp | 3835240 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646504224 |
Product | apurinic endonuclease Apn1 |
Protein accession | YP_003395397 |
Protein GI | 284045057 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0648] Endonuclease IV |
TIGRFAM ID | [TIGR00587] apurinic endonuclease (APN1) |
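The coordinate and length fields above can be cross-checked against each other with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


# Values copied from the record above.
start_bp, end_bp = 3834371, 3835240
length = gene_length(start_bp, end_bp)
print(length)                  # 870, matching "Gene Length"
print(protein_length(length))  # 289, matching "Protein Length"
```

Under those assumptions, the 870 bp span gives 290 codons, and removing the stop codon leaves the 289 residues reported for the protein.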
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.637243 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAATCG GCGCCCACGT GTCACAGGCC GGAGGGCTGC CGAACGCGAT CGAGCGCGGC GTCGAGAAGG GCTGTACGGC GATACAGATC TTCAACCAGT CGCCGCGGAT GTGGCGCCCG ACGCAGTACT CCGAGGACGA CTTCGCCGCG TTCCGCGACG CGATGGCGGG CAGCCCGATC AGAGCGGTGA TGATCCACGC CGTCTACCTG ATCAACTGCG CCAGCGAGGA CCCGGAGATC CGCACGAAGT CGCTCGCCTC GCTGACGCAG TCGCTGCGCG TCGGCGACGC GATCGGCGCG AGCGTCGTGC TCCACCCCGG CTCGGCGCTC AGAGGCCACG TCGGCGAGGC GATCGCGCGC GCCGGCGGCG TCTTCAGAGA GGCGCTGGCG GAGAGCGAGT CGAGCGCGCT GCTGCTGGAG GACACCGCGG GCGCGGGCGG CACGCTGGGG CGCTCGTTCG AGGAGCTGAG AGAGCTGATC GACGCGGCCG GCGGCGGTGA GCGGCTCGGC GTCTGCCTCG ACTCCTGCCA CCTGCTCGCG TCCGGCTACG ACGTCCGCAC GATCGACGGG CTGAGCGAGA CGCTCGACCG CTTCGACGCG GCCGTCGGGC TCGGCCGGCT CGGCGGGCTG CACCTCAACG ACTCCGTCAA CGCGCTCGGC ACCAACCGCG ACCGCCACGC CAACCTCGGC GAGGGCGAGC TGGGCGAGAC GGGCTGCATG GCGTTCCTGT CCGAGCCGCG CTTCGAGAAC CTCCCCGTCG TGCTGGAGAC CCCCGGCCCG GACAAGAGAG GGACATCGGC CGAGGAGATC GTCTACGCGA AGAGACTGCG CAGAAGAGGA CTGCGCTTGC GCAAGAAGGC AGCGGTGTAG
|
Protein sequence | MLIGAHVSQA GGLPNAIERG VEKGCTAIQI FNQSPRMWRP TQYSEDDFAA FRDAMAGSPI RAVMIHAVYL INCASEDPEI RTKSLASLTQ SLRVGDAIGA SVVLHPGSAL RGHVGEAIAR AGGVFREALA ESESSALLLE DTAGAGGTLG RSFEELRELI DAAGGGERLG VCLDSCHLLA SGYDVRTIDG LSETLDRFDA AVGLGRLGGL HLNDSVNALG TNRDRHANLG EGELGETGCM AFLSEPRFEN LPVVLETPGP DKRGTSAEEI VYAKRLRRRG LRLRKKAAV
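The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It uses only the first 30 nucleotides of the gene sequence to stay short. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, not part of any library.

```python
from itertools import product

# Codon table built from the conventional compact encoding (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGCTAATCG GCGCCCACGT GTCACAGGCC"
print(translate(prefix))  # MLIGAHVSQA, the first block of the protein sequence
```

Passing the full gene sequence to `gc_fraction` and `translate` in the same way would allow the GC content and protein sequence fields to be compared against the nucleotide sequence.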
|