Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4501 |
Symbol | |
ID | 8335855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5127749 |
End bp | 5128615 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644957603 |
Product | DNA-(apurinic or apyrimidinic site) lyase |
Protein accession | YP_003115205 |
Protein GI | 256393641 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.352275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAT TGCCGGAGGT CCAGGCGCTG GCCGCGTTCC TGGACGAGCA CCTCGCCGGA CACGCGGTCG CGGCAGCCAC GCCGGTCGCG ATCCAGGCGC TCAAGACCTA CGACCCGCCG CTGTCGGCGC TGGAGGGCCA GGTGGTCTCC GGGGTGACGC GGCACGGGAA GTTCCTCGAC TTCAGCGTCG GGGACGTCCA CCTGGTGCTG CACCTGGCGC GCGCCGGCTG GGTGCGCTGG CAAGAGGAGC TGCCGACCGC GCCGCCGCGT CCGGGCAAGG GACCGCTGGC ACTGCGGGTG CGTATGGAGG AGCCGGCGGG CAGCGGGATC GACGTGACCG AGTACGGCAC GAAGAAGGGC CTGGCGGTCT ACGTCGTGCG CGATCCGGCT GAGGTGCCGG GCATCGCGCG CCTGGGCATC GATCCGCTGT CGGCGGAGTT CACCGCCGAG GTGCTGGCCG GGCTGCTGGA CGGCGAGCGC CGCCAGATCA AGGGGTTCCT GCGCGACCAG AGCGTGCTGG CGGGCATCGG CAACGCCTAC TCCGACGAGA TCCTGCACGC CGCGCGCATG TCCCCCTACA AGCTGGCCGC CAAGCTGACG CCGGATGAGG TCGCCGACCT GTACCAGGTG ATCATCGGCA CGCTCACCGA CGCCGTCGAG CGCTCGCGCG GGCTGCCGAT GAAGGACCTG AAGTCGGAGA AGAAGTCGGG CTTGCGGGTG CACGGCCGGA CCGGCGAGAA GTGCCCGGTG TGCGGCGACA CGATTCGCGA GGTGTCCTTC GCCGACTCGG CGCTGCAGTA CTGCCCGACG TGCCAGACCG GCGGCAAGCC GCTGGCCGAC CGGCGGATGT CGCGGTTGTT GAAGTAG
|
Protein sequence | MPELPEVQAL AAFLDEHLAG HAVAAATPVA IQALKTYDPP LSALEGQVVS GVTRHGKFLD FSVGDVHLVL HLARAGWVRW QEELPTAPPR PGKGPLALRV RMEEPAGSGI DVTEYGTKKG LAVYVVRDPA EVPGIARLGI DPLSAEFTAE VLAGLLDGER RQIKGFLRDQ SVLAGIGNAY SDEILHAARM SPYKLAAKLT PDEVADLYQV IIGTLTDAVE RSRGLPMKDL KSEKKSGLRV HGRTGEKCPV CGDTIREVSF ADSALQYCPT CQTGGKPLAD RRMSRLLK
|
| |