Gene Information |
Locus tag | Caci_4412 |
Symbol | |
ID | 8335766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5007932 |
End bp | 5009671 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644957515 |
Product | PHP domain protein |
Protein accession | YP_003115117 |
Protein GI | 256393553 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1796] DNA polymerase IV (family X) |
TIGRFAM ID | |
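
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values copied from the record above.
length = gene_length_bp(5007932, 5009671)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths match the listed 1740 bp and 579 aa.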
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.103593 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGGC TCAACGACGA GGTCGGGGCA CTCCTGCGGG AGTACGCCGA CCTCATCCTG CTCAGTGGCG GCAACGCTTT CCGGGCACGT TCCTACGAGA AGGCCGCCCG CTCGGTCGGC GGCTATCCGG GGGACCTCGC GCACCTCGAC GAGGCCGGGC TGCGGGCGAT CCCCGGCGTC GGGGACTCGA CCGCCGAGAA GATCAGCGAG TACCTGGCCA GCGGGCGCAT GGAGGCGTTG GAGAGCCTGC GTGCCAAGAT CCCGCCGGGC GTCCGCGAGG TCACGCAGAT CCCCGGCGTC GGTCCGAAGA CCGCCGTGCT GCTCTACCGC AAGCTCGGCA TCAAGTCCGT CGACGAGCTC CGCAAGGCTG CCGATGCGGG CAAGCTGAAA GGGCTGCCAG GACTCGGCGA GAAGACGGTC GAGAACATCC GGCACGGCAT CGAGCAGCTG AGCCAGGCCT CCGGCCGCAC GCTGCTGAGC ATCGCGCTGG ACCTGGCCGA GGACCTGGTC GCGGAGCTCG GCGCCGTGCC GGGGTGCAAG AAGTGCGACT ACGCCGGATC GCTGCGGCGG ATGCGCGAGA CCGTCGGCGA CATCGACATC CTCGCCACCG CCAAGGACTC CGCGCCGCTG ATGGCCGCGC TGCTGGCCCG GCCGGAGGTC GCGGACGTCA TCGGCAGCGG GACGACCAAG ACCTCGATCC GGACTGACAA AGGCTTGCAG GTGGACCTGC GCGTGGTGCA GCCGGCGGAG TGGGGCGCCG CGCTGGTCTA CTTCACCGGC TCAAGGGCAC ACAACATCAA GCTGCGCGGA CGTGCGGTCA AGGAAGGGCT CAAGCTCTCC GAGTACGGGC TCTTCGACGT CGAGAGTGGG AAGAAGCTGG CCTCCCGCAC CGAGAAGGAG GTCTACGCGG CCCTGGGTCT GCCGTGGATC CCGCCGACGC TGCGCGAGGA CCGCGGCGAG ATCGAAGCCG CCGCCGACGG CGCCCTCCCG GACCTGGTCC AGCTCAAGGA CCTGCGCGGC GACCTGCACA CGCACACCGA CCTCACCGAC GGCCTCGCCC CGCTGGAGGT GATGCTCCAG ACCGCCGCCG ACCTCGGCTA CGCGTACTAC GCCGTCACCG ACCACGCGCC GAACCTGGCG ATGCAGCGCA TGACCGACGA CAAGATGCTG GCCCAGCGCG CCCGGGCTCG AGACCTGGAC CGCGAGTACA GCGGAATGCG CGTCCTGCAC GGCACCGAAC TCAACATCGA CGCCGCCGGC GACGTGGACT GGCCCGCGGA CTTCCTCGCC GGCTTCGACC TCTGCGTCGC CTCCATCCAC TCCTCCTTCG GCCTGGACCG CGCCGCCCAG ACCAAACGCC TGATCCGCGC CTGCGAGAAC CCCCACGTGA ACATCATCGG CCACCCCACC ACCCGCCTCC TGGACCGCCG CCCGCCCATC GACGCCGACC TCGACGCCGT CTTCGCCGCA GCCGCCCACA CCGGCACCGC CCTGGAACTC AACGCCTCCC CCCAACGCCT GGACCTCACC GACGACCAAG CCATGGCCGC CCAACGCCAC GGCGTGAAGT TCGCCATCAA CAGCGACGCC CACTCCACCC CCGCCCTGTC CAACCGGCGC TTCGGCATCG CCACCGCCCA ACGCGCCTGG CTCACCAAGG ACGACGTGAT CAACACCTGG AGCCTGACGC GCCTGCGCGC CTTCCTGCGG AAGCCGGCGG TGCGCGGCGG GGGTGGGTGA
|
Protein sequence | MARLNDEVGA LLREYADLIL LSGGNAFRAR SYEKAARSVG GYPGDLAHLD EAGLRAIPGV GDSTAEKISE YLASGRMEAL ESLRAKIPPG VREVTQIPGV GPKTAVLLYR KLGIKSVDEL RKAADAGKLK GLPGLGEKTV ENIRHGIEQL SQASGRTLLS IALDLAEDLV AELGAVPGCK KCDYAGSLRR MRETVGDIDI LATAKDSAPL MAALLARPEV ADVIGSGTTK TSIRTDKGLQ VDLRVVQPAE WGAALVYFTG SRAHNIKLRG RAVKEGLKLS EYGLFDVESG KKLASRTEKE VYAALGLPWI PPTLREDRGE IEAAADGALP DLVQLKDLRG DLHTHTDLTD GLAPLEVMLQ TAADLGYAYY AVTDHAPNLA MQRMTDDKML AQRARARDLD REYSGMRVLH GTELNIDAAG DVDWPADFLA GFDLCVASIH SSFGLDRAAQ TKRLIRACEN PHVNIIGHPT TRLLDRRPPI DADLDAVFAA AAHTGTALEL NASPQRLDLT DDQAMAAQRH GVKFAINSDA HSTPALSNRR FGIATAQRAW LTKDDVINTW SLTRLRAFLR KPAVRGGGG
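
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a string might be cleaned, checked for GC content, and translated. It builds the codon table by hand rather than relying on a library; the amino acid assignments are those of the standard code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle). All names are hypothetical, and only the first 30 nucleotides of the gene are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
excerpt = "ATGGCGCGGC TCAACGACGA GGTCGGGGCA"
print(translate(excerpt))
print(round(gc_fraction(excerpt), 2))
```

For this excerpt the translation is `MARLNDEVGA`, which matches the first block of the protein sequence listed above. Running the same functions on the full gene sequence would be one way to check it against the listed protein sequence and GC content.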