Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4277 |
Symbol | |
ID | 8335631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4851507 |
End bp | 4852454 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644957380 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_003114982 |
Protein GI | 256393418 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.421015 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCTC CGCCGGAAAA CCTTGCAGAT CGTCCAGCGG CGACGGATTC GGACCTGCCG CACGACCTGC TCTCGGAGCT GCTGCGCAGC GTGCGGCTGA CCGGCGAGCG GATCGTCGCG TACACCCCGT CGCCGTTGTT CTCCGTCGAG TTCGCGGACC GAGGCAGCCT GCACATCGTC GAAGAGGGCG AAGTCACGCT GCGGATCGAG GGCTCCCCGC ACGTCGAGCA CCTGTGCGCC GGTGACTTCG TCCTGCTCCC GCGCGGCGAC GCGCACTCCA TCAGCGATGC CGGCCTCGGC GGCAGCCCAG CCGGTGGCGA CGCCGACCGC CGCCCCGCGC GCTGGCTGTG CGGCACGTTC ACCATCGGCG ACCCGCAAGC CAGCCACCTG CTCGGCAGCC TCCCGGCGGT GATCATCCTG CGCGGCTCCG GCGGCCCGGA CCTGGAAGCC CTCGAAGGGC TCGAAGTCGC CCGCCGGATG ATCGTGCTGG AGATGCAGTC GCCGTCGCAG GGCTCTGCGG TGATGGTCGC GCGCATCCTT GACCTGATCT TCATCCAGAT CATGCGCACC TGGGCCGCCG GCGCGGACGT CGAGCCCAAC TGGCTGGCCG GCGCCTTCGA CCCCCAGATC GGCCTGGCGC TGAGCGCCAT CCACCGCGAC CCCGGCCGCG AATGGACGGT CGAGGAGCTG GCGCGCGCCT GCAACCTGTC CCGCTCGTCC TTCGCGGCCC GCTTCGTCGA GCGCGTCGGC AAGCCGCCGG CCACCTACCT CGCGCACGTG CGCCTGGACG CCGCCACCAC CCTGCTCCGC GACACCTTCC TGCCGGTCAC GCAGGTCGCC GAGACCGTCG GCTACGCCTC AGAAGCCGCC TTCAGCCGCG CGTTCAAGAA CCGCTACGGC ACGCCGCCGG CGCGCTGGCG GCGGGACATC CGGTACCCGC TGAGCTGA
|
Protein sequence | MTAPPENLAD RPAATDSDLP HDLLSELLRS VRLTGERIVA YTPSPLFSVE FADRGSLHIV EEGEVTLRIE GSPHVEHLCA GDFVLLPRGD AHSISDAGLG GSPAGGDADR RPARWLCGTF TIGDPQASHL LGSLPAVIIL RGSGGPDLEA LEGLEVARRM IVLEMQSPSQ GSAVMVARIL DLIFIQIMRT WAAGADVEPN WLAGAFDPQI GLALSAIHRD PGREWTVEEL ARACNLSRSS FAARFVERVG KPPATYLAHV RLDAATTLLR DTFLPVTQVA ETVGYASEAA FSRAFKNRYG TPPARWRRDI RYPLS
|
| |