Gene Information |
Locus tag | Caci_3688 |
Symbol | |
ID | 8335041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4130002 |
End bp | 4131048 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644956828 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003114431 |
Protein GI | 256392867 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.240093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGATTT CCTTCGACCC CGATGATGTC GGGTCGGCGC CGTCTTCGCG GCGTGACGGG GGAGATGGTG CCTTTGACGG TCAATCGACG TTCGTCCTTC TCGGCCCGTT GAGTATCGCG GCCCAGGGGA CCATCGCCCC GCTCCAGCCC TCCCGGCCGG CGACGCTGCT CGCCACGCTG CTGCTGCACC CCAACTCGGT GGTGTCGATC GGCGCGCTGG TGCGGGCGGT GTGGGACGAG GAACCGCCGG TCAGCGCCAA GGCGGCACTG CACACCTGTG TCCAGAGGCT GCGCAGGCTG TTCGCCAAGT ACGGAGTGCC CGGCGGCGAG ATCGAGGCGG TGTCGGGCGG GTACCGCATC GCGGCTCAGG CCGAGACGCT GGATCTGATG CGGTTCCGGG GATTGGCCGC CCGGGCCCAC GCCGCGGCTG ATCCGCAGGC TGAGTTGGCG CTGCTGCGCG AGGCGCTCGC GCTGTGGCGC GGTCCGGCCC TGAGCAACGT CCGCTCGCAG GTGCTGCACC GGGAGGAGGT TCCGGCGCTC GACGAGGAGC GGCTGAGCGT CGTGGAGCGC GTCTTCGATC TCGAGATCGC GCTGGACCGG CGGCGTGAAG TACTTCCGGA GCTGTTCACC GCGACGCGGG CGCATCCCAC GCACGAGCAC TTCTGGGAGC AGCTGATCGA GTCGCTCTAC CGCACGGGGC GCCGGGCTGA GGCGCTGGGG GAGTACCGCC GGATCAAGCG CTATCTGCGC GAGCAGCTCG GCGTCGATCC CGGCGCGGCT CTGCAGCAGC TGGAGCTGAT GGTGTTGCGC GGCAACGGAA GTGTCGTGGA GAGGGCCGCT CGGCCTGAGA CCGGCGCGGT TCCGCTCCGG CTGCTCACCG AGGCGCAGAT CCTCGACCGG CTCCAAAGCG CGGGTCTGGT GCGCAAACAG GCGCGCGGCT ATCAGATGCA CGAGCTCTTA TACGTGTTGA CCAGGGATGC AGCCGTTGTG GACCACGGCG CGCCGGAGCC CGGCGCCCTC CTGAGCGGAA AGGACGACGT GGATTAA
|
Protein sequence | MVISFDPDDV GSAPSSRRDG GDGAFDGQST FVLLGPLSIA AQGTIAPLQP SRPATLLATL LLHPNSVVSI GALVRAVWDE EPPVSAKAAL HTCVQRLRRL FAKYGVPGGE IEAVSGGYRI AAQAETLDLM RFRGLAARAH AAADPQAELA LLREALALWR GPALSNVRSQ VLHREEVPAL DEERLSVVER VFDLEIALDR RREVLPELFT ATRAHPTHEH FWEQLIESLY RTGRRAEALG EYRRIKRYLR EQLGVDPGAA LQQLELMVLR GNGSVVERAA RPETGAVPLR LLTEAQILDR LQSAGLVRKQ ARGYQMHELL YVLTRDAAVV DHGAPEPGAL LSGKDDVD
|
| |
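The coordinate and length fields in this record can be cross-checked against each other with a little arithmetic. The sketch below is illustrative only and is not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The helper names (feature_length, translate, gc_fraction) are hypothetical. The codon table is the usual NCBI layout for translation table 11, whose sense-codon assignments match the standard code. Only the first and last few codons of the gene sequence are reproduced here.

```python
from itertools import product

# Values copied from the record above.
START_BP, END_BP = 4130002, 4131048
GENE_LENGTH_BP = 1047
PROTEIN_LENGTH_AA = 348

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in tens and upper-case the result."""
    return "".join(seq.split()).upper()


def feature_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


if __name__ == "__main__":
    # Gene length follows from the coordinates.
    print(feature_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Protein length is the codon count minus the terminal stop codon.
    print(GENE_LENGTH_BP // 3 - 1 == PROTEIN_LENGTH_AA)
    # First 30 bases and last 27 bases of the gene sequence shown above.
    print(translate("ATGGTGATTT CCTTCGACCC CGATGATGTC"))
    print(translate("CTGAGCGGAA AGGACGACGT GGATTAA"))
```

Passing the full gene sequence to gc_fraction gives a value that can be compared with the GC content field, and passing it to translate should reproduce the protein sequence followed by a single '*'.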