Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3811 |
Symbol | |
ID | 8335164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4312217 |
End bp | 4313407 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644956950 |
Product | regulatory protein DeoR |
Protein accession | YP_003114553 |
Protein GI | 256392989 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.333022 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.698925 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCCGG CCGAGCGCCG CGAAAGCATC CTCGGCGAGC TGCGCACGCG CGGTCCGCTG CTTCGCGTCA CCGACATCGC CCGGACCCTG GGCGTCAGCG CGGTGACCGT GCGCCGGGAC GTCGCCCAAC TCGCGGACGA CGGCGTGATC GAACGGGTCC ACGGCGGCAT CCGGCTGCCC CGGGCCGCCG CCGCGCGGCC GAACAGCGCT CCCGGATCCG GAGCCGCCGC CACCGGAGCG GCCTCCTGGC CGGCCGGCAC CGCATTGCCG GCGACCGGCG TGGTCCTCGA AGACCCGGGT CCGGCGTCCG GCCCCGAAGA AGCGGACCTC CCGCCGGTCG GCATGGTCGT GCCGTCGCTG GACTACTACT GGCCGCAGAT CATCCGCGGC GCACGAGACC TGGCCCGGAC CAACGGCCTG CGCATCGTCC TGCGCGGCTC GTCCTACGTC GACGTCGACG ACGTCCGCCG CCAGACCGAA TGGCTCCTGG CGACGGTGGG CATCCAAGGT CTGCTCATCG CCCCGCCCAC GGACGGCGAG GCGGCAGCCG ACCTCATCGC CTGGCTCTGC GCGCTCCCGA TCCCGGTGGT CTTCATCGAA CGGACCGCGA CGATCGGCCC GTTTCACGAA CACGTGGAAT CCGTGACGAC CGACCACGCC TACGGAGCAG GCCTCGCCGT CCGGCACCTC GCCGTCGAGG GCCACCGCCG CGTCGGATTC CTGGCCTCTG CGACAAGCCC GCACACACGA GTGGTCCGGC AAGGATGGTT CGAAACCGCC ACAGACATCG GGCTCGACGT CGAAGCCGCC CCGGAGGCGA TCACCCCCGA CCACCGGCAG CCGGACTGGA CCGACCACGT GGACGCATTC CTCGACGCGG CCCTGGCCAG CGACACCAAA GCGGTGCTGG TGCACTCCGA CCGCGAAGCG ATATCGCTGG TCGCCCGCTG CCAGGAACGT GGCATCGACG TCCCGGGCGA CCTGGCCGTC GTCGCCTATG ACGACGAGGT CGCCGGCCTC GCCGACCCGG CACTGACCGC GATCCGGCCG GCGAAGCCCG AACTCGGCCG CACCGCGCTC CGCCTGCTGG CCGAACGAAT GCGTGACGGA CCCGCCCGCC CGGTCCACCG GGTGCAGATC AGCCCGCGGC TGGTGATCCG CGACTCCAGC ATCGGGCGGA CGGCGCGCTA G
|
Protein sequence | MLPAERRESI LGELRTRGPL LRVTDIARTL GVSAVTVRRD VAQLADDGVI ERVHGGIRLP RAAAARPNSA PGSGAAATGA ASWPAGTALP ATGVVLEDPG PASGPEEADL PPVGMVVPSL DYYWPQIIRG ARDLARTNGL RIVLRGSSYV DVDDVRRQTE WLLATVGIQG LLIAPPTDGE AAADLIAWLC ALPIPVVFIE RTATIGPFHE HVESVTTDHA YGAGLAVRHL AVEGHRRVGF LASATSPHTR VVRQGWFETA TDIGLDVEAA PEAITPDHRQ PDWTDHVDAF LDAALASDTK AVLVHSDREA ISLVARCQER GIDVPGDLAV VAYDDEVAGL ADPALTAIRP AKPELGRTAL RLLAERMRDG PARPVHRVQI SPRLVIRDSS IGRTAR
|
| |