Gene Information |
Locus tag | Caci_5039 |
Symbol | |
ID | 8336393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5776269 |
End bp | 5779250 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644958138 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003115740 |
Protein GI | 256394176 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGTTCG GGGTGCTGGG ACCGGTGAGG GCCGGCGGCG GCCAAGAGGT CCCGGCGCTG ACGCCGATGG TTCGGAGTCT GCTGGCCGTG TTGTTGGTGG AGGCCGGACG GCCGGTGTCC GAAGCCCGGC TGACCGAGGC GTTGTGGGGC GGCAGTCCGC CGCAGACGTC GAAAGCCGCC CTCCAGAACC ATGTGCTCGG CTTGCGGCGC GCGCTCGGTG TGGACGAAGC CGCGCGCGTC CGCAGAACCT ACGACGGCTA CCTGATCGAG GTCGAGGCCG GCGAACTGGA CCTGCGGGAG TTCGAGCAGC TCTCCGGCGA GGGATCCGAG GACCTCGTGG CGGGCCGATG GCAGGCGGCG GCTGACGCCC TGACCGGGGC GTTGGCGCTC TGGCGCGGCG ATCCGCTCGC CGACGCGCTC CCCGCGACGC GGGACGCGGT GGACGTCGGC CGGATCCACG AGGCCCGGCT TCAGACCGTC GAGCAGCTGG CCCGGGCCCG ACTCGAACTC GGACACTACG ACCGGGTCAT CGGCGAGATC GAGCCGCTGC TGCGGGAGCA TCCCTGGCGG GAGGCCATGC ACGGGCAGCT GATGCACGCG CTGCACGGCG CCGGACGGCA AGCCGAAGCG CTCACCGTCT ACCAGCGGCT GCGCACCGGC CTGGTCACCG AACTCGGCGT CGAGCCCTCG GCCGGGCTGG CGGACCTGCA CCGGCGGATC CTGGCCGGCG ATCCCGCGCT GATCAGGACG ATCACGCCGG CCGGCGCTAC TCAGGGTCCC AGTCGCGTCA TCGGCGCGCC TGACGCCGGT CAGTCCGGTC ACTCCCGTGA GTCCGGTGAG TCCGACCACA CAGCCCACAC AGCCCAAGGC TCCGGCGCGG ATTCCGCCCA CGGCGCCGAC CACCGCCGGC CGCAGAACCC TGATCCGCGC GCCGCCGACG CCGCCAACCC CGTCACCCCC GCCAACGCCG TCATACCCCG CCAACTGCCG GCGAAGATCA GCCACTTCAC CGGACGCACC GCCGCCCTGG CGGTGCTGGA GGAGTTCCTC GCGGCGGCGG GCGAGGGCGA CCAGCCGCTG ATCGCGCTCG TCGGCACCGC CGGCGTGGGC AAGACGGCGC TGGCGGTGCA CTGGGCCCAC CGGATCGCCT ACCGATACCC GGACGGCTGC CTTTATGTGA ACCTGCGCGG CTTCGACCCC TCACAGGAGC CTGTGACCCC CGAGCAGGCG ATCCGCGGCT TCCTCCAAGC CCTCGGGCTG CCCCGGCAGG AGCTGCCGGC CCTGTTCGCC GACCAGGTCG GCCGCTACCG CAGCCTCGCC GCCGAGCGCC GGCTCCTGAT CGTGCTGGAC AACGCCCGCG ACGCCGAGCA GGTGCGCGAG CTGCTGCCCG GCAATCCGGC GTGCCTGACG CTGGTCACCA GCCGCGACCG GCTCACCGGG CTGGTCGCCG TCGACGGCGC CCGGCCGCTC CGGCTCGACA CGCTGCCCGC CGACGAGGCG TTCGACCTGC TGGCCCGCCG GCTCGGCGGG CGGCACGCGG CCGAGGAGCC GGACGCGATC CGGGAGATCG CAGAGCTCTG CGCGCGGCTC CCCCTCGCCC TGAACATCGC CGCCGCCCGG ATCGCCACGA ACCCGCATCT GCCGATCGAG ATGTTCGTCC AGGAGCTGCG CGAGGCCGGC GCCACACTGA GGACCCTGGA CGCCGGCGAC CGGGCGGCCA GCGTCCGGAC CGTCTTCTCC TGGTCCTACC GGCAGCTCGG CGGGCCCGCG GCCCGGCTGT TCCGGCTGCT GGGCGTACAT 
CCGGGCCCTG ATCTGGGCCT GTCGGTCTGC TCGGCGCTGA CGGCCCGGCC GCGCGCCGCG ACGCTGGCCA CCCTGGAGGA GCTCACCGGC CTGCACCTGC TCGACCAGCA CGCGCCGGGC CGGTACGTCC AACACGACCT GCTGCGGGTC TTCGCCGGCG AACTCGGGCA GGCGGTGGAC GGCCGGGACG CCAGCCGCGA CGCCGAGCTG CTCACCCTCG ACCACTACCT GCACAGCGCC TTCGCCGCCG AACGCCTGCT CCAGCCGGCC CGGCCGCCGA TCGCGCTCGC GCCGCCGCAC GCCGGCTCCG CGCCCCTGGA CTTCGCCGAC CTGGCCGAGG CGCTGCGCTG GTACGACGCG GAGTACCCGG TGCTGCTGGC CGCCGCGCGG CGGGCCGGCG CCGTCCCGGA CCCGCACGCC TGGCAGCTGC CCTGGTCGAT GGTCACCTAC CTCGACCGGG CCGGGTTGTG GCACGACCTC ACCGAGACGC TGACCGGGTC GCTGGCGGCA CTGCGGCGGA TCGGCGACAT CCCGAACCTC GTCGCCGCCC ACATGTGCCT GGCACAGGTC CTGGGCCACA GGCTCAACGA GGCTGAGGCC GCGGAGACCC ACTTCCAGGC GGCCCTGGAC CTCGACCGCG AGACCGACGA CGCCACCACC GAGGTGAGGG TCATGGCGAA TCTCATGACC CTGCGAGGAA GGCAAGGACG CTGGGCGGAG TCGGTGGTCT TCGGACTGCG GGCTCTGAAG CTCCTGCGCG AGAAGGGAGA GACCACTGTC CTCCTGCCGA CCGTCCTCAA CAAGGTGGGC TGGAGCCACG TCCACCTCGG CCGGTACGAG GAGGCGCTCG CCTGCTCCAC CGAAGCGCTC GAGTTGTTCC GGGAGACCGG ATTCCGCATC GGCCAGGCCG ACGCCCTGGA CACCCTCGGC CTGGCCCGCC ACCGGCTCGG CGACACCGCC GGCGCTGTGG CCTGCTACGA AGCGGCCGAG GCGGTCTTCA TCGAGGTCGG CGAACGGTTC CTGCTCGCCG AGACGCTGAT GCGCCTCGGC GACGTCCACC TCACCGACCA CGCCGAGGCC GCCGCCCGCG AGGTCTGGAC CCGGTCGCTG GCGATCCTCA GCGATATCGG CCATCCGACG GCCGAGCAGG TCGAGGAGCG GCTCCGGTCC CTGGACCGCT GA
|
Protein sequence | MKFGVLGPVR AGGGQEVPAL TPMVRSLLAV LLVEAGRPVS EARLTEALWG GSPPQTSKAA LQNHVLGLRR ALGVDEAARV RRTYDGYLIE VEAGELDLRE FEQLSGEGSE DLVAGRWQAA ADALTGALAL WRGDPLADAL PATRDAVDVG RIHEARLQTV EQLARARLEL GHYDRVIGEI EPLLREHPWR EAMHGQLMHA LHGAGRQAEA LTVYQRLRTG LVTELGVEPS AGLADLHRRI LAGDPALIRT ITPAGATQGP SRVIGAPDAG QSGHSRESGE SDHTAHTAQG SGADSAHGAD HRRPQNPDPR AADAANPVTP ANAVIPRQLP AKISHFTGRT AALAVLEEFL AAAGEGDQPL IALVGTAGVG KTALAVHWAH RIAYRYPDGC LYVNLRGFDP SQEPVTPEQA IRGFLQALGL PRQELPALFA DQVGRYRSLA AERRLLIVLD NARDAEQVRE LLPGNPACLT LVTSRDRLTG LVAVDGARPL RLDTLPADEA FDLLARRLGG RHAAEEPDAI REIAELCARL PLALNIAAAR IATNPHLPIE MFVQELREAG ATLRTLDAGD RAASVRTVFS WSYRQLGGPA ARLFRLLGVH PGPDLGLSVC SALTARPRAA TLATLEELTG LHLLDQHAPG RYVQHDLLRV FAGELGQAVD GRDASRDAEL LTLDHYLHSA FAAERLLQPA RPPIALAPPH AGSAPLDFAD LAEALRWYDA EYPVLLAAAR RAGAVPDPHA WQLPWSMVTY LDRAGLWHDL TETLTGSLAA LRRIGDIPNL VAAHMCLAQV LGHRLNEAEA AETHFQAALD LDRETDDATT EVRVMANLMT LRGRQGRWAE SVVFGLRALK LLREKGETTV LLPTVLNKVG WSHVHLGRYE EALACSTEAL ELFRETGFRI GQADALDTLG LARHRLGDTA GAVACYEAAE AVFIEVGERF LLAETLMRLG DVHLTDHAEA AAREVWTRSL AILSDIGHPT AEQVEERLRS LDR
|
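The coordinate and length fields in the record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper name `cds_lengths` is hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the listed gene sequence ends in TGA).

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS includes exactly one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # the stop codon encodes no residue
    return gene_length, protein_length


# Start bp and End bp copied from the Caci_5039 record.
gene_length, protein_length = cds_lengths(5776269, 5779250)
print(gene_length, protein_length)  # 2982 993
```

Under these assumptions the computed values agree with the Gene Length (2982 bp) and Protein Length (993 aa) fields listed for Caci_5039.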