Gene Information |
Locus tag | Caci_4051 |
Symbol | |
ID | 8335404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4581205 |
End bp | 4585014 |
Gene Length | 3810 bp |
Protein Length | 1269 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644957154 |
Product | XRE family transcriptional regulator |
Protein accession | YP_003114757 |
Protein GI | 256393193 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
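The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon not counted in Protein Length. Neither convention is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 4581205
end_bp = 4585014
gene_length_bp = 3810
protein_length_aa = 1269

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_aa = codons - 1

print("span (bp):", span)
print("codons:", codons, "remainder:", remainder)
print("expected protein length (aa):", expected_aa)
print("matches record:", span == gene_length_bp and expected_aa == protein_length_aa)
```

Under those assumptions, the reported gene length and protein length are consistent with the reported coordinates.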
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.544499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.21664 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGGC CAGGACTTGC CACTCGTGAC GATCTGCTGC GCTGGCCGAG TATCGTCGCG GCCGGGGAGT TCCCGCGCCT CATTCGGCGG CTGATCTTGG AGACTGTCCC GGATGCAGTC CGGCTAGGTT TCCCTGCAGG ATCGGGAACT TCGGCGGGCA GCTGGGACGG TTCGGTTCGT GCCGTAGCCG GCAATACGTT CGTCCCCGCC GGCCTGTCTG TGTGGGAGTT ATCGGTCGGG CAGAGCGGCA TCGCCGCCAA GGCGGACGGC GACTACGAGA AGCGGACCAG CACGCCCGAC GGCTCGCCCG TCTCCGATGC CGTATATATC GAGGCAATGC TCCGGCCGTG GCCCGACCGC CGGACGTGGG CGGCAGGGAA GCGCGGCGAT CGGCGGTGGA AAGATGTAAT CGGCTACGGC GTCGATGACA TCGAGGAGTG GCTCGAATCA GCCCCCGTTA CGCATTCGTG GGTATCGGAA ATGCTCAACC TCGCGCCGCA CGGATACCGG GCCGTCGAGA CGTGGTGGCG AGGCTGGGCA GGTGCGACCA CCCCAGTACT GCCGACAGGC GCAGTGCTCG CGGGCCGAGA CCGAGCCGTC CAGGCGCTGG AGGACCGACT GGAGGGCACT CCCTCCATCA CCACAATCAG GGGCGTTAGC CTGGATGAGG TCCTCGCGTT CATCGCCGCG GTGTTGGAGA GGCAAGCGAG CGCAGGCGAC TCCCGGTGGC TGTCGCGCGC GGCTTTCGTC GACCAGGTGA CGAGTTGGCG CGCGTTGGCG GAGCGACCTG GACCGCTGAT CTTGGTGCCG ACGACCGCCG ACGTCGCCGC CGAGGCGGCC CGTGGGGCTG CCCACCACGT CCTAATCCCG GTGACCGCAG CCGCCGACAT TGACCTGCCG CCCATCGACT CGCGCACCGC GGCCGTTGCC TTGCGGGCTG AAGGGCTCGG CGAGGGGTCC GCTGAGCAGG CGGGCCGCTT GGCTCGGACA AGTCTGCTCG CGATGCGCCG CAGCCTGGCG AGCAGGCCCG AACTTCACAC GCCCTCATGG GCATCGGGCC CGTCGCGCAC GCTGCGTGGT CTGCTGCTGG CCGGGCGCTG GAGCCAGACT CACGACACAG ACCGAGCCGC TATTTCTGAT CTGACCGGCG AGGATTACGA CACGCTCCGC GAAACTGTCG CAGGGCTGAA CCGCGCGAGC GACCCGTTCG TCACCCAGAT CGGCTCGATC TGGATGCTGA CCAACATGCA CGATGCGTGG ATCCATTTGT GCGAGACCGT CCGCCAAGAC GACCTGGACC GGCTCGAGCC CGTGCTGCGA AGGGCACTCT TGGAACGGGA CCCGGCGCTC GACTTGAGCC CGGATGACCG ATGGTCCGCC GGCGCCGTCG GCAAGTCGCC GACGCACTCC GCCGACCTGC GCCGCGGTCT TGCAACGACA CTGGCAGCTC TCGGAGTCCA CGGACAGGTG ATCGACACCG GTCACGGGAC CAATGGCGCT CAGTGGGCGG CCCGCTTCGT CGGCGACCTG CTGCGGGAAG CCAACGCAGA TACGACCGGA GACACATGGA ACTCGCTGGC CGGGCTACTG CCGCTGCTGG CTGAGGCGGC GCCCGACGCC TTCTTGGATG GGCTGCGAAA TGCGAGCCAG GGTGCCGTCC CGGTGATCTC CACGATGTTT ACCGACAGCA CGACGACTTC GGTTACCAGG GAGCTCTCGC GCCACCACCA CCTGCTCTGG GCGTTGGAAA CGCTTGCCTG GTCTCCAGAC CACTTCGGCC GCGTGGTCCT GCAACTTGCG 
CGGCTAGCGG AGATCGATCC TGGCGGGTCT CTGTCCAGTC GCCCGTTCAA CTCGCTGGTC ACGGTCTTGT GTCTGGAGTA CCCCGAGACC ACGGTGCCCG TTGCGGGACG GATGGCGGTC ATCGAGAGGC TACGAGACCG CCATCCCGAC GTCGCATGGC GGCTGATGCT CACGCTGCTC CCGTCCCAGT TCGACTTGCA CGGCCCGACA CCGAACCCTG AGTTCCGCGA CTGGAAGCCA CAGGAACCCG TCGCGGTGAC CGCAGATGAA TGGCTGGACT GTGTTCGGAC GCTCGTCAAC TGGCTCATCC TCGACGCCGG CGACAACGTA CGACGATGGC AGCAGGTCCT CGACGTCTTC CCGTTCCTGC CGAAAAGCGA TCGCCAGCGC CTGCGGGAGG CGATGGCGAC GCGGGTCGGT GATGGAACAC TTAGCGATGA CGGGCGGGCT GATTTGTGGG AATCCCTACG GGAGCTCATC ACTCACCACC GCTCGCGCGC CGGCAAGCCG GGATCGTTGC CAGTCGATGA GGTGGACGCA CTCCAGGACA TCGAACGGGC CCTCGCCCCC TCCGATCCGG TCCAACGGCA CCGGTGGCTG TTCGCGACGC AGATGCCGGA ACTAGCTGAA CACCGCCGGT TCGGCGACCC TGCATACGAC TCCGCCGTGC AGGACAAGCG GATCGCGGCG ATCACTGAGA TTGAAGAGCG CGGAATCGAC GCAGTGCGGG AGGTCGCGGC GACCGCCGCC GACGCGAGGA CAGTCGGCGT GTGCCTGGCC GAGGTGGCAG GCGACAAGTA CCGTTCCGAA CTCGTCGCCA TGATCCCGGC GGACCCGGCC GGCACGGCGT TGATCGAGGG CTGGCTCTCA CGGCAGTTCC AAAAGGATGG TTGGACCTGG TTGGACGGGC TCCTCGCTGA GGAACTCACG CCGGAGCAGG CGGCAGTGGC GTTGTTGGCG TCGCGGGACT ACCCGAAGGC GTGGGAGGTC GCCGAAGCCC ACGGGACGCC GGTCGCGGAG GCGTTCTGGA AGTATTTCTC GATCAACGGC CTGGGGCGGG ACTTTGGCCA CGTCGGCGAG GCAGCGAGCC AGTTGGCGCA GGCCGGGCGC GTAGCCGCGG CACTGAAGCT CGTCGTCATC TACCTGGACG ACCTTGGCGA CACCTCTGCA GATCTGCTCA TTCGCCTTCT CGGCCAGTTC GCCGACACCT ACCAATCGGA CCCTGAGACA GGGCTGGTCG GCGAGTACGA CTTCCGGGCG CTCTTCGAAT ACCTGCACCT GCACACCGAT CCACAGCGCA GCACCGAAAT CGGCCAGCTG GAATGGGTCT TTCTCTCCGG TCTGGGATTC CAGCCGCCCG CCGGCCGCCT TCGCGAAGCG CTGGCCGCCG AGCCGGAGCT CTTCGTGCAG ATCATGTCGT CCACCTGGCG AGCAACGGAT GCGCAGGAGA ACGACGAAGA CAGCCAGGGC GAGGGCGCCG AGCCGGAAGA CGAAACGCTC ACCAAGGAGC AGGTACAGCA GGCGACGAAC GGATACGCAC TCCTGACGTC CATTGATCGA CTCCCCGGGA TCGGCCCGGA CGGGCGAGTT GATCCTGCGG CCCTCCAGCA GTGGGTAGCC CGGGTCCTCG AACTCGCTAC CGCGTCTGGC CGACGAAGAA TCGCCGAGAT GCTGGTCGGG CAGATGCTGG CGAGCGCGCC AGCCGACGAC GACGGCACCT GGCCATGTCA GCCGGTACGA GACCTGCTCG AAGAGCTACA GAGCGAGCGA GTCGAGCGGA GTCTCGCCGC ACAGCTCTAT AACGACCGCG 
GAATGACCTC GCGCGATCCG GAGGACGGCG GCAGGCAGGA ACGAGCGCTG GCTGAGAGGT ATCTGGCGCA GGCCACGACG TTCTCTGACA GTTGGCCCCA GACCGCGGTA GTTCTAAGAA GGGTCGCCTC CATGTACGAA ACTGATGCGC ACGAGCACGA TGACAGGGCC GAACGATTCC GCCAAGGGCA GCAAACATGA
|
Protein sequence | MARPGLATRD DLLRWPSIVA AGEFPRLIRR LILETVPDAV RLGFPAGSGT SAGSWDGSVR AVAGNTFVPA GLSVWELSVG QSGIAAKADG DYEKRTSTPD GSPVSDAVYI EAMLRPWPDR RTWAAGKRGD RRWKDVIGYG VDDIEEWLES APVTHSWVSE MLNLAPHGYR AVETWWRGWA GATTPVLPTG AVLAGRDRAV QALEDRLEGT PSITTIRGVS LDEVLAFIAA VLERQASAGD SRWLSRAAFV DQVTSWRALA ERPGPLILVP TTADVAAEAA RGAAHHVLIP VTAAADIDLP PIDSRTAAVA LRAEGLGEGS AEQAGRLART SLLAMRRSLA SRPELHTPSW ASGPSRTLRG LLLAGRWSQT HDTDRAAISD LTGEDYDTLR ETVAGLNRAS DPFVTQIGSI WMLTNMHDAW IHLCETVRQD DLDRLEPVLR RALLERDPAL DLSPDDRWSA GAVGKSPTHS ADLRRGLATT LAALGVHGQV IDTGHGTNGA QWAARFVGDL LREANADTTG DTWNSLAGLL PLLAEAAPDA FLDGLRNASQ GAVPVISTMF TDSTTTSVTR ELSRHHHLLW ALETLAWSPD HFGRVVLQLA RLAEIDPGGS LSSRPFNSLV TVLCLEYPET TVPVAGRMAV IERLRDRHPD VAWRLMLTLL PSQFDLHGPT PNPEFRDWKP QEPVAVTADE WLDCVRTLVN WLILDAGDNV RRWQQVLDVF PFLPKSDRQR LREAMATRVG DGTLSDDGRA DLWESLRELI THHRSRAGKP GSLPVDEVDA LQDIERALAP SDPVQRHRWL FATQMPELAE HRRFGDPAYD SAVQDKRIAA ITEIEERGID AVREVAATAA DARTVGVCLA EVAGDKYRSE LVAMIPADPA GTALIEGWLS RQFQKDGWTW LDGLLAEELT PEQAAVALLA SRDYPKAWEV AEAHGTPVAE AFWKYFSING LGRDFGHVGE AASQLAQAGR VAAALKLVVI YLDDLGDTSA DLLIRLLGQF ADTYQSDPET GLVGEYDFRA LFEYLHLHTD PQRSTEIGQL EWVFLSGLGF QPPAGRLREA LAAEPELFVQ IMSSTWRATD AQENDEDSQG EGAEPEDETL TKEQVQQATN GYALLTSIDR LPGIGPDGRV DPAALQQWVA RVLELATASG RRRIAEMLVG QMLASAPADD DGTWPCQPVR DLLEELQSER VERSLAAQLY NDRGMTSRDP EDGGRQERAL AERYLAQATT FSDSWPQTAV VLRRVASMYE TDAHEHDDRA ERFRQGQQT
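The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch, not a reference implementation: it strips the whitespace, translates DNA with the amino-acid assignments shared by the standard code and translation table 11, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Only the first 30 bases and the last 9 bases of the gene sequence are used here. Although the record lists the gene on the minus strand, the sequence as printed begins with ATG, so it is assumed to be given in coding orientation already.

```python
# Codon table built from the conventional TCAG ordering.
# Translation table 11 uses the same amino-acid assignments as the standard
# code; it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 to 1.0)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence and first block of the protein.
gene_prefix = "ATGGCCCGGC CAGGACTTGC CACTCGTGAC"
protein_prefix = "MARPGLATRD"

# Last nine bases of the gene sequence (two codons plus the stop codon).
gene_tail = "CAAACATGA"

print(translate(gene_prefix) == protein_prefix)
print(translate(gene_tail))
```

Applying `translate` and `gc_fraction` to the full gene sequence would allow the Protein Length and GC content fields to be checked in the same way.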