Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4039 |
Symbol | |
ID | 8335392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4567354 |
End bp | 4570479 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644957145 |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003114748 |
Protein GI | 256393184 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.039203 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0409603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATAC TCCAGGCGCT CGGCGCAACC GCGCGCGGCA TCACCCGCGT GGTGAAGGCG CTGCTGGCCG CCGCGATCAC CATCGGCGTG CTCGCCGGCA TCCCGGCGGC CCTGCTGCAC TACGCCGGGG ACCCGATCCC GCACTCGGTA CCGACCATGG ACGCGGTCAA GCACACCCTG ACCAACCCCA TGACCCCGCA GATGCTGCTC AAGGCCCTGT CGGTCGTCGG CTGGTACCTG TGGGCGATCC TGGCCGTCAG CTTCCTCGTC GAGCTGGTCT ATGCCGCACG GCGGGTCAAC GCCCCGCACA TCCCCACCCT CGGGCCGACC CAGGCGCTGG CCGCCGCCCT GATCGCCGCC ATCGGCATCA CCACCCTCCT GCGTGCCGCC CCCGCCCACG CCGCCGAAAC CTTCTCGGCC TCCGCTCCGA CTGGCGGCCG GGTCGCAGCC ACCGCACCCG CGCTCGCCGG GACCGGCAGT CTGTCCACCG CGCACCTCGC GGTCGGCAAC GCCAGCTCGG CCGCGCCACG CGAAAGCGTC CACACAGTGA AGCCCGGCGA GTCGCTGTAC TCGATCGCCA AGGAAGACCT CGGCAGCGGC GACGACTGGC CGGCCCTCTA CAAGATGAAC GCCGGCGTCG TCCAAGCAGA CGGCGACCAG CTCACCAACC CAGACCTGAT CCGCCCCGAC TGGAAGATCC GCATCACGCC GCCCAGCGCC GATCAGGCTC CCGCGACCAC TAGCACCACC ACCGCCCCGC CCACCAAAGC TCCCGCGCCG TCGGCCCCGA AGGCGTCCGC CCCGGCGACG AGTGGCCCCA GCGCGCTGCC TTCGCCCGCT GCTTCGCATA CTGCCGCCCC ATCCCCCGTC ACCCACGCAA CGCCTACGAA CGACGACCAC CGGCAGGGCG ACCGCGCCGC GAAGCGCCAC GGCGGAGTCG CGGTCTCGCT GCCCGACGGC GGAGCCATCG GCATCACCCT CGCCGTGGCG CTCGGGTCGG CGCTGGTCCT GGCCCGGCGC TGGCGCACGC GGCGCGCCGA CCCGCGCCTG CCGATCGCCG AGCCGCCGCT TCCCGGGGCG CTGCTGGCCG CCCGCCGTGC CCAGCGGTCC CTGGCCGCCG CCCAGCACAG CCTGGCCGCA TCGCCCGAAG ACGCAGTCCA CGACGAGGAA CACGACGACC TCTTCGACGC CAAGGGGATC GAAGACCTTG ACGAGGACGC CTTCGGCGCT GATCAGAACC TCATCGGCGA CGACGCTGAC AGCTGGGACG ACGCCGAGGA GCTGGACGAG TTCGGCGCGC CGGCCGGTCC CTTGCCGGAG CCTACGGTGA CGCGCTTTGC CGAGCCGCTG CCGCCGGGCT CGATCGGCGC CGCCGAGCGC GACGGCGTCG AGCTGCCGCT GACCCCGACG GGCACCGGCC TGGGCCTGGT CGGCCCCGGA GCCGCCGGGG CCGCCCGCGC GATCGCCGCC TCGGTGCTGT CAGCGGGCTC GCCCGAGCGC ACCGCCGACC TGGCCAGGCT CATCATCCCG GCCGCGGACC TGGCCGCGCT GCTCGGCGTC GACGAGTCCG AGCTGCCAGC CATCACCCGC GGGCTTCCCG AACTATTCGC CACCAAGGAC CTGGCCACCG CGATCGGCGA AGCCGAAGTC CACGCGCTGC TGCGCACCCG GCTCCTGGAG GAGTACGAGC AGCCGGACCT GGACGCGCTC GCCGCGGCGC ACCCGGACGT CGAGGACTGC CCGCCGCTGG TGCTCATGGC CTCCCCGGCG CGCGCTCTGA GCGCGCAGAT CGCCGCCCTG ATGAACACCT CCGCGTGTCT GCGGATCACC ACCGTCCTGC TCGGCGCGCA CCCGGACGGG CCGACCGCCT TCGTCGAAGC CGACGGCACC GCCACCGGCC CCGCTGTCCG AGACTGGTCC GGCGCCAGGC TGTGGAACCT GAGCGCGCCC GCCCTGGCCG ACATCCTGGA CCTGCTCGCC CGCGCCGCCG GCCACGATCC CGGCACTCCG GGCGGCCAGG CCGAACCAGA CGACTGGCCG GAGACCCCCC CGGCCCACGA CGCCGCAGAG CGCGCGGCGG CCGACGAGAA CGACGGCGGC GAGGCGACGA TCACCGTCCT GCCCGTGCGC CCCATGCCGG ACCGGCAAGC CGACCCGGTC GGCGACGACC ACGAGGAGCG CACGCTCACC GACTCCCCTG TGGCAGGTGC CAACACCGCA TCCCTGGCCC CGGTGACCAT GCTGCCGGTG CGGCCCGCCC CGGACAGGGC GCTCGCCGAC GCGGCGCGAA CCAGCGCCGA CGTGCGCGCC GAGGCGGCGC TGACGGCATG GAACGAAAAC CCGATACGGA TCAACGTGCT AGGCGGACTG AACATCACCG CAGGCGGGCA GTCAGTGTCC GGGCTTCGCA CCTCGGCCCG CGTGCTGGCC GCGCTGCTAG CAGTCAAGGG GTCCGCGGGC GCCAGCTCCG AGCAGATCGA CGCGATGTGC TGGCCCGACG CCAACCCCCA GGAGATGGAC CGGATCGCCA AGTGGCGTGC CGACGGCCTC AACTCCCTGC GTAAGCGCCT GGCCGCGGCC ATGGGCCAGC GCAGCCCCCG GCTGGTCCTG CTCGACCGGG CCACCGGCCG CTACCGGCTC AATCCCGAGC TGGTTGCCAC CGACCTGGGC ACGATCGCCG AGCTGACCGC CGCCGCCCGC AGCGCCGGTG ACACCGAGCA GCGCCTGGCG CTGCTGGCCG CCGCCGAACC GCTGTGCCGC GGAGCGCTGC TGGACGGAGA ACTCGGCGAC AATTTTGACT GGAGCGCGGA CTTCATCGCG ACCGTCGCCG ACGAGCAGGT CGCCGTCCTG GCACGCCTCG CGACGCTGGC CGCTGATTCC CGGCCGGACC AGGCCCTGGC GGCGCTGGAG AAGGCCGCCG CGTTCACCGA GGACAACGAG ACGCTGTACC AGCAGATGTT CGACATCCTC GCTGAGGCCG GACGGCACAG CGAGATCCCC GGCAAGCTGC GAACCCTCGA GGCGTACGCC GACTCCCTCG GGGCCGGCGT CTCGACAGCG ACCCGCGAAG CAGCGGCGCG CGCGATGAAG CGCCAGCCGC AGCAAGGGGT CCGCCAAGGG CACTGA
|
Protein sequence | MAILQALGAT ARGITRVVKA LLAAAITIGV LAGIPAALLH YAGDPIPHSV PTMDAVKHTL TNPMTPQMLL KALSVVGWYL WAILAVSFLV ELVYAARRVN APHIPTLGPT QALAAALIAA IGITTLLRAA PAHAAETFSA SAPTGGRVAA TAPALAGTGS LSTAHLAVGN ASSAAPRESV HTVKPGESLY SIAKEDLGSG DDWPALYKMN AGVVQADGDQ LTNPDLIRPD WKIRITPPSA DQAPATTSTT TAPPTKAPAP SAPKASAPAT SGPSALPSPA ASHTAAPSPV THATPTNDDH RQGDRAAKRH GGVAVSLPDG GAIGITLAVA LGSALVLARR WRTRRADPRL PIAEPPLPGA LLAARRAQRS LAAAQHSLAA SPEDAVHDEE HDDLFDAKGI EDLDEDAFGA DQNLIGDDAD SWDDAEELDE FGAPAGPLPE PTVTRFAEPL PPGSIGAAER DGVELPLTPT GTGLGLVGPG AAGAARAIAA SVLSAGSPER TADLARLIIP AADLAALLGV DESELPAITR GLPELFATKD LATAIGEAEV HALLRTRLLE EYEQPDLDAL AAAHPDVEDC PPLVLMASPA RALSAQIAAL MNTSACLRIT TVLLGAHPDG PTAFVEADGT ATGPAVRDWS GARLWNLSAP ALADILDLLA RAAGHDPGTP GGQAEPDDWP ETPPAHDAAE RAAADENDGG EATITVLPVR PMPDRQADPV GDDHEERTLT DSPVAGANTA SLAPVTMLPV RPAPDRALAD AARTSADVRA EAALTAWNEN PIRINVLGGL NITAGGQSVS GLRTSARVLA ALLAVKGSAG ASSEQIDAMC WPDANPQEMD RIAKWRADGL NSLRKRLAAA MGQRSPRLVL LDRATGRYRL NPELVATDLG TIAELTAAAR SAGDTEQRLA LLAAAEPLCR GALLDGELGD NFDWSADFIA TVADEQVAVL ARLATLAADS RPDQALAALE KAAAFTEDNE TLYQQMFDIL AEAGRHSEIP GKLRTLEAYA DSLGAGVSTA TREAAARAMK RQPQQGVRQG H
|
| |