Gene Caci_4197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4197 
Symbol 
ID8335551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4751681 
End bp4754677 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content71% 
IMG OID644957300 
Producttranscriptional regulator, LuxR family 
Protein accessionYP_003114902 
Protein GI256393338 
COG category[K] Transcription
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG2197] Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain
[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGAC CTCAGGATCA CGACCTCGGC TCACCGCATC TCGTCGTCGG CCGCGACGAG 
GAGATGCGGG CCGTCGCCGC CGCCTTCGAC GCCTTGACCG CCGGCACCGG CGGGTTCTTG
CAGGTGGTCG GCGAGCCGGG CATCGGCAAG ACGTACCTGC TGGCCGCGAT CCGGGAGACG
GCGCTGACCA AGGGCATCTC AGTGCTCTCC GGCAGGGCGA CGGAGTTCGA GCAGGAGATG
CCCTTCCAGA TCCTGCTGGA CGCGCTGTCC GAGCACGGGG ATCTGGCGCG CCTGCTGGAA
GGGCTGCCGC CGGGCGCCGG CGAAGTGCCG GCGCCGGACC GGCCGCCCGG GGCCCGGTCG
GCGCCGGACA TCGACCGCTT CCGGCTCTTC CAGGCGCTGC GGCAGGTGAT GGCGTCGATG
GCGGATCAGC CGGTCCTGGT GCTGCTGGAC GACGTGCACT GGGCCGATCC GGGCTCGATC
GACTTCATCG CCTTCCTCAG CCGCCGCCCG ATCGCCGGAC CGGTGCTCAT CGTCGTGGCG
CACCGCGAAC GCCAGGCGCC GGCACAACTT CGCTACGCGC TAGCCCGCGA CACCGATCAC
GGCACGGTCA CCCGGATCGA GCTCGGTCCG CTGTCGCTGG CCGATTCCGC GCGGCTGCTC
GGCGACCGGA ACGGAACGCG CCGCAGCGTC GAGTTGCACG AGAAGAGCCA CGGCAATCCC
CTGTACCTGC TGACGCTGGA CCGCAGCAAC GGCTACCGGC GGCGTCCGGG AGACCCGAAC
CGGCGCGAGG GTGGCGACGC CTCCAGCCGG CTGGAAGCGC TTATCCTCGG GGAGACCGTG
ACCCTGTCTC CGGACGAGTT GGCCGTCGCC TCGACCGCAG CGGTCGTGGG AGACCCGTTC
ACCCCCGAAC TGCTGACCGC GGTCATGGCC GGCCCGGCTT TGAGCCAGGT GGAGTGCGCG
GTGAACAGCC TGGTCAGCCG GGATCTGATT CGCGAGGTCC CCTCCGGTCC CGGGCTGGTC
TTCCGGCATC CGCTGGTGCG CCGCGTCATC TACGACCAGT CGGCGCCGAC GTGGCGCGTG
GACATCCATC GCAGAGCCCT GGCACTGCTC GCCGAGCGCG GGGCGTCTGC CTCCGAGCGG
GTCCACCACG TCGAGCACTG CGCCACCGCC TGGTCGCCGG AGTACGAAAC AGTGCTCTGC
CAGGCCGGCC AAGAGGCGAT GAGCACCTCT CCGCTGACGG CGGCGCACTG GTTCGGCGTC
GCGCTGAACC TGTTGCCGCA CAACGAGGAT TCGCTGCGGC GCCGCTTCGA ACTCAGCTTC
CTGGTAGCGC GCGCCTTAGG ACTCGGCGGC AGGTTCGCCG AGAGCCGGGA CCTGCTGCAC
GAGATCCTTC AGGACGCCTC CGAGGCGACG ATGGTCAACC GCTCCGAGGC GGTGGTCCTG
TGCGCGCACG CCGAGCAGCG CCTGGGCCGC TACTCCGAGG CGATCGCGCT GCTGCGTGCC
GAGGTGGCGC GTCTGGGGAC GAAGACGTCC TCCCAGCGCA TCGGCCTGTG TCTGGAACTG
GGCCTGACAG CTTTGCTGGC CAACGATTAT CCGGCGGCGC GTGCCGACAT CTCCTGGGCT
CTGGACGCCG CGCGCGAGAC CGGCGACCGG CTGTATGAGG CGACCGCGCT GGCGTTCAGC
GCGTTCGGCG AGATCTGCGT CGGGCACACC GACATCGCTC GGGCGGCGGC GGATTCGGCG
AGCGCGATGG TGGACGGCAT GCCTGACAGC GTGCTGGCCG GGGAGCGGGA GACGCTGTGC
ATGCTCGGCT GGGCCGAGAT GCTGCTGGAG CGGTTCTCCG ACGCCGATCG GCATCTGGCG
CGCGGTCGGG CGATCGTGCG GCGGACCGGG CAGAGCCACG GCTTGCCGCA CGTGCTGCTC
GGGCAGTGCC TGGTGGCGAT GTTCACCGGA CGCATGGCCG AGGCGCTGGA GCAGGTCCAG
AACGCCGAGG ACGCCGCGCG CCTGGTCGGC AGCGACCACC TGCTGGGCAT CGTGCTGGCG
ATCAAGGCGC CGATCCAGGT GTGGATGTCG CCGCGCGGCA AGGGAGACGC GGCGCTGGCC
GGGGCGCGCG CGGCGACCGC GTTGTTCACC GGGAGCGCCG TGAACAGCTG GTGGGCACGC
AATGCTCTGA TGCTGCGCGG ACATGCCGAA CTCACCAACG GTGACGCGGC GAGCTGCGTG
GAGCTGGTGT TGCGCGCAGG CGGTCCGGAC TTACGCATGC TGGGCGCGCC TCTGGTCCCG
GAGTATGCCG AGATCTTGAT CAGTGCCCTG ATCCGGCTGG GGCAGACCGA TCGCGCCGAG
GAACTGGCGC GGATGGCGGT TGTGCAGGCC GAGCAGCTGG ATCTGCCCGG TCAGCGCGGG
CACGCGGTCC GTGCCCGGGG TCTGCTCTCG GCGCTGCGCG GCGATCACCA GTCCGCGGAC
GCCGACTTCA CGCGTGCCGT CGCGGCGTTC GCGCAGGCTG GTCGGCTCGT GGAGCAGGCG
CGGACCGCGG TGTTCCATGC CCGCACGCTG GTCATGCTGG GTCGGCGCGA GGAGGCGCTG
GCGACGCTGG CCCGCTCGAC GGCGCAGGCT GCGGGCTGCG GCGCGATCTG GGTGCGCGAC
GAGTTGGAGC GGGTACGCGG GCAGGTCGCA GGGCGCGCGC CGGGTGGATG CGTGCAGTCG
GCGGAGGAAA CGGCGGCCGC GGTGAACGAC GCCCCGACGA ATGCGGTCGG CGAGAGCACG
TCCGCCGCTT CGCGATCGGG TACTCATGAC AGCGCCAGGG TGTTGACCGT GCTGACCGAC
CGCGAACGCC AGATAGCGCT GCTCGTCGGC GCCGGGCACA GCAACCGCCA GATCGCGACG
CGGCTGTTCC TGAGCGAGCG GACCGTCGAG AGCCACATGG GCAACGTCTA CCGGAAGCTG
GGTGTGGCTT CGCGCGTCGC GCTGGCGCGG CTGCTGGCGT TCGAGGTCGA GGAGTAG
 
Protein sequence
MSRPQDHDLG SPHLVVGRDE EMRAVAAAFD ALTAGTGGFL QVVGEPGIGK TYLLAAIRET 
ALTKGISVLS GRATEFEQEM PFQILLDALS EHGDLARLLE GLPPGAGEVP APDRPPGARS
APDIDRFRLF QALRQVMASM ADQPVLVLLD DVHWADPGSI DFIAFLSRRP IAGPVLIVVA
HRERQAPAQL RYALARDTDH GTVTRIELGP LSLADSARLL GDRNGTRRSV ELHEKSHGNP
LYLLTLDRSN GYRRRPGDPN RREGGDASSR LEALILGETV TLSPDELAVA STAAVVGDPF
TPELLTAVMA GPALSQVECA VNSLVSRDLI REVPSGPGLV FRHPLVRRVI YDQSAPTWRV
DIHRRALALL AERGASASER VHHVEHCATA WSPEYETVLC QAGQEAMSTS PLTAAHWFGV
ALNLLPHNED SLRRRFELSF LVARALGLGG RFAESRDLLH EILQDASEAT MVNRSEAVVL
CAHAEQRLGR YSEAIALLRA EVARLGTKTS SQRIGLCLEL GLTALLANDY PAARADISWA
LDAARETGDR LYEATALAFS AFGEICVGHT DIARAAADSA SAMVDGMPDS VLAGERETLC
MLGWAEMLLE RFSDADRHLA RGRAIVRRTG QSHGLPHVLL GQCLVAMFTG RMAEALEQVQ
NAEDAARLVG SDHLLGIVLA IKAPIQVWMS PRGKGDAALA GARAATALFT GSAVNSWWAR
NALMLRGHAE LTNGDAASCV ELVLRAGGPD LRMLGAPLVP EYAEILISAL IRLGQTDRAE
ELARMAVVQA EQLDLPGQRG HAVRARGLLS ALRGDHQSAD ADFTRAVAAF AQAGRLVEQA
RTAVFHARTL VMLGRREEAL ATLARSTAQA AGCGAIWVRD ELERVRGQVA GRAPGGCVQS
AEETAAAVND APTNAVGEST SAASRSGTHD SARVLTVLTD RERQIALLVG AGHSNRQIAT
RLFLSERTVE SHMGNVYRKL GVASRVALAR LLAFEVEE