Gene Caci_4666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4666 
Symbol 
ID8336020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5312467 
End bp5315769 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content71% 
IMG OID644957766 
Producttranscriptional regulator, winged helix family 
Protein accessionYP_003115368 
Protein GI256393804 
COG category[R] General function prediction only 
COG ID[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0465391 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGCTCG ACGTCCTGAT CCTCGGTCCG CTCCAGGCAC GCGCCGACGG CGCACCGATC 
CCGATCGGCG GCGCCCGGCT GCGCACCCTG CTGACCAGAC TCGCCCTCGA CGCCGACCGC
ACCGTCCCGG CAGCCGCACT CATCGACGCC CTCTGGAACC ACCAACCACC CGACGGCGCC
GCCAACGCCC TGCAATCGCT CATCTCCCGG CTCCGCCGCG CCCTCGGCGA CCCCGACCTC
GTCCAAGGCA CCCCCGGCGG CTACCGACTC GCCATCGACC CCCAAGCCGT CGACGCCCAC
CGCTTCGAAA CCCTCGCCCG CCAAGGCAGA CTCGCCGGCG ACCCCACCAC CGCCCGCCAC
CTCCTGCGCC AAGCCCTCGC CCTCTGGCGC GGACCCGCCC TCGCCGACAC CGCCGACGCC
GGCTTCGCCA CCGCACCCGC CGCACGCCTC GACGAACTAC GCCTCACCGC CCAATGCGAC
CGCCTCACCG CCGAACTCCA CCTCGGCATG CACACCGACG CAATCCCCGA ACTCGAAGCC
CTCACCACCG AACACCCACT GCGCGAAAAC ATCACCGCCC TACTCGTCAA AGCCCTCTAC
GCAGCAGGCA GACAAGCCGA AGCACTCGCC GCCTACGAAC ACACCCGCAC GGCGCTCGCC
GACCAACTCG GCATCGACCC CTCCGAACAG CTGGCCACCG TACACCTCGC AGTACTGCGC
AACGACCCGC TGCTGACTCC GGCGAACACC GCAGTGCCAA CGCCCAGCTC TGCCGAACAG
CGACGCACCA ACCTGCGGGC CCGCCTGAAC AGCTTCGTCG GCCGCGAAGA AGAAGTCGCC
CGCATCGGCA CGATGCTCGC CACCTCCCGA CTGGTCACCC TCGTCGGACC CGGCGGCGCC
GGCAAAACGC GCCTCGCCGG CGAAGCCGCG ACACGGCTGC CCGAAGACAT CGACGTGGCC
GACGGCGTCT GGCTCGTCGA ACTCGCCCCG GTCACCGACC CCGCCGAACT GCCGCAAGCG
ATCCTCACCG CACTCGGACA CCGCGAAATG CGCGTCCTGC GCAACGAAGC ACAAGCCGGG
CCGGCGCGCG ACGCCCTGAC CCGCGTCGCC GAAGGACTCG CCGGACAGCA CCTGGTCATC
GTGCTCGACA ACTGCGAACA CCTCGTCGAC GCCGCCGCGC ACGCCGCCGA ACACCTGCTG
CAGCACGTGC CCGGCCTGCG GATCGTCGCC ACCAGCCGCG AACCGCTGGG CATCGGCGGG
GAGAACCTGT TCCCCGTGCT GTCGCTGGCA CAACCCGCCG ACCCGGACCA ACTCGCCGCC
GCCGAGCAGG CGCTGGCGTT CCCGGCGGTG CGGCTGTTCG CCGACCGCGC CGGCGCCGTG
CGCCCCGGCT TCAGCGTCGA GGACGAGAAC GTCGCGGACG TCGTGAAGAT CTGCCGCCGC
CTGGACGGGC TGCCGCTGGC GATCGAGCTG GCCGCCGCGC GTCTTCGCAC CCTGCCGCTG
CACGCTGTCG CCTCGCGTCT CGACGACCGG TTCCGGCTGC TCACCGGCGG CAGCCGCACC
GCCATGCCGC GGCACCAGAC GCTGCGCGCG GTCGTCGCCT GGAGCTGGGA GCTGCTGTCG
GAGCAGGAGC GGGACCTGGC CGAGCGGCTG TCGGTGTTCC CCGGCGGCAT CACCGCCGAA
TCAGCCGCCG CGGTGCATCC CGCCACCGCC GCCGACATCG ACGACCTGTT GTTCGCGCTT
GTCGACAAGT CGCTTCTGCA ACCGGTCGAG CCGGACGTCG CGCGCCCCGG CCACACAGAC
GATGTCGGCC ACACAGACGA AGCCCTCGAC ACCCCGCCGC GCTGGCGCAT GCTGGAGACC
CTGCGCGAAT ACGGCATCGA ACGACTCGCC GAGGCCGGCA CCGTCACCGA CGTCCGCCGC
GCCCACGCCC GCTACTTCCT CGACCTCGCC GAAGAAGCCG AGCCCCACCT GCGCCGCCGC
GAACAACTGC GCTGGCTGGC GCGCCTGGAC GCCGACAGCG ACAACATCCT CGCCGCCCTC
CGCTTCGCCG CCGACATCGG CGACGCCGAC ACCGCGATCC GGCTCGCCGC GTCCCTGGCC
TGGTACTGGT CCATCGTCGG CCAGACCATC GAGGGCCGAG CCTGGCTGGA CCTGGCCCTG
GCGGTCCCGG GCGAATCCCC GCCCGAAGCC CACGCGGTCG TCAAGATCCT GCACGCGCTC
GGCGGACTGT TCAGCGCGCA GGACTGGACC AACCTCTCGG ACATCACCAA CGCCCTGGCC
TCCGTCATCG ACGACGCCAA CGCCGCCCAC GACAACCCGC TGCTGGCCAT CGCCGCCTGC
ACCATCCCGA TCATCACCGA CGACCTCGAA GGCGTCTACG CCGCCGTCGC GGTCCACGAG
TACCACCCCG ACCCCTGGGT CGGCGGAATG CTCCACCTCA TGCGCGGCAT GGCCGCCGAG
AACGGCGGCG ACCTCATCAC CCAGCGCCAC GACCTGGAAC TGGCCCGCGA CCGCTTCGCG
CAGATCGGCG AACGCTGGGG TCTGTCGGCG ACCCTGGCAG CCCTCGCCAC CATGGCCATG
GCCGACGGCG ACCTGCCAAC AGCCATGCGC ATGCAGGACG AAGCCCTGGG CCTGCTCCGA
GAAATCAACG CAGCCGACGA CGCCGCACAG GTACAGATGA TGCGCGCCTT CGCCCTCGCC
CGCACCGGCG CACTGGACGA AGCCCAAACA CTGATGACGT CGATCCTGGA CTCAGGCCGG
CGCACCCGCT CGCACCCCTC GATCCTGATG GCGTACGTAG GACTAGCCGA CATCGCCCGC
CAAAAAGGCG AACCCGAAGC CGCATGGGGC TACCTCCAAG CCTCCGACGA CATCATCCGC
AACCACTGGC ACGGACCACC CCAGCTCCTA GCGATGCGCG AAGTCGCCGC GGCCCTCCTC
CACCTCCTCG CCACCGACCC CGACGCCACC AAGAAGGCAC ACGCCAGACT CCAAGAGGCA
TACCGCCTCG GCTCCGCCGC CCACGACATG CCAGTCCTCA GCCGCATCGC CATAGTCGTC
GCCTGCTACA CCAACGCCAC CGGCGACCCG GCATCCGCCG CACGAGCCCT GGGCACCGCA
GTATCCCTCC GCGGCGGCAT CGACCGAGGC GACCCAGACC GCGCCGCCGT CACCGACAGC
GTCCGAGAGC AACTCGGCGA GGCGAAGTAC GAGGCCGAAT TCGCCGTGGG GCACGCTTTT
ACGCGGCTGG AGGGGTTGGC GTTCCTCGGG GAATGCTTGG GGATTGAGGC TGCGGGAGCG
TGA
 
Protein sequence
MQLDVLILGP LQARADGAPI PIGGARLRTL LTRLALDADR TVPAAALIDA LWNHQPPDGA 
ANALQSLISR LRRALGDPDL VQGTPGGYRL AIDPQAVDAH RFETLARQGR LAGDPTTARH
LLRQALALWR GPALADTADA GFATAPAARL DELRLTAQCD RLTAELHLGM HTDAIPELEA
LTTEHPLREN ITALLVKALY AAGRQAEALA AYEHTRTALA DQLGIDPSEQ LATVHLAVLR
NDPLLTPANT AVPTPSSAEQ RRTNLRARLN SFVGREEEVA RIGTMLATSR LVTLVGPGGA
GKTRLAGEAA TRLPEDIDVA DGVWLVELAP VTDPAELPQA ILTALGHREM RVLRNEAQAG
PARDALTRVA EGLAGQHLVI VLDNCEHLVD AAAHAAEHLL QHVPGLRIVA TSREPLGIGG
ENLFPVLSLA QPADPDQLAA AEQALAFPAV RLFADRAGAV RPGFSVEDEN VADVVKICRR
LDGLPLAIEL AAARLRTLPL HAVASRLDDR FRLLTGGSRT AMPRHQTLRA VVAWSWELLS
EQERDLAERL SVFPGGITAE SAAAVHPATA ADIDDLLFAL VDKSLLQPVE PDVARPGHTD
DVGHTDEALD TPPRWRMLET LREYGIERLA EAGTVTDVRR AHARYFLDLA EEAEPHLRRR
EQLRWLARLD ADSDNILAAL RFAADIGDAD TAIRLAASLA WYWSIVGQTI EGRAWLDLAL
AVPGESPPEA HAVVKILHAL GGLFSAQDWT NLSDITNALA SVIDDANAAH DNPLLAIAAC
TIPIITDDLE GVYAAVAVHE YHPDPWVGGM LHLMRGMAAE NGGDLITQRH DLELARDRFA
QIGERWGLSA TLAALATMAM ADGDLPTAMR MQDEALGLLR EINAADDAAQ VQMMRAFALA
RTGALDEAQT LMTSILDSGR RTRSHPSILM AYVGLADIAR QKGEPEAAWG YLQASDDIIR
NHWHGPPQLL AMREVAAALL HLLATDPDAT KKAHARLQEA YRLGSAAHDM PVLSRIAIVV
ACYTNATGDP ASAARALGTA VSLRGGIDRG DPDRAAVTDS VREQLGEAKY EAEFAVGHAF
TRLEGLAFLG ECLGIEAAGA