Gene Caci_2066 details


Gene Information

Locus tag: Caci_2066
Symbol:
ID: 8333410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 2338076
End bp: 2341252
Gene Length: 3177 bp
Protein Length: 1058 aa
Translation table: 11
GC content: 74%
IMG OID: 644955215
Product: transcriptional regulator, winged helix family
Protein accession: YP_003112826
Protein GI: 256391262
COG category: [R] General function prediction only
COG ID: [COG3903] Predicted ATPase
TIGRFAM ID:
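
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # both end points are counted
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # the last codon is the stop codon
    return gene_length, protein_length


print(cds_lengths(2338076, 2341252))  # (3177, 1058)
```

The result matches the Gene Length (3177 bp) and Protein Length (1058 aa) fields of the record.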


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGATCG CGATGCTGGG ACCTTTGGAG GTTCGCGCCG ACGGCGGCGG CTTGGCCGAC 
GTCCCCGGCG CCCGCCTGCG CGCAGTGCTG ATCGCGCTCG CGCTGCGGCC GAACCAGGTG
GTCCCCAAGG CCTCGCTGGT CGACTGGATC TGGGGCGAGA ATCCGCCCGC CGACGCCGCC
AACGCCTTGC AGCGCCTGGT CTCCCGGCTG CGCAAGGCGT TGCCGGACAC GGCGATCGAC
GGGCTCACCG ACGGTTACCG GCTCACCGTG GATCCCGACG CCGTGGACGC GGTGCGCTTC
GAGCGCCTGG TGAGCGCGAG TCAGGCGGCC GGTCAGGACG TCTCGGAGCG GGCGCGGCTG
CTGCGTGAGG CTTTCGCGCT GTGGCGCGGC GCGGCGATGC AGGACGTGGG CTTGCAGGAC
AGCGACACCT TCGACGCGGT GGTCACGCGG CTGGAAGGGC TGCGCCTGAC CGCCGGGGAG
GAGCGTTTCG ACGCCGAGCT GACGCTCGGG CGCGGCGCGG AACTGGTGAC GGAGCTGACC
GATCTGGTCG CCGCGCACCC GACGCGGGAG CGGCTGGTCT CGGCGCTGAT GCGCGCGCTC
AACGCCGCCG GCCGCGACAG CGAGGCGCTG CAGGTGTATC AGCGCACGAG GGAGGCGCTG
GCCGAGGAAC TCGGCGTCGA TCCCTCGCCG GAGCTCTCGG CGCTGCACGT CGCGCTGCTG
CGGGGCGAGC TCGGCGGGAA GCGGGAGGAA ACCCGCAAGA CCAACCTGCG CTCGGAGCTG
ACCAGCTTCG TCGGCCGGGA GGCGGAGGTG GCCGCGGTCG GCGAGCTCGT CGGCGAGCAG
CGGCTGACCA CGGTGATCGG TCCGGGCGGT GCGGGCAAGA CCCGGCTGGC CGTGGAGACC
GCGCGCACGG CGCTCGGCGG GCTGCCGGAC GGCGCGTGGC TGGTGGAGCT GGCCGCGATC
GGCGCGGACG GGGACGTGGC GCAGGCCGCG CTGTCCAGTC TGGGTCTGCG CGACACGCTG
CTCGGCGAGG CTCTGCCGTT CGAAAACCGG AACGCCGAGC CGACGGACCG CTTCGTCGCC
GCGATGCGCG AGCGGCGGGC GCTGATCGTC CTGGACAACT GCGAGCACGT CATCGAGTCC
GCTGCGCTGT TCGCGCACCG GGTGCTCGGC GAGTGCCGAG GGCTGCGGAT CCTGGCGACC
AGCCGGGAGC CGCTGGGCAT CACCGGCGAG ACGTTGTGGC CGGTGACGCC GCTGGCGCTG
CCGAAGGAGG ACGCGGCGCC GGAGGAGATC GCGGCGTCCC CGGCCGTGCG GCTGCTGCGG
GACCGCGCGC AGGCGGTGCG CCGGGACCTG AATGTGGACG CACAGCAGCT GGCGACGATG
GCCCGGGTGT GCCGGGCACT CGACGGGATG CCGCTGGCGA TCGAGCTGGC GGCGGCGCGG
TTGCGGACCA TGTCCGTGGA GCAGCTGGCG TCCCGCCTGG ACGACCGGTT CCGGCTGCTC
ACCGGCGGCA GCCGCATCGC GCTGCCCCGG CACCGGACGC TGCGCGCGGT CGTCGACTGG
AGCTGGGAGC TGCTGTCCGA CGACGAGCGC AGGGTGCTGC GACGGCTCTC GGTGTTCGCC
GGCGGCGCCG GTCTGGAGGC CGCCGAGCAC GTCTGCGCCG GGGACGAGAT CGAGCCGGAT
CTGGTGCTGG AGCTGCTGAC CGCGCTGACC GAGAAGTCGC TGCTGGTCGC CGACGGGATG
GACGAGGACG CCACGCGGTT CCGGATGAGC GGCACGATCC GGGAGTACGC CGCGCAACGG
CTCGCCGAGG CCGGCGAGCA GGACGCGGCC CGGCGGGCGC ACCTGGACTT CTTCACCCTG
CTGACCGAGA CCGCCGAGCC GCGCCTGCGG CTGCGCGAGC AGGTCGCGTG GCTGGCGGTG
TTGCAGGCCG AGCACGACAA CATCGCCGCG GCGGCGCGCG GGGCGCTCGC GGCCGGCGAG
GCGCAGTCGG CGATGCGGCT CGCGGCCGCC TCGGGCTGGT ACTGGTGGCT GGCCGGCTAT
CGGACCGAGG GTCTGGAGCT GCTGCTGGCG GCGACCAAGG TGCCCGGCGA GACCACCGCC
GAGGTCCGGG CCGTGGTCTA CGCGCTGATC GTGATGTTCG CGACCTCCGG GCGAGGCGAC
GAGCACTCCG CCGAGGAGTG GATCCACAAG GCGTACGAGG CGAGTCAGCA CACTGAGCCC
GGGACCCGCC GCCACCCCGG CATGGCTCTG GTGGAGCCGC TGGAGCGCAT GCTGCGGGCG
CCGCAGGACG CGTTGACGGC GTTCGAACCG CAACTGGACG ACGACGATCC CTGGGTCCGC
GCGCTCGCTC GGGTCCAGAC CGGCAAGATC CGCGTCATGT TCGGCCAGGG CAGCCTGGAC
GCGGACGAGC ATCTGGAGAA GGCGCTGGCG GAGTTCACAG CGCTCGGCGA GCGGTTCGGG
ACCTCCCTGG CGCTGACCGA GCTGGCCGAC CGGATCGCCG TGCGCGGGGA GTTCGCCAGA
GCCGCCGAGC TGTATGAGCG GGCTGTCGTG GTCATCGCCG AGGTCGGCGC GCCCGAGGAG
GTCATCCGGA TGCGGGCGCG GCAGGCGCTG ATGCTGTGGA TGGCCGGGGA CCGGGACGGC
GCGGCGGCGG CGATCGCCGA CGGCGAGCGG AGCGCGGAGC GGGTGACCTG GCCCGGAGCG
CTGGCGGAGT TGGCGTTGGC GAAGGTGGAC CTGGCGCGCT GGGCCGGCGA CGCCGCGGAG
TCCCGGCGTC AGATCGCCGT CGCGACGACC CTGCTCGGCG AGGAGGCGGA GCAGACGCAC
TACCGCGCGA TGCGGTTGGA CATGCTGGCG CGGGTCACCG ACGACCTGGA AGAAGCGCGG
GCGCATAGCG CGGCGGCCTT GGCGGCGGCG AGCGAGGTGG GGCTCCCGCT GCTGCTCGCC
ATGGCGGTCG TCGGGGTCGC GGACCTGGCG CTGCGCCAGG AGCAGCCCGA GCAGGCCGCG
CGGCTGCTCG GCGCGGCCGC CGCCTTGCGC GGACTGCCCG ACAGCGCGCA CCCGGACGTC
GCGCACATCG AGACGCAGAC GCGAAACCGC CTCGGCGACA CGAGGTTCGC CGAGGCGGTT
CTGGAGGGGA CGCGGACCGA GTTGTCCGAG CTCACCCAGG TCACGCTCGC TTCTTGA
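
The GC content reported for this gene is the fraction of G and C bases over the whole gene sequence. A minimal sketch of that calculation follows; the function name `gc_fraction` is hypothetical, and the example runs on the first ten bases only, so its result is not expected to equal the 74% reported for the full gene.

```python
def gc_fraction(dna):
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First ten bases of the gene sequence above.
first_block = "GTGCAGATCG"
print(f"{gc_fraction(first_block):.0%}")  # 60%
```

Passing the complete gene sequence, with or without the spaces between ten-base groups, gives the whole-gene value.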
 
Protein sequence
MQIAMLGPLE VRADGGGLAD VPGARLRAVL IALALRPNQV VPKASLVDWI WGENPPADAA 
NALQRLVSRL RKALPDTAID GLTDGYRLTV DPDAVDAVRF ERLVSASQAA GQDVSERARL
LREAFALWRG AAMQDVGLQD SDTFDAVVTR LEGLRLTAGE ERFDAELTLG RGAELVTELT
DLVAAHPTRE RLVSALMRAL NAAGRDSEAL QVYQRTREAL AEELGVDPSP ELSALHVALL
RGELGGKREE TRKTNLRSEL TSFVGREAEV AAVGELVGEQ RLTTVIGPGG AGKTRLAVET
ARTALGGLPD GAWLVELAAI GADGDVAQAA LSSLGLRDTL LGEALPFENR NAEPTDRFVA
AMRERRALIV LDNCEHVIES AALFAHRVLG ECRGLRILAT SREPLGITGE TLWPVTPLAL
PKEDAAPEEI AASPAVRLLR DRAQAVRRDL NVDAQQLATM ARVCRALDGM PLAIELAAAR
LRTMSVEQLA SRLDDRFRLL TGGSRIALPR HRTLRAVVDW SWELLSDDER RVLRRLSVFA
GGAGLEAAEH VCAGDEIEPD LVLELLTALT EKSLLVADGM DEDATRFRMS GTIREYAAQR
LAEAGEQDAA RRAHLDFFTL LTETAEPRLR LREQVAWLAV LQAEHDNIAA AARGALAAGE
AQSAMRLAAA SGWYWWLAGY RTEGLELLLA ATKVPGETTA EVRAVVYALI VMFATSGRGD
EHSAEEWIHK AYEASQHTEP GTRRHPGMAL VEPLERMLRA PQDALTAFEP QLDDDDPWVR
ALARVQTGKI RVMFGQGSLD ADEHLEKALA EFTALGERFG TSLALTELAD RIAVRGEFAR
AAELYERAVV VIAEVGAPEE VIRMRARQAL MLWMAGDRDG AAAAIADGER SAERVTWPGA
LAELALAKVD LARWAGDAAE SRRQIAVATT LLGEEAEQTH YRAMRLDMLA RVTDDLEEAR
AHSAAALAAA SEVGLPLLLA MAVVGVADLA LRQEQPEQAA RLLGAAAALR GLPDSAHPDV
AHIETQTRNR LGDTRFAEAV LEGTRTELSE LTQVTLAS
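
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). That table uses the standard amino acid assignments but accepts alternative start codons such as GTG, which are read as methionine at the first position. The sketch below illustrates this on the first line of the gene sequence; `translate_cds` is a hypothetical helper, not a database tool, and it assumes the input starts in frame at the initiation codon.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G);
# table 11 shares them and differs only in its set of start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # any initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "GTGCAGATCG CGATGCTGGG ACCTTTGGAG GTTCGCGCCG ACGGCGGCGG CTTGGCCGAC"
print(translate_cds(first_line))  # MQIAMLGPLEVRADGGGLAD
```

The output agrees with the first twenty residues of the protein sequence, including the leading M encoded by the GTG start codon.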