Gene Caci_4657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4657 
Symbol 
ID8336011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5299570 
End bp5302800 
Gene Length3231 bp 
Protein Length1076 aa 
Translation table11 
GC content73% 
IMG OID644957757 
Producttranscriptional regulator, winged helix family 
Protein accessionYP_003115359 
Protein GI256393795 
COG category[R] General function prediction only 
COG ID[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTATCG CGCTAAGGCT CCTGGACGAT GTGCGCTGGC ACGGCGTCGC CGTCGTGGGC 
GAGCGCCCGC AAGCGCTGCT CGCGGCCCTG GCCGCACGTC AGGGCCGAGC CGTCGGGGCG
GCGGAGCTCA TCGAAGCCGT CTGGGGCGAT GCCGCGCCGA GCAACGGTGT GAAAAGCCTG
CAAGTGCTGG TGTCCCGGAC CCGCAGCGCC TGCGGTCCGG AGGTGATCGT CCGCGACGGC
ACCGGATACC GGCTCGGCGT CGGGCCCGGC GAGGTCGACA GCACCGAACT CAGCCGGCTG
GTGCGTGAGG CCACCGCCGC ACTGGACACC GACGCCGCAC GCGCGGCGGA GCTGGCACAA
CAGGCACTCA CCGTCGTCGG AGGGCAGGAT GCGGCCGACA GCGCGGCGGC AGCCACCGGC
GCCGTCATCC TCACTAGCGC CAGTGAGGAC GACGGGCCGC TCGCCGAGAT CCGTCACACC
GCGGCCGCAC AGCTCGCCGC CGCCCGCACC GTCGCGGCGC GCGCCGCCAG CCGCACCGGC
GCCCACGCCC AGGCGCTGCC GCAGCTGGAA TCAGCACACG CCGACGCCGC GCACGACGAG
TCGCTGCTCG CTGATCTGCT GCGCAGCGAA GCCGCGGTGC GGGGTCCGGC GGCGGCGCTG
GAGCGGTTCG AACGGCACCG CCGCGACCTG CGGGAGCGGT TCGGTTCCGA CCCTGGCGAG
GTGCTGCAGC GGGTGCAGCG CGCTCTGTTG GCGCTGGACC GGCCGGTACG GCACGGTCTG
CGGTATGACG CGACCGAGCT GATCGGCCGT GACGCCGACG TGGCGCGGCT GCGCACGGCG
CTGGCCGCCT CGCGCGTGGT GTCCATCGTC GGGCCCGGAG GGCTCGGCAA GACCCGGCTG
GCGCAGGTCA TGGCGCGCCA AGCTGAGCAG CCGGCGGTGT ATTTCGTCGA ACTCGCCGGT
GTCCTGCCCG ATGAGGACCT GGCCGTGGAG GTCGGATCGG TGTTGGGGGT GCGGGACTCG
GTCAGCGGCC GCGCAGCCCT GACACCGAGG CAGCGCGCTG ACTTCACAGC ACGGATCGCC
CAACAGCTCG CGCAAGTGCC GGGCTTGCTG GTGCTGGACA ACTGCGAGCA TCTCATCGGC
GCGGTCGCGG ACCTGGTGGC GTTCCTCATC GCCGCCACCC CGGACCTGCG GGTGCTCACC
ACCAGCCGCG CGCCGCTGGC TATCGCCGCC GAACGCCTCT ACCCCCTGGG CGAGCTCGGC
ACGGCCGACG GGGTTCAGCT GTTCGCCGAA CGGGCTCTGG CCGCCCGGCC CGACGCGCGC
CTGGATGCCG ACGTCGTCGC CGGGATCGTC CGCCGGCTCG ACGGGCTGCC GCTGGCCATC
GAACTGGCCG CCGCGAAAGT CCGGGTCATG GCCGTGGAGG ACATCGACCG GCGGCTGGCC
GACCGGTTCG CGCTGCTGCG CGGCGGCGAC CGCAGCGCCC CGGACCGGCA CCAGGGGTTG
CTGACCGTCA TCGAATGGTC CTGGAACCTG CTGGGCGAGG CTGAAAAACA AGCCCTGCGG
CGTCTGGCGC TGTTCCACGA CGGATTCACC CCGCAAGCCG CCGAAGCAGT CCTGCCGCCG
GCCGTCTTCG ACGCCGTCCC CGGCTTGGTC GACCAGTCGC TGGTCACGGT GCGGGAAAGC
GCGGCCGGTA TCAGGTACCG GATGCTGGAG ACGGTGCGCG AGTTCGGCCG GATGCAGCTG
GCCGCCGCCG GGGAGCAGGA CCAGGCGCGC GCCGCGCAGC GCCGGTGGGC TGTGCGGTAT
GTCTCGGCGC AGCGGCTCGC CTTCTCCGGT CCCGGGCAGT TCGCCGCCAT CGACGCGGTC
AGCGTGGAGG AGACGAACCT CGCTGACGAG CTGCGCGGCG CGATCGCCGA GGGGGACCGG
GAATCGCTGG TGACGCTGCT GGCCGGGCTC GGGACGATGT GGATGATGCG CGGTGAGCAT
TTCCGGCTGC TGGTCCTGGG CGGCGCGGTC CGGGACGCGT TGAGCGACTG GACACCGCCG
CCGCACCTGG CCGACGCCAC CCGCGCCGCG GTGGCGATCA CGTTGAACAA CGCCCTGGCA
CTGTCCGGGG ATCACGGCGC GGATCTGTTC GCGCTGCTGC ACCGCCTGGG CCCGGCCAGC
GGCGAGGACA TCTACCTGGC GGCGCTGGTT GAGGTGCTGC TGACCTGCGC ACCGGCGGTC
GACGCGGCGT TCCCGGCACG GCTTTGCGCC CTCGCCGCCG GAGCCGACCG GCACACCGCC
GGCGTGGCCA GCTTGTGGCT GAGCCGCCAT TTCGAGAACG AAGGAGACCT GCCGGCGGCG
CTCGCCGCCG CCGAGCGGGT GCTGGCGCTG GCCGAATCCG ACGCCGCCGA CGGCGTGCAG
GCCGGGCCCT GGACGACGGC GATGCCGCAC GCGCTGCTGG CCGAGCTGAC CATGCAGCTC
GGCGACAGCA CTGCTGCGAT CACGCACGCG AAAGCGGCGC TGCCGGTGGT CGAGCGGCTC
GGCGCCAACG ACGACGAAGC GCAGCTGCGG GCCCTGCTGG TGCTGTGTGA CATCGGTGCC
GGAAGGCTGG CCCGCGCCGC CGAACAGCTG GGCCGGATCG ACAGCATCGA GATGCGCGCC
GCATCTTTCG GCACCGACGT CTTCCGGCAC ATCTGCCGCG CCGAGCTGCT GCTCGCCTCC
GGGCAGATCG CCGACGGGCT GCGGCTGTAT CGGGAAAGCT CAGCCCGGAT GCGCCAGAGG
CAGGTTCCTG AGGCCATGCG CACCGGCACC GAGCTGTGGA GCTACTTCGG CGACGCGCTG
GTACTCAACG CGCACGCCTG GTACGCCGCC GACGCCGAGG AGCTGGCCTG CGGACAGGCG
AAGTTCACGG CGTGCCGTGC CAGTGCCCTG AAAGTGCTGA CCGCCGACAA CGAGCGCCTG
GACTACCCCG CCGCCGGGCT GCTGCTGTTC GCGTTGGGCG CGTGGGCGCT GCTGCGCAAA
GCCGCCGCCG CCACCGACGC GGTGCGGCTG CTGGCGCTGG CGCAGCGGTT CGCCTACAAC
AGGATGCTGC CGACGATGGC CTGGGACCGG ATCCTCGCCG CCGCGAAGGA GGCGCTGCCC
GGCGCTCTGG AGCAGATGAG CGCCGAATAC GCCGATCGCC GCCCGCCGGA CCTGCTGGAG
CAGGCCCGGG CGGCGGCGCG GCGGCTGCCT GAGGAATCAG AGGGCAGCTG A
 
Protein sequence
MSIALRLLDD VRWHGVAVVG ERPQALLAAL AARQGRAVGA AELIEAVWGD AAPSNGVKSL 
QVLVSRTRSA CGPEVIVRDG TGYRLGVGPG EVDSTELSRL VREATAALDT DAARAAELAQ
QALTVVGGQD AADSAAAATG AVILTSASED DGPLAEIRHT AAAQLAAART VAARAASRTG
AHAQALPQLE SAHADAAHDE SLLADLLRSE AAVRGPAAAL ERFERHRRDL RERFGSDPGE
VLQRVQRALL ALDRPVRHGL RYDATELIGR DADVARLRTA LAASRVVSIV GPGGLGKTRL
AQVMARQAEQ PAVYFVELAG VLPDEDLAVE VGSVLGVRDS VSGRAALTPR QRADFTARIA
QQLAQVPGLL VLDNCEHLIG AVADLVAFLI AATPDLRVLT TSRAPLAIAA ERLYPLGELG
TADGVQLFAE RALAARPDAR LDADVVAGIV RRLDGLPLAI ELAAAKVRVM AVEDIDRRLA
DRFALLRGGD RSAPDRHQGL LTVIEWSWNL LGEAEKQALR RLALFHDGFT PQAAEAVLPP
AVFDAVPGLV DQSLVTVRES AAGIRYRMLE TVREFGRMQL AAAGEQDQAR AAQRRWAVRY
VSAQRLAFSG PGQFAAIDAV SVEETNLADE LRGAIAEGDR ESLVTLLAGL GTMWMMRGEH
FRLLVLGGAV RDALSDWTPP PHLADATRAA VAITLNNALA LSGDHGADLF ALLHRLGPAS
GEDIYLAALV EVLLTCAPAV DAAFPARLCA LAAGADRHTA GVASLWLSRH FENEGDLPAA
LAAAERVLAL AESDAADGVQ AGPWTTAMPH ALLAELTMQL GDSTAAITHA KAALPVVERL
GANDDEAQLR ALLVLCDIGA GRLARAAEQL GRIDSIEMRA ASFGTDVFRH ICRAELLLAS
GQIADGLRLY RESSARMRQR QVPEAMRTGT ELWSYFGDAL VLNAHAWYAA DAEELACGQA
KFTACRASAL KVLTADNERL DYPAAGLLLF ALGAWALLRK AAAATDAVRL LALAQRFAYN
RMLPTMAWDR ILAAAKEALP GALEQMSAEY ADRRPPDLLE QARAAARRLP EESEGS