Gene Caci_0228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0228 
Symbol 
ID8331554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp252889 
End bp255201 
Gene Length2313 bp 
Protein Length770 aa 
Translation table11 
GC content68% 
IMG OID644953394 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003111022 
Protein GI256389458 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00256597 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.277855 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCCCG AATGGGACAG GTACGCCCAA CGTCTCAAAG CCTTGCGGAG CGAACAGGGG 
CTGAGCCAGC GGCGTCTGGC GCGGGCTCTA CATGTGGACC ATAGCGTGAT CTCGCGTGCG
GAGAGTGCCG TCCGTCGTCC CGATCCCGAC CTCGCGGCCT CGATCGACCT GCTGCTGAGC
ACGGGTGGCG AGTTGCAGCG GTTGGCTCGG GACGCGGTGC GAAAGCAGGC GTCGGCCGAC
TCACTATCGA CGCGTGGGGG TCCTTCTGGT GCGGTCCCGC GGTTGCGATC GTCTATACCG
CCCGACTCCG TGCACTTCAC TGGTAGGAAC GCGGAGTTTG ACCAGGTGCG ATCAGAGCTC
GTCGAACGGC CCGACGCGGT CGGCGCCGCG AGCGTCTGCG TGATCAGCGC GCTGGCCGGT
GTGGGCAAGA CAGCGCTCGC CATCCACAGC GCGCACCGCC TCGCCGGCCG TTTTCCGGAC
GGATGTCTGT TCCTCGATCT GCGTGGCTTC ACTCCCGGAC ACGCGCCGCT GAGCAGCTTC
GAGGCGCTGG GAGCTCTGCT CGCCTTGCTG GATGTCCCGG TCGCCACCAT CCACAGCACC
GAGCCGGCGC GCTCGGCGCA GTTCCAGGCG GAGACCGCCG GCGCGCGGCT GCTGCTGGTG
CTAGACAACG CCGCTGACGC ACATCAGGTA CGACTGCTCT TGCCATCGGC GCCTGGCTGC
CGGGTATTGG TGACGAGCCG GAACCGCCTT GTGGCCCTTG ACGAGGCCGT GCATCTCGAC
CTGCGGCCCC TTGCCGAGGT TGATGCCGCC GCGCTGTTCC GCATGGTGTC GGCCGGCGGT
CGTGTCTCGC AGAGCACTGT CGACGGTATC GTCGCGCGCT GTGCCGGCGT CCCATTGGCG
TTGCGGATCG CGGCGGCACG CTGCGCTCCC GGCGGCGCGT TCGATCCGGA ACTGCTCGCC
GCCGAGCTGT CGCGCGCGGA GGACTTCATC GGGCAGCTGG ACGACGGGGA ACGCAGCGTG
CGTGCGGTGT TCGATGCCTC GTTCTCCTTG CTGCCCCGAG AGCTGCGGCA GGTGCTTGCT
CTGCTCGGCA CCCGTCTGCT GACCGTCTTC GACATTTGGG ACGTCGCAAT GCTCGCCGAC
CTCCCGCTCC TCGGCGCGGC TGGCGCGTTG GAACGGCTGC ACGCGGCGAG TCTGCTTGAA
GTGCGCGGCA CTGATCGTTT CGTCCTGCAC GATCTGGTCG CCGAGTACGC GATATCCGCC
GCCGGACAGA CGCTCGGCGC CGAGCAGCTG CGGGCCGCAA TCGGACGCTT GCTCGACGCC
TATCTGCGCG TCTGTGACAG GGCCGACTCA CAGGTCACGC CGCATCGGCA CCGCTTCCCG
CTCGCGGTGG CGGCAGCCGG AGCTGTGCCA GACCCGGACT TCAGTGACTA CTACGAGGCC
CTCGACTGGC AGACCAAGCA TCTCGACACG GCCGCGGCGC TGTGCGAAAC CGCCTACGAG
CACGGTTTCG ACGAACAGTG CTGGCAACTC GCCTACGCGC TGCGCGGGGT CTTCTTCCTC
ACCAAGCACT GGGAGCAGTG GGATCGTATC CAGCGCATCG CGCTGGCGGC GACAAGACGG
CTCCAAGACC CGCACGCCGA GTGCGTGACT CTGAACAACC TCGGCCTCAT CCTCGGTGAA
CGGGGCGAGA CCGACGCCGC GCACCGCTGC CTCAACGAGG CCGAAAGCGT GTGCCGCGCC
GCCCGTGATC GCTTCGGCGA GAACACCGCA CGCGCCCACA GGGCCTGGCT CTTCCACACG
GCCGGACGGC ATGGCGAGTC CCTGACCGAG CACGAGGGCG TCCTCGAGTT CTACACGCAG
ATCGACAGCC CTCGCAATGT GGCCATCGTG ATGCGCGACA TGGCCGCATC CGAAGCCGCT
CTAGGCCACA CCGACAGTGC CCTCGCGCAC TTGCACGCGG CCGAGCAGGC CTTTCGCAAA
TTGGATCTGA CGATGGACCT GGCCATGGCC CTGAACGACC TCGGCGAGAC CTACACAGCG
ATCATCGACC CGAAACGCGC CGCCGAGTAC TTCGAAGCAG CGCTGGCAGC CTGCGCGGAG
TCCGGCAGCG ATCATGAACG GGCTCGAGCG CACGCCGGCC TCGGCGCCCT GGCGGAGTCC
GCAGGCCGGC AGAACTGGGC GAACGCACAC TACCTTCAGG CCCTGAGGCT GTTCGATGCG
CTCGGCGCTC CGGCGGCGGA CGTGGTGCGC GCTCGGCTGC ACGCGTCGAC TGTCATGTTC
CCGACCACCG CTGAGGAGGA CGAGAACCGC TGA
 
Protein sequence
MRPEWDRYAQ RLKALRSEQG LSQRRLARAL HVDHSVISRA ESAVRRPDPD LAASIDLLLS 
TGGELQRLAR DAVRKQASAD SLSTRGGPSG AVPRLRSSIP PDSVHFTGRN AEFDQVRSEL
VERPDAVGAA SVCVISALAG VGKTALAIHS AHRLAGRFPD GCLFLDLRGF TPGHAPLSSF
EALGALLALL DVPVATIHST EPARSAQFQA ETAGARLLLV LDNAADAHQV RLLLPSAPGC
RVLVTSRNRL VALDEAVHLD LRPLAEVDAA ALFRMVSAGG RVSQSTVDGI VARCAGVPLA
LRIAAARCAP GGAFDPELLA AELSRAEDFI GQLDDGERSV RAVFDASFSL LPRELRQVLA
LLGTRLLTVF DIWDVAMLAD LPLLGAAGAL ERLHAASLLE VRGTDRFVLH DLVAEYAISA
AGQTLGAEQL RAAIGRLLDA YLRVCDRADS QVTPHRHRFP LAVAAAGAVP DPDFSDYYEA
LDWQTKHLDT AAALCETAYE HGFDEQCWQL AYALRGVFFL TKHWEQWDRI QRIALAATRR
LQDPHAECVT LNNLGLILGE RGETDAAHRC LNEAESVCRA ARDRFGENTA RAHRAWLFHT
AGRHGESLTE HEGVLEFYTQ IDSPRNVAIV MRDMAASEAA LGHTDSALAH LHAAEQAFRK
LDLTMDLAMA LNDLGETYTA IIDPKRAAEY FEAALAACAE SGSDHERARA HAGLGALAES
AGRQNWANAH YLQALRLFDA LGAPAADVVR ARLHASTVMF PTTAEEDENR