Gene Caci_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3601 
Symbol 
ID8334954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4020795 
End bp4021871 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content70% 
IMG OID644956743 
Producttranscriptional regulator, LacI family 
Protein accessionYP_003114346 
Protein GI256392782 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0229751 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATCG GAGAGATCGC GCGTCGTGCG GGGGTGGCGC GCAGCACCGT GTCGTATGCG 
CTCAGTGGGA AACGGCCGGT CTCGGTGGAG ACGCGGCGGC GGATTCAGCA GGTCATTGAC
GAGCTGGACT ATCGCCCCAA TGCTTCGGCG CGCGCGTTGA AGGAGGGGCG GACTCGCACG
GTCGGGCTGG TGATCCCGCC GGCCGGGCCG CGGTTGACGT CCATGCAGTT GGATTTCGTC
GGCAGCGTCG TGGAGGCTGC GGCGCGGGTC GATCTTGATG TGCTGCTCTC GCCGTCCGGT
GGCGACCGTG ACCGCTCGCT CGAGCGTCTG ATCAGCGGTC GGCGGGTGGA TGGCGTGATT
TTGATGGAGA TCCTCATGGA GGATTCCCGG GTGGCCAGGG TCGCGCAGAG CGGGGTGCCG
TTCGTGACGA TCGGACGCGT CCGCGACCCT GACGCGACGT GGTGGGTGGA CATGGATTAC
GCGGCGCTGG TCGGCCGCTG CGTCGATCAT CTCGCCGACC TCGGTCACCG GCATGTCGCG
CTCGTCAACC GCTCGCCCGC GCTGATGGCC GCCGGCTACA GTCCCGGCCA TCGCGCCCGC
GACGGGTTCG CCGAGGCGGT CGCTCGGCGC GGAGTCAGCG GATCAGAGTT CTGCTGTGAG
GACGACGTCG CCGGTGGCGA GCGCTGCGTG GCGGAGATAC TGGCGACGCG CCCCAGCACG
ACCGCGATCG TCACTGTGAA TGAGGCGGCG CTGCCGGGTG TGCAGCGCGC GCTGGAGCGC
GCGGGCCATG AGATCCCGAG GACGTTCTCC GTCGCCGGGA TCGCCGCACA TCAGTGGGCT
GAGGAGTTCC ATCCGCCGCT CACGGCTGCC GACGTCCCGT CGCTGGCGAT GGGTACCGTC
GCGGTGGAAC TGCTCGCCGA GCACATCGCC GATCCGGACG CGCCCGCCGG GCATCGCCTG
TTCATGCCGC CGATCTCGTT GCGGGACAGC ACCGGATCGG TGCCGCGGCC TTCGGCGCGG
CCGGCGTTTT TCAGTCCGGC CGCGCCGAGG CGGTTCAGGG AAGCAGACGC GGATTGA
 
Protein sequence
MDIGEIARRA GVARSTVSYA LSGKRPVSVE TRRRIQQVID ELDYRPNASA RALKEGRTRT 
VGLVIPPAGP RLTSMQLDFV GSVVEAAARV DLDVLLSPSG GDRDRSLERL ISGRRVDGVI
LMEILMEDSR VARVAQSGVP FVTIGRVRDP DATWWVDMDY AALVGRCVDH LADLGHRHVA
LVNRSPALMA AGYSPGHRAR DGFAEAVARR GVSGSEFCCE DDVAGGERCV AEILATRPST
TAIVTVNEAA LPGVQRALER AGHEIPRTFS VAGIAAHQWA EEFHPPLTAA DVPSLAMGTV
AVELLAEHIA DPDAPAGHRL FMPPISLRDS TGSVPRPSAR PAFFSPAAPR RFREADAD