Gene Caci_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1033 
Symbol 
ID8332368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1170399 
End bp1171610 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content69% 
IMG OID644954181 
Producthistidine kinase 
Protein accessionYP_003111800 
Protein GI256390236 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0113351 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTTA CGTCGTCGGC GCGATCCTCA GGACACAGGC TCGCGAATCC GGGAAGAGTC 
TCGAAGCTGT CCATCCGGGC CCTACTGGCT CTCGTCCAGG GCGCGCTCGC GGTGCTCGCC
TGCGTGCTTC TCCTAGCGAT CACCTATGTC TTGTTCGATC GGCAACTGCC GTCGAACCCG
GTGCTCAAGG CAGACGTCAC CATCGCCGTT CAGGATCCGG TGCCTCGGCC GGATCCCTCA
CTGGGCCGCC AGGCGGGATC GTTCCTGTCG AAGGACGCGG TGCTCGATCA GCTGCTCGTC
CAAGGCGGGC TCGCCCTGGG CGCGGTCGCC GTCGCCGCCA CCGGCCTGGC CTGGCTCACG
GCCGGACGCA TGCTGCGTCC CCTCCACCGG ATCACCGCCA CGGCCGGCCG GATCGCGGGC
GCTCCCGCCG CCGACCGCGG ACTGCACGAA CGCATCGCCC TGAACGGCCC GGCCGACGAG
GTCAAGGAAC TCGCCGACAC CTTCGATCTC ATGCTGTCCC GCCTCGACCA GTCCTTCGAC
AGCCAGCGGC GGTTCATCGG CAACGCCTCG CACGAACTGC GTACTCCGCT CGGCCTCAAC
CAGGCGCTGA TCGAACTCGC TCTTCAACGC ACTGATACGA CGCCTGAGAT GCGCCGCCTC
GGGGAGACGC TCCTCGAGGT CAACTCCCGC CACGAGCGGC TCATCGACGG CCTGCTGGTG
CTCGCCAGAT CGGAGGGCGA GCCGGGGCAG GGCTCCTTCG TGGACCTCGC GGACATCGCC
GAGCACGTCG TGGAGCAGAC TCCCGCAGGT GACGTCGAAG TCACGTGCGC GGTCGAGGAG
GCACCCGCGA TCGGGAATCC GGTCCTGCTG GAGCGACTCG TCCAGAACCT CGTCGAGAAC
GGCATCCGGT ACAACATCCG GCACCGCGGC TGGGTCCGCG TCACCACCGG CACCGCAGCG
GACGGATCAA CCCGGCTCCA GGTGAGCAAC ACCGGCCCGA TCGTTCCCCG GCACGAGATC
CCGACCCTGT TCGAGCCCTT CCGCCGCCTC GGCGGCGAAC GGCCGTCCGG AAGGTCTCAC
GGCACATCAG GCGCGGGTCT GGGCCTGTCG ATCGTCCGCG CCGTCTCCCG CGCGCACGGC
GGCGACGTCC AAGCCGAGCC TCGGGACGGC GGAGGTCTGA TCGTCACGGT GACTCTGCCG
CGAGCGGAGT AG
 
Protein sequence
MSLTSSARSS GHRLANPGRV SKLSIRALLA LVQGALAVLA CVLLLAITYV LFDRQLPSNP 
VLKADVTIAV QDPVPRPDPS LGRQAGSFLS KDAVLDQLLV QGGLALGAVA VAATGLAWLT
AGRMLRPLHR ITATAGRIAG APAADRGLHE RIALNGPADE VKELADTFDL MLSRLDQSFD
SQRRFIGNAS HELRTPLGLN QALIELALQR TDTTPEMRRL GETLLEVNSR HERLIDGLLV
LARSEGEPGQ GSFVDLADIA EHVVEQTPAG DVEVTCAVEE APAIGNPVLL ERLVQNLVEN
GIRYNIRHRG WVRVTTGTAA DGSTRLQVSN TGPIVPRHEI PTLFEPFRRL GGERPSGRSH
GTSGAGLGLS IVRAVSRAHG GDVQAEPRDG GGLIVTVTLP RAE