Gene Caci_0204 details

Gene Information

Locus tag: Caci_0204
Symbol:
ID: 8331530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 220680
End bp: 222020
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 71%
IMG OID: 644953372
Product: histidine kinase
Protein accession: YP_003111000
Protein GI: 256389436
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
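
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that Protein Length excludes the terminal stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 220680
end_bp = 222020
gene_length_bp = 1341
protein_length_aa = 446

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the reported protein length does not count the stop codon.
residues = codons - 1

print("span (bp):", span_bp)
print("codons:", codons, "remainder:", remainder)
print("residues excluding stop:", residues)
print("matches record:", span_bp == gene_length_bp and residues == protein_length_aa)
```

Under these assumptions the span works out to 1341 bp, or 447 codons, which is consistent with a 446 aa product plus one stop codon.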


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCGGCG GGCCCAAGAT CGGCGAAGCT TTTCCCATGA AGCGTTTCCT GTTGAGCCCG 
GTGTCGGGGC GCACCGCACG CCAGTACCTG TTCGCCCTGG TCAGCGCGCC GCTCGGACTC
GCGGGCTTCG TGTACGCGGT GGTCACGCTG GCCGTCGGCG GCGCGCTGTC GTTCACCTTC
GTCGGGCTGC CGCTGATCGC CCTCGCGATC CACCTGGCCC GGCAGTACGC CAAACTCCAG
CGCCGTATGG CCCGCGGCCT GCTCGGCGTC GACATCGAGG CGCCGCCGCC GAACCTGCGC
CGGCAGAACG CGCACGGGCT GCTGGCCAAG CTCGGGGCGG CGCTGGGCGA CCTGCCGGCG
TGGCGGTCCC AGCTGTATCT GCTGGTGCGC TTCCCGCTGG GTATCGCCTA CTTCGTCACG
CTCGGGGTCG GCGTGGTCGA GGGCCTGGTC ATGCTGACCT ATCCGATCTG GTGGGCGGTG
TTCCGCCCGA CGAACACCGA CAGCCACGGG GTCAAGCACC AGTCGGCGCT CCAGTTCGGC
GACGGCTTCT ACTTCGACAA CTGGCTGCGC GCACTGCTGC TCACGGTGGC CGGCGTGGTC
TGGATCTACG CCGCGGTCTG GATCCTCAAG TTCCTGCTCT GGCTGGACTC GCTGCTGATG
AAGGCCCTGC TGGGCCCGAC CAACAGCGAG CGCCGGGTCG AGGAGCTGAC CGTGAGCCGC
GCGCACGCCG TGGACGACTC CGCCGCGCGG CTGCGCCGGA TCGAACGCGA TCTGCACGAC
GGCGCGCAGG CGCAGCTGGT CGCACTGGCG ATGCAGCTCG GCGAGGCCAA GGAGAACCTG
GACGCCGGCG GCAACGGCGC CGAACTGGAC CTGACCGAGA CCCGAACGCT GATCGACACC
GCGCACCGCA ACGCCAAGCA GGCCATCAAC GAACTGCGCG ACCTGGCCCG CGGCATCCAC
CCGGCGGCAC TGGACACCGG CCTGCGCGAC GCCCTGGGCA CACTCGCGGC GCGCTCAGCG
ATGCCGGTGA CGGTGAACGT GGCGCTGGCC GAGCGCCCGG ACCGAGCGAT CGAGACCATC
GCCTACTTCT GCGCCGCCGA GCTGCTGACC AACGCGGCCA AGCACGCCGC CCCGACCCGA
GCCGCGCTGT CAGTGGTGCA GGAGGAGGGG CAACTGCACC TCACCGTCGA GGACGACGGC
CGCGGCGGCG CACAGGTCGG CTACGGCGGC GGCCTGGCCG GCCTCCTGGA CCGCGTGCGC
ACCGTGGAGG GCTCGCTGGC GGTGGACAGC CCGCAGGGCG GACCGACGCG GGTGTTCGTG
AAGCTGCCGA TGCACGTGTG A
 
Protein sequence
MGGGPKIGEA FPMKRFLLSP VSGRTARQYL FALVSAPLGL AGFVYAVVTL AVGGALSFTF 
VGLPLIALAI HLARQYAKLQ RRMARGLLGV DIEAPPPNLR RQNAHGLLAK LGAALGDLPA
WRSQLYLLVR FPLGIAYFVT LGVGVVEGLV MLTYPIWWAV FRPTNTDSHG VKHQSALQFG
DGFYFDNWLR ALLLTVAGVV WIYAAVWILK FLLWLDSLLM KALLGPTNSE RRVEELTVSR
AHAVDDSAAR LRRIERDLHD GAQAQLVALA MQLGEAKENL DAGGNGAELD LTETRTLIDT
AHRNAKQAIN ELRDLARGIH PAALDTGLRD ALGTLAARSA MPVTVNVALA ERPDRAIETI
AYFCAAELLT NAAKHAAPTR AALSVVQEEG QLHLTVEDDG RGGAQVGYGG GLAGLLDRVR
TVEGSLAVDS PQGGPTRVFV KLPMHV
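
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before their length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that clean-up, not the database's own method; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # so spaces between blocks and line breaks are both removed.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence shown above (sample input only).
first_line = "GTGGGCGGCG GGCCCAAGAT CGGCGAAGCT TTTCCCATGA AGCGTTTCCT GTTGAGCCCG"

seq = clean_sequence(first_line)
print("bases:", len(seq))
print("GC fraction:", round(gc_fraction(seq), 2))
```

Passing the complete gene sequence through the same two functions should give a length and GC percentage that can be compared with the values listed under Gene Information.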