Gene Caul_4471 details

Gene Information

Locus tag: Caul_4471
Symbol:
ID: 5901932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4845369
End bp: 4847252
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 70%
IMG OID: 641564990
Product: integral membrane sensor hybrid histidine kinase
Protein accession: YP_001686089
Protein GI: 167648426
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase; [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
TIGRFAM ID:
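
The gene and protein lengths in this record follow from the start and end coordinates. A minimal sketch of that arithmetic is given here, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 4845369
END_BP = 4847252


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1884 627
```

Both values agree with the Gene Length and Protein Length fields of the record.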


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.611462
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACG ACAGCCGCAG CTCTGGAGCC GCGACTCACG CGCCGTTGCA GGCGCGGATA 
GCGGCTGTCG CGTTGACCAC CACGGTGGCC GCGCTGCTGG CGGCGTGCCT GTGCTTCATG
CTGCAGCAAT GGAGCGTCGC CCGTCAGGAA TCCCAGACCA GCCACGAAAT CCTGAGCCAG
ATGGTCGCCG CGGGCGCCGC CGCGCCGATG GCCGATCACA ACACCGTCGG CGCCTATCGC
GCCCTCGAGG CGGTGGCCAA GGCCCCCAAC GTCTCCGCCG TGCGCCTGAC GGACATGCAT
GGCCAGGTCG TCGCCCAGAT CGGCGCCGAG GGCGACGCCG CCGTCGACAC CCTGATCCGC
CCGATCAAGC TGGGGTCTCG CCAGGTCGGA ACCCTGGCCC TGATGGCCAA GCGGCCCAGC
CTGACCGACA TCATCGCCCG CGACCTGGCC CTGACCGGCA CCCTGTTCTT CGGCTCGGCC
GGCCTGGCCA TCCTGCTGGC CAACGGCCTG GCCCGGCGCA TGACCAAGCC GATGGAGCGC
CTGTCCCGGG CCATGCGCGA AGTCGCCGCC GACGGCCGCT TCAAGCCGGT GGAGCAGGCG
GCCGACGACG ACGTCTTCCA GAGCCTGACC GACAGTTTCA ATCACCTGGT CTCGCGCCTG
GAGATCAATG ACCACGACCT GCGCGGGGCG ATGGCCGAGC TGGTCAAGGC CCGCGACGAC
GCCAACGCCG CCAATGTCCT GAAGTCGCAC TTCCTGGCCA ATATGAGCCA CGAGATCCGC
ACGCCGCTGA ACGGCGTGCT GGCCATGACC GAGGTGATGG CCATGGGCGA GCTCAGCGCC
GCCCAGCGCG AACGCCTGTC GGTGATCCGC GAATCGGGCG ACTTGCTGCT GTCGGTGCTC
AACGACGTGC TCGACCTGTC GAAGATCGAA GCCGGCCGGC TGGACCTGGT CGAACGCGAC
TTCGACCTGG CCAGCCTGGC CCTGTCGATC CGCGAATCCT ACGCCACCCA GGCCCGCCAG
AAGAACCTGG AGTTTGGCGT CTTCGTCGCG CCAGAGGCCC TGGGTCCGTG GCGCGGCGAC
GCCGACCGCC TGCGCCAGAT CCTCGGCAAC CTGGTCTCAA ACGCCCTGAA GTTCACGCTG
GAGGGCTCGG TCTCGGTGCG GTTCGCCTCG GCCGACGACG GCAGCGGCCT GCGGATCGAC
GTGGTCGACA CCGGCATCGG CATCGCCGCC GAAAGCCTGC CGCGCCTGTT CGACAAGTTC
GTCCAGGCCG ACAGCTCGAC CACCCGCCGG TTCGGCGGCT CGGGCCTGGG CCTGTCGATC
TGCAACGAAT TGGCGGCCCT GATGGGCGGC GGCGTCCATG TCCAGAGCCG CGAGGGCCAG
GGCTCGACCT TCACCGTCGT GGTCGCCATG CCGCGCGGCG AGGCCACCGT CCACGTCCCG
GTCGAGGCGG TCCCGCCGCC GATCGAATCG GAGCGCCGGC TGCGCGTGCT GGCCGCCGAC
GACAATCCGA CCAACCAGAA GGTCATCGCC GCCGTCCTGG CCCCGCTGGG CGCCGAGGTC
GAACTGGTCG CCGACGGCGC GGCCTGCGTC GAGGCCTGGA AGCGCGGCGG CTTCGACATC
GTGCTGATGG ACATCCACAT GCCGGTGATG GACGGGGTCG AGGCCGCCCG CACCATCCGT
TCGCTCGAGG TCAGCGAGGG CCGCAAGCGC ATCCCGATCG TGGCGGTCAC CGCCAACGCC
CTGGTCCATC AGGTCGAGGG CTACATGGCC GCCGGCATGG ATGGCCATGT CGCCAAGCCG
ATCGAGGTGA CCAAACTCTA CGACGCGATC GAGACCGCCG TGGCGATCGC CCGCCGGGAC
GGCTCGAAGG TCGCGGCGGC TTAG
 
Protein sequence
MIDDSRSSGA ATHAPLQARI AAVALTTTVA ALLAACLCFM LQQWSVARQE SQTSHEILSQ 
MVAAGAAAPM ADHNTVGAYR ALEAVAKAPN VSAVRLTDMH GQVVAQIGAE GDAAVDTLIR
PIKLGSRQVG TLALMAKRPS LTDIIARDLA LTGTLFFGSA GLAILLANGL ARRMTKPMER
LSRAMREVAA DGRFKPVEQA ADDDVFQSLT DSFNHLVSRL EINDHDLRGA MAELVKARDD
ANAANVLKSH FLANMSHEIR TPLNGVLAMT EVMAMGELSA AQRERLSVIR ESGDLLLSVL
NDVLDLSKIE AGRLDLVERD FDLASLALSI RESYATQARQ KNLEFGVFVA PEALGPWRGD
ADRLRQILGN LVSNALKFTL EGSVSVRFAS ADDGSGLRID VVDTGIGIAA ESLPRLFDKF
VQADSSTTRR FGGSGLGLSI CNELAALMGG GVHVQSREGQ GSTFTVVVAM PRGEATVHVP
VEAVPPPIES ERRLRVLAAD DNPTNQKVIA AVLAPLGAEV ELVADGAACV EAWKRGGFDI
VLMDIHMPVM DGVEAARTIR SLEVSEGRKR IPIVAVTANA LVHQVEGYMA AGMDGHVAKP
IEVTKLYDAI ETAVAIARRD GSKVAAA
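
The sequences above are printed in groups of 10 characters, 60 per line, so the grouping has to be stripped before the data can be analysed. The sketch below shows one way to do that and to run basic checks such as GC fraction and stop-codon presence. The helper names are hypothetical, and the stop codons are those of translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3 and ending in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence above, used as a short excerpt.
first_line = "ATGATCGACG ACAGCCGCAG CTCTGGAGCC GCGACTCACG CGCCGTTGCA GGCGCGGATA"
last_line = "GGCTCGAAGG TCGCGGCGGC TTAG"
excerpt = clean_sequence(first_line + "\n" + last_line)
print(len(excerpt), excerpt[:3], excerpt[-3:])
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_fraction` and `looks_like_complete_cds` gives values that can be compared with the GC content and Gene Length fields of the record.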