Gene Caul_3743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3743 
Symbol 
ID5901205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4057280 
End bp4059454 
Gene Length2175 bp 
Protein Length724 aa 
Translation table11 
GC content73% 
IMG OID641564266 
Productintegral membrane sensor hybrid histidine kinase 
Protein accessionYP_001685368 
Protein GI167647705 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.3347 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCAGA CCATGGGGTT GAAATTCGAT GCGGCGAGCC TGAGACGGGG CGGGGACCCG 
GCGCGACGGC GCCTGGCGAT CCTGGTGCTC GCCTATGTGG CGTGCCTGTC CTACAGCCTG
TTCCTGGCGG GCGCGACCCA CCATATCCCG ACGATCTGGA CCGCCAGCGC CGTGGTGGTC
GCCGGGCTGA TGATCCTGCC GCGCCGGCTT GGCGCGGCCC TGCTCAGCCT GACCGCCCTG
CTGCATGTGG TGATCGAGCT GGGCGTCGGC GATCCGCCGA GGTTCGTCTT CGTGGTCACC
GTGCTGGACA CCGTGCAGGA GGCCGCGACC GCCGCCCTGC TGCGGCTGTT GCGCATGCCG
ACCCGAGTGC GCGACATGCG CGGCCTCCTG GCCCTGACCG CCGTCTCGAC AACGCTGACG
GCGATCGCCT CGATCTGCGT CAACGGCCTG CTGGCCCTCA GCGGCGGCCG GGCCTTCTGG
GCCGGCTGGA CCGATTGGAC CACGTCCAAC GTGCTGGGCG CGGCGATCAC CCTGCCCACC
ATCCTGATCC TGTTCGACCG CCGTCACCTG CAGGGCTTCC GGATCCCGGC GGGCGAGGCG
GCGCTGGGCG TGGTCTTCAT CCTGGCCGCC ACGATCGCCG TGTTCTCCGC CGACGCCTCG
CTGCAGGTGC TGCTGTTCGC CCCCGCCCTG CTGGCGGTGT TCCGGGGCGG CCCCCGGGCC
GCGGCCATCG TCGTCACCCT CTCATTGGCC GCGACCATTC CGGCGGTGCT GCACCGCGTG
GGGCTGGACC CCAAGGTCGC CGCGCCGCCG CTGCGCCACG CCCTGATCTT CCACCTGGTG
CTGTACGCCG TCTGCCTGAC CGCCGCCCTG GCCCTGAGCC GCCAGGCGCG GCTGCAGGCG
CTGCTGGTCC GCCGCCAGGC CATCGCCCGC GCCGCCCAGG CCAAGGCCCA GGCCGCCAAC
CAGGCCAAGT CCGACTTCCT GGCCACGATG AGCCACGAGA TCCGCACGCC CCTGAACAGC
ATCCTCGGCT TCGCCGCCCT GGTCGGCGAG GATCCTGGCC TAGCGCCCGA GAACCGCCGG
CGCCTTGGCC TGGTGGTCAG CGCCGGCCGG TCCCTGGCCG AGATCGTCAA TGACCTGCTG
GACTTCGCCA AGGTCGAGGC CGGCCGCATG GACCTGGCCC TGGAGCCCGT TTCGCCCGCC
GGCCTGCTGC GCGACGCGGT CGCCATCATC GCCCCGGCCG CCGAGGCCAA GGGCCTGCCG
CTCTCGGTGA TCATCGAAAC CACGGGCCAA GGCGACGAGA CCGCCCTGCT GGCGCTCGAC
GAGACCCGCC TGCGCCAGGT GCTGCTCAAC CTGCTGGCCA ATGCGTTGAA ATTCACCGCC
CAGGGCCAGG TGACCGCCCG ACTGACGATC GGCCCCGCGC CGGGCGATCT AAGGTTCGAG
GTCACCGACA CCGGCATCGG CATCGCGCCG GCCGTCCAGG CCCAGCTGTT CCGCCGCTTC
AGCCAGGCCG ACAGCTCGAT CAGCCGCCGC TATGGCGGGG CGGGCCTGGG CCTGGCGATC
AGCAAGGCCT TGGTGACCCA GATGGGCGGC GCGATCGGCG TCGACAGCGC CCCCGGCGAC
GGCTCGCGGT TCTGGATCGA CCTGACGGCC GAGGTCGTCG CGGCCGACAC GGCGCCAGCC
GTCGCGCCGA CCACGCTGGA CGCCACCCGC TCGCCCCGCG TGCTGCTGGT CGACGACCAT
CCGATGAACC GCGAACTGGG CCACGCCCTG CTGACCCTGG CCGGCTGCGA GGTCTTGACC
GCCGACGACG GCGCCCAGGC GGTGGCGGCC GCGCGCCTGG GCGGCTTCGA CCTGATCCTG
ATGGACGTGC ACATGCCCGG CATGGACGGC CTGGCCGCCG CCCGCGCCAT CCGCGCCCTG
CCCGGCCCGG AAGCCGCCGT GCCGATCATC GCCCTCAGCG CCGACGTCCT GCCCGACCAG
ATCGCCCGCT GCCGTGCGGC CGGGATGGAT GACCACGTCG CCAAGCCGAT CCGGCGGGAG
GAGCTGGTGG CCGCGGTGGC GAGGGCGTTG GGGGCGACCA GCGACCTGCC CTCCCGGACC
GGAGCCGCCG GCTGGGACGG TCCGCTCACG CCCCCCCTTG CCCCCCGCCC AAACCCGCGT
CACTCTCTGC CTTAA
 
Protein sequence
MVQTMGLKFD AASLRRGGDP ARRRLAILVL AYVACLSYSL FLAGATHHIP TIWTASAVVV 
AGLMILPRRL GAALLSLTAL LHVVIELGVG DPPRFVFVVT VLDTVQEAAT AALLRLLRMP
TRVRDMRGLL ALTAVSTTLT AIASICVNGL LALSGGRAFW AGWTDWTTSN VLGAAITLPT
ILILFDRRHL QGFRIPAGEA ALGVVFILAA TIAVFSADAS LQVLLFAPAL LAVFRGGPRA
AAIVVTLSLA ATIPAVLHRV GLDPKVAAPP LRHALIFHLV LYAVCLTAAL ALSRQARLQA
LLVRRQAIAR AAQAKAQAAN QAKSDFLATM SHEIRTPLNS ILGFAALVGE DPGLAPENRR
RLGLVVSAGR SLAEIVNDLL DFAKVEAGRM DLALEPVSPA GLLRDAVAII APAAEAKGLP
LSVIIETTGQ GDETALLALD ETRLRQVLLN LLANALKFTA QGQVTARLTI GPAPGDLRFE
VTDTGIGIAP AVQAQLFRRF SQADSSISRR YGGAGLGLAI SKALVTQMGG AIGVDSAPGD
GSRFWIDLTA EVVAADTAPA VAPTTLDATR SPRVLLVDDH PMNRELGHAL LTLAGCEVLT
ADDGAQAVAA ARLGGFDLIL MDVHMPGMDG LAAARAIRAL PGPEAAVPII ALSADVLPDQ
IARCRAAGMD DHVAKPIRRE ELVAAVARAL GATSDLPSRT GAAGWDGPLT PPLAPRPNPR
HSLP