Gene Caul_3978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3978 
Symbol 
ID5901440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4308206 
End bp4309939 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content71% 
IMG OID641564499 
Productsignal transduction histidine kinase 
Protein accessionYP_001685601 
Protein GI167647938 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.301159 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATC ACGACCCCGC ATCGGACGCG CCGCCCCGGG CCGCCGGCAC CGTGCGAAGA 
CGCCTGCTGC TGATGGCCCT GAGCCTGGTG GTCCCGGCCG TGATCTTCAT GACGCTGTTG
GCGCGGGCCG AGTTCGGCGA GAGCCAGGCC CGCTACGAGC GGCAGTTGAT CGCCACCACC
CGCGCCCTGG TGCTGGCGAC CGACCGCCAG ATCGGCCAGG GGCAAAGCGT TCTGCAGGCC
CTGGCGGTCT CGCCCGCCCT GGTTTCCGGC GACATTTCGG CCTTCGAACG CCAGGCGCGC
GCGGCCGTGC AGGGCGGCGA GGGCTGGATC GTATTGCTCG ACAATGAGCG GCAGCTGGTC
AACACCCGGC GACCGGCGGG CGCGCCGCCG CCCAAGGTGG GCCTGCCCGA CTATCGGTGG
CGGACCATCC GCGCCGGCCG CACGTCGGTG TCGAACCTGG TGCTGCCCAA GACGCCCGGC
CAGTTCCCGC CCTTCGTGTC GATCGACATG CCGGTCATCG TCGACGGCAA GCTGTACGAC
CTGGCCTACC AGCAATCGCC CAGGGCCTTC TCGTCGATCT TCGCCGGCCA GAACATCCCG
CGCAGCTGGA CGGCCAGCAT CGTCGACCGC GAGGCCACGC TGGTTTCGCG ATCCAAGGAC
CAGGATCGCT TCCTGGGCCA CAAGGTCAGC CCCAACACCT ATGCGGCCAT GGCCCGCGGC
GCCGAGGGAG TGGTGCTGAG CCGGACCCTG GATGGCACGC CCACGCTCTC GGCCTTCAGC
CGTTCGCCGA CCACCGGCTG GGCGTTCATT GTCGGAGTGC CGCGCGCCGA GCTGAACCGG
GCCAACTGGT CGTCGATTGG GCTGCTGAGC CTGGCCAGCG CGGTGCTGCT GACCTTCGGC
GTGGCGGTGG CGCTGGTGTT CTCGCGCGAC ATCTCGGCGA CGGTGCGCGG CCTGGCGGTC
GACGCCAAGG CGGTGGCGGC CGGCGAGGAA ATCGCCCCCA CCCCCGATCG CCCCGACCAG
TTCATCGAAA TCGCCGAGGT GCGCGCGGCC CTGCACAAGG CCGCCCTCCA GCTGCGGACC
CGCGAGGCCG AGGAACAGCG CGCCCATCAG CGCCAGCAGC TGATGATCAA CGAGCTGAAC
CATCGGGTGA AGAACACCCT GTTCACGGTG CAGTCCCTGG CCCGCCAGAG CCTGGGACGG
CCGGCCGACA CGCCCGGCCT GACGGCCTTC AACGAGCGTC TGATGGCCCT GGCCCGCGCC
CACGACCTGC TGACCCGGAG CGTCTGGGAG GGCGCCGAGC TGAGGGAGAT CCTCGAGGAG
ACGCTCGAGC CGTATCTGGA CCGGACCGTG CTGGCCGGAC CGCTGGCGGC GCTGTCGCCG
AACGCCGCCC TGGCCCTGTC GATGGTGTTC CACGAGCTCG CCACCAACGC CGTCAAATAC
GGCGCCCTGT CGGTTCCCGA CGGCACGGTG ACGGTCGTCT GGCACGTCGA CCCCGGCGCG
GCGCACCGGC TGACCCTGCA CTGGGAGGAA CGGGGCGGAC CCAAGGTGTC GCCGCCCAGC
CGCTCGGGGT TCGGCTCGCG CCTGATCGCC GCCAGCCTCA AGTCCGACCT CAACGGCGAG
GCGCGCATCG ACTACCGGCC CACCGGCCTG GTCTGCGTGC TGACCCTGTC GCTGCCCCAG
ACCGGCAAGG AGCAGGCGGC GGCCGAGACG GCGTCCGGAC CGGTGGAAAG CTAG
 
Protein sequence
MADHDPASDA PPRAAGTVRR RLLLMALSLV VPAVIFMTLL ARAEFGESQA RYERQLIATT 
RALVLATDRQ IGQGQSVLQA LAVSPALVSG DISAFERQAR AAVQGGEGWI VLLDNERQLV
NTRRPAGAPP PKVGLPDYRW RTIRAGRTSV SNLVLPKTPG QFPPFVSIDM PVIVDGKLYD
LAYQQSPRAF SSIFAGQNIP RSWTASIVDR EATLVSRSKD QDRFLGHKVS PNTYAAMARG
AEGVVLSRTL DGTPTLSAFS RSPTTGWAFI VGVPRAELNR ANWSSIGLLS LASAVLLTFG
VAVALVFSRD ISATVRGLAV DAKAVAAGEE IAPTPDRPDQ FIEIAEVRAA LHKAALQLRT
REAEEQRAHQ RQQLMINELN HRVKNTLFTV QSLARQSLGR PADTPGLTAF NERLMALARA
HDLLTRSVWE GAELREILEE TLEPYLDRTV LAGPLAALSP NAALALSMVF HELATNAVKY
GALSVPDGTV TVVWHVDPGA AHRLTLHWEE RGGPKVSPPS RSGFGSRLIA ASLKSDLNGE
ARIDYRPTGL VCVLTLSLPQ TGKEQAAAET ASGPVES