Gene Caul_0279 details

Gene Information

Locus tag: Caul_0279
Symbol:
ID: 5897553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 308434
End bp: 310689
Gene Length: 2256 bp
Protein Length: 751 aa
Translation table: 11
GC content: 69%
IMG OID: 641560763
Product: CheA signal transduction histidine kinase
Protein accession: YP_001681914
Protein GI: 167644251
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0643] Chemotaxis protein histidine kinase and related kinases
TIGRFAM ID:
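
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention explicitly. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(308434, 310689)
print(length)                  # 2256
print(protein_length(length))  # 751
```

Both values agree with the Gene Length and Protein Length fields of this record.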


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0597552
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.855639
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAGC TAGAGGCCAT CAAGGTCACC TTCTTTCAGG AATGCGAGGA ACTTCTCGCC 
GATCTTGAAG GCGGGCTCTT GGCCATGCAG GACGGCAACG ACGATGGCGA CACCGTCAAC
GCTGTGTTCC GGGCCGTGCA CTCGATCAAG GGCGGGGCTG GCGCCTTCGG CCTGGAACCG
CTGGTGCGCT TCGCCCACGT GTTCGAGACC CTGCTCGACG CCCTGCGGTC GAACGCGATC
CCTGGCAGCG CCGATCTGAC GGCCCTACTG CTGCGCGCTT CGGACGTGCT GGCCGACCAT
GTCAGCGCCG CCCGCGGCCT GGGCGTGGTC GATATGGACG CCTCGGCCTC AATGGCCGCT
GAACTGAAGG CGGTCACCGA TCCCAACGCC GCGCCCGCTC CGGTCGCCGC CGCGCCCGTG
GTCGATGCCC CGCAGTACGT CAACGCCGCG CCGATCGTCA TCGATGACGG CATGGACGAT
GACGACCTGG GCTTCGACTT CCAGCCGATG ACCATCAGCC TCGACGCGCA GGCCGAAGAC
GCCGCCGTGC TCGACGACAA CGTCTGGACC GTGTCGATCC GTCCGAAATC GGACCTCTAT
CGCAAGGCCA ACGAAACCGG CCTGCTGCTC CGCGAACTGA GCCGGCTGGG TCCGGTCCAG
GCGACGCTGG ACGACAGCGC CCTGCCGGCC CTGGAATTCC TGGATCCCGA AGCCGCCTAC
GTCACCTGGA GCGTCCGCGT CGAGACCGAC CAGGGCGAAG AGGCGATCCG CGAGGTCTTC
GAATTCGTCG ATGGCGACTG CGAACTGGAA ATCACCCGCG GTGAGGCGCC GGTGGCCGAC
ACCCTGGCTG CCCTGCTGGA AACCATCGCG CCGGCTCCCG AGCCCGTCGC CGAAGTCCAT
ATCGAGCCCG AGATCGAAGC GCCGATGATC GAGGCCCCGG TCGACGTCGC GCCGGCTCAG
ATCTCCGCGC CGCCGCCCGC CAAGCCGCCG GTCGCCGCCA ACGCGCCGGC GGCCAAGCCG
GCCCAGATCG ACGTGCCCGG TCCCGGTCAA TCGGTGATCC GTGTCGATCC CGAGCGCATC
GACCATCTGA TCGACCTGGT CGGCGAACTG GTGATCAACC AGGCCATGCT GGCCCAGCGG
GTCGGCGAAT ACGGCATCGC CCCGTCCTCG AACCTGGCCA TGGGTCTGGA TGAGCTGGAA
CAGCTCACCC GGCAGATCCA GGACAGCGTG ATGGCCATCC GCGCCCAGCC GGTGAAGTCG
GTGTTCCAGC GCATGCCGCG CCTGGTCCGC GAAGTCGCCA ACATGACGGG CAAGCAGGCC
CGCCTGGTGA TGGACGGCGA GAACACCGAG GTCGACAAGA CGGTCATCGA GCGTCTGGCC
GACCCGATCA CCCACATGCT GCGGAACGCC ATCGACCACG GGCTGGAAAG CCCCGAGGAG
CGTCGCGCCG CCGGCAAGAA TCCCGAAGGC GTCGTGCGCC TGGCCGCCCT GCACCGCAGC
GGCCGGATCG TCATCGAGGT CCAGGACGAC GGCAAGGGCA TCAACCGCGA GCGCGTGCTG
TCGATCGCCG TCAACAAGGG CCTGATCTCG CCCGAGCAGA CCCTGACCGA CGAGGAGATC
GACAACCTGA TCTTCCTTCC CGGCTTCTCG ACCGCCGACA AGATCTCGGA CGTCTCGGGC
CGCGGCGTCG GCATGGACGT GGTCAAGCGC AGCGTCCAGG CGCTGGGCGG CCGCATCTCG
ATCTCCTCGC GTCCCGGCCT GGGCTCGACC TTCACGCTCA GCCTGCCGCT GACCCTGGCC
GTCCTCGACG GCATGGTGGT CGACGTGGCC GGCGAGACCC TGGTCATTCC GCTGGCCTGC
ATCGTCGAAA GCCTGCGGCC CAAGGCCGAG GAAGTCCGCC CGCTGGGTCC GACCGGTTCG
GTCCTGGCCG TGCGCGACAG CTTCGTGCCG CTGATCGACG TCGGCCTGAC CCTGAACTAT
CGCACCACTT CGCCGCCGGC CACCGAGGGC GTGGTCCTGC TGGTCGAGGG CGAGGACGGC
TCGCGCGCCG CCCTTGTGGC CGACGCCATC CACGGCCAGC GCCAGGTGGT CATCAAGTCG
CTGGAGCAGA ACTACCAACA GGTCGAGGGC GTCGCCGCCG CGACGATCCT GGGCGACGGC
CGCGTGGCCC TGATCCTCGA CGTCGACGCG GTGATCAACC TTCGCCGCCG CGGTCCGCCG
CTCCCCGCCG ACCCCACCCT CATCGCCGCG GAATAG
 
Protein sequence
MDELEAIKVT FFQECEELLA DLEGGLLAMQ DGNDDGDTVN AVFRAVHSIK GGAGAFGLEP 
LVRFAHVFET LLDALRSNAI PGSADLTALL LRASDVLADH VSAARGLGVV DMDASASMAA
ELKAVTDPNA APAPVAAAPV VDAPQYVNAA PIVIDDGMDD DDLGFDFQPM TISLDAQAED
AAVLDDNVWT VSIRPKSDLY RKANETGLLL RELSRLGPVQ ATLDDSALPA LEFLDPEAAY
VTWSVRVETD QGEEAIREVF EFVDGDCELE ITRGEAPVAD TLAALLETIA PAPEPVAEVH
IEPEIEAPMI EAPVDVAPAQ ISAPPPAKPP VAANAPAAKP AQIDVPGPGQ SVIRVDPERI
DHLIDLVGEL VINQAMLAQR VGEYGIAPSS NLAMGLDELE QLTRQIQDSV MAIRAQPVKS
VFQRMPRLVR EVANMTGKQA RLVMDGENTE VDKTVIERLA DPITHMLRNA IDHGLESPEE
RRAAGKNPEG VVRLAALHRS GRIVIEVQDD GKGINRERVL SIAVNKGLIS PEQTLTDEEI
DNLIFLPGFS TADKISDVSG RGVGMDVVKR SVQALGGRIS ISSRPGLGST FTLSLPLTLA
VLDGMVVDVA GETLVIPLAC IVESLRPKAE EVRPLGPTGS VLAVRDSFVP LIDVGLTLNY
RTTSPPATEG VVLLVEGEDG SRAALVADAI HGQRQVVIKS LEQNYQQVEG VAAATILGDG
RVALILDVDA VINLRRRGPP LPADPTLIAA E
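
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the code below joins such blocks, computes the GC fraction, and translates the DNA. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons), so alternative start codons are not handled here. The names `clean_sequence`, `gc_content`, and `translate` are hypothetical helpers chosen for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGACGAGC TAGAGGCCAT CAAGGTCACC TTCTTTCAGG AATGCGAGGA ACTTCTCGCC"
dna = clean_sequence(first_line)
print(translate(dna))  # MDELEAIKVTFFQECEELLA
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Applying `gc_content` to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.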