Gene Caul_1241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1241 
Symbol 
ID5898696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1302938 
End bp1304833 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content71% 
IMG OID641561726 
Productsignal transduction histidine kinase 
Protein accessionYP_001682869 
Protein GI167645206 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase
[COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0246477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGCTG ACGCGGCGCC CGTCTTGATC GGTTATCTCG ACAACCAGCA GCGCTACCGC 
TTCGTCAATC GCGCCTACGA GGAATGGTTC GGCGTCGACC GCGCCAAGAT CGAGGGCCGA
ACCCTGATGG AGGTGTTGGG CGAGGCCGGC TACGCCAAGG TCCGGGACCA CGTGGCCCGC
GCCCTGGCTG GCGAGCGACT GATCTACGAG GGCGAGGCGC CCTACCGCGA CGGCGGCCAA
CGCCATATCG AGGCTCAGTT TGTCCCCGAC ATCCAGGCCG ACGGGTTCGT CGCGGGCTAT
TGCGTGGTGG TCACCGATAT TTCCGACCGC AAGCGCGCCG AGGCCCGGGT CGCGCGCCAG
CTGCAACAGG AACGACGCCG GGCCCTGATC CTCGACCTGG GCCGCCGTCT GCGCGAGGAG
ATCGATCCCG ACGCCGTCGT CGCCAAGGCC TGCGAAGCGC TGGGACTGCA TCTGAACGCG
TCGCAGGTCG GCTACGGCGA ACTGGACTTC GCCGCCGCCC AGATCCACGT CGTCGGCGAA
TGGCGGGGGG CGCCCTCCCC GCCCCTGCTG GGCAGCCGAC ACGTGCTGGA CTCGTTCGGG
CCGGCGATGG CGGACGAGAT GCGGGCCGGC CGCGAAACCG TGGTCAGCGA CGTCGCGGTC
GATCCGCGAA CGGCCGCCGC CGCGGCGACC GGCGCCTACG CGGCGCTGAA CGTGGGCGCC
TATGTCACCT TTCCGCTGAT CAAGGCCGGT CGCCTGGTGT CCTATTTCTA CGTCGCCCAG
GACGCGCCCC GCGCCTGGAC AGAAGAGGAA GTGGCCTTCA TCGGCGAGAT CGCCGAGCTG
ACCTGGGCCG CCGCCGAGCG GGCGCGTTCG GACGCCGCCC TGACCCAGGC CGAAGAGACC
GAACAGCTGC TGATGCGCGA GATCGACCAC CGGGCCAAGA ACGTGTTGGC CGTGGTCCAG
TCCCTGGCGT CCCTGACCCC GTTCGTCGAC AAGGAACAGT ACGTCGCCGC CCTGTCCGGC
CGCATCGGCT CCCTGGCCCG CTCGCACAGC CTGCTGTCAG GCGCCCGCTG GAGCGGGGCG
CGCCTGGATG TGCTGCTGCA GCAGGAGTTG GAACCCTACG GGGCCGAGAA CGACAACCGG
GTCACGATCG CGGGACCGCC GGTGCTGATC CAGGCCGAGG CGGCGCAGTC CCTGGGGCTG
GTGATCCACG AACTGGCCAC CAACGCCGGC AAGTACGGCG CCCTGTCCAC CCCAGCCGGG
GCGCTGGAGA TCGCTTGGCG CTGGGAGGCC GAGCGCCTGG TCCTGACCTG GCGCGAGACG
GGCGGTCCTC GAACGTCGCC GCCCGCGCGC CAGGGCTTCG GGGCCACCCT GATCGCCAAC
GCCGGCCGGC AGGTCGGGGC GACGATCACC CAGGACTGGC GCGCCGAGGG TCTGGTCTGC
GAGATCGCGC TGGGGCGCGG CGCTGCGCCC TATTTCGGCG TCCCGCCCCG ACCCGCGGCC
GGCGACGGGG CGACTCACGA CGGCGGCGCT ACGGATAGCG ACGCGATCCG ATCCTTGGCC
GGCCAGCGGG TGCTGATCGT CGAGGACGAG GCCCTGGTGG CCATGGAACT GGCCCAAATC
CTGGCCGCCG CGGGCGCCCA GGTGATCGGG CCGACCGGCG ACATCGACGA CGCCCTGGCC
CTGGTCGCGG CCGGCGGCGT CGACCGCGCC CTGCTCGACA TCAACCTGGC CGGCCGCACG
GTCACCCCGG TGGCGAGCGC CCTGGCCCAG AAGGCGATCC CGTTCGTCTA CCTGACCGGC
TACCAGGAAG TCGATGTCGA GGACGGTCCG GTGCTGCGCA AACCGGCCAG CGCGGCGGCG
CTGCTGGGCG CGCTGGCGAG CCAGGTGGCG GTTTAA
 
Protein sequence
MVADAAPVLI GYLDNQQRYR FVNRAYEEWF GVDRAKIEGR TLMEVLGEAG YAKVRDHVAR 
ALAGERLIYE GEAPYRDGGQ RHIEAQFVPD IQADGFVAGY CVVVTDISDR KRAEARVARQ
LQQERRRALI LDLGRRLREE IDPDAVVAKA CEALGLHLNA SQVGYGELDF AAAQIHVVGE
WRGAPSPPLL GSRHVLDSFG PAMADEMRAG RETVVSDVAV DPRTAAAAAT GAYAALNVGA
YVTFPLIKAG RLVSYFYVAQ DAPRAWTEEE VAFIGEIAEL TWAAAERARS DAALTQAEET
EQLLMREIDH RAKNVLAVVQ SLASLTPFVD KEQYVAALSG RIGSLARSHS LLSGARWSGA
RLDVLLQQEL EPYGAENDNR VTIAGPPVLI QAEAAQSLGL VIHELATNAG KYGALSTPAG
ALEIAWRWEA ERLVLTWRET GGPRTSPPAR QGFGATLIAN AGRQVGATIT QDWRAEGLVC
EIALGRGAAP YFGVPPRPAA GDGATHDGGA TDSDAIRSLA GQRVLIVEDE ALVAMELAQI
LAAAGAQVIG PTGDIDDALA LVAAGGVDRA LLDINLAGRT VTPVASALAQ KAIPFVYLTG
YQEVDVEDGP VLRKPASAAA LLGALASQVA V