Gene Caul_0079 details


Gene Information

Locus tag: Caul_0079
Symbol:
ID: 5897791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 94113
End bp: 96032
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 72%
IMG OID: 641560562
Product: signal transduction histidine kinase
Protein accession: YP_001681715
Protein GI: 167644052
COG category: [T] Signal transduction mechanisms
COG ID: [COG0784] FOG: CheY-like receiver
        [COG3920] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
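
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 94113
end_bp = 96032
gene_length_bp = 1920
protein_length_aa = 639

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks hold for the values listed here.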


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.464916
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGATG CGGGCGGAAC GCTGGCAGAT GATCACGAGG CGCGCCGGCT GCGCGCCCTC 
GACGCCCTTC GCGCCCTCGA CGGCGACCCG TGCGACCCCC GATTCGATCG GATCGTCCGC
TTGGCCTCGC GCCTGTTCAA CGCGCCGCGC GCCGCGATCC GGCTGATCGG CAAGGACCGG
GTCTGGTTGA AGGCCAGGGT GGGCTTTGAC CATGTCGAAG AAGCTCGCCC CCCCGGCCTG
AGCGAGCGCC TGCGCGAAAC CGGCGTGGTC TCGCATCCCG ATCTCGCCCA CGCCGCCTCG
GACCAAACCC TACGGCCCTG GTGCGCCGAC AGCCGCTTCT TCGCCTGCGC GCCGCTCAAG
AGCGCGGCGG GGGATATCGT CGGCCTGCTG ACCGTCGAGG ACCCCCGGCC CCGCGATGCG
GTCGATGCGG GCCTGACCGA GGCCCTGGCC GACCTCGCGG CCCTGGCGAT GGAAGAACTG
CTGCATGACG CCGAGACGGC CCGCAACGCC GCCGAGCGCG CGCTCGACAG CGAGCGCATC
GCCCTGGCCC TGCGGGCGGC CAATCTGGGC GAGTTCGTCT GGGACATCGT CGCCGACACG
GTGCGGGTCA GCCCGCGGAT GTCGCGGATC ACCGAGATTC CGGAAGGGGT GGCCCCGGCG
GACGGCGGCA AGGCCCTCTA CGCCTTCATC CACCCTGACG ATCGCGAGGC CACCCGCGCC
GAGATCGAAG CCCAGCTGAA GGCCCAGGGC CGCTACGAGG TCGAGTTCCG GCGCGTGACC
TCGGATCCGG ACCGGGTGAT CTGGAACCGC GTGGCCGCCC TGATGGTGCT CGACGCGGCC
GACCAGCCCG TGCGGCTGAT CGGCGTGGTG CAGGACGTCA CCGCGCGCCG CGACGCCGAC
GATCAGCGCG AGAACCTGCT CACCGAGCTG GATCACCGGA TCAAGAACAT CCTGGCCGCC
GTGCTGTCGG TGGCCGGCCA GTCGGCCCGC AAGGCCTCGT CGCTGGATGG CTTCTTAAAG
GCCTTCACCG GCCGGCTGAA ATCGATGAGC TCGGCCCATG ACCTGCTCAG CGCCGCGCGC
TGGCGCGGGG CCACCCTGGC GCGGATCGCC GCCGCCGAGC TGGGCGGTCT GGCCCCCAAC
CAGACCCGCT GGGACGGGCC GGAGCTGTTC CTGACGCCCC GCGCGGCGGC CGCCCTGTCG
CTGACCCTGC ACGAACTGGC CGTCAACGCC GTGAAGTTCG GGGCCCTGTC CTCGGAGAGC
GGCCGGGTCG AGGTCGTCTG GCGCGGCTCG CCCGAAGGCG GCTTCAACCT CGAATGGCTG
GAGACCGGTG GACCCATGAC CTCGCCGCCA GCCACCCGCG GCTTCGGCAT GACCCTGATC
GAGGACGTGG TCGGTCGCGA ACTGGGGGGG CGGGCCAAGA TCGAATACAA GCGCAGCGGC
GTCACGGCGA TGATCCACGC CGCCGCCGAC GCCCTGGTCG AGACGCCCGA ACCCGAGCCG
GCCGCGCCCC CGAACGAACG CATCGTCGAG ACCGTGGGCG GCGGCGACGA CAGCTTCCGG
GCCGGCGACA TCGCGGGCCT GCGCGTGCTG ATCGTCGAGG ATTCGCTGCT GCTGGCCATG
GAGTTGGAGG CGGGGCTGGA GGATTCCGGC GTCGAGGTGG TGGGGTGCGC CGCCGAACTG
TCCGAGGCCC TGCAGATGCT GGAGCTGTCG TTCGACGCCG CCGTGCTCGA CGCGGACCTC
AACGGCCAGT CGGTGGCGCC GGTCGCCGAG ATCCTACGTC GCGAGGGCCG GCCCTTCGTG
TTCGCCACCG GCTACGCCGA CAAGGCCGCC CCGATGGGGT TCGACGCCCC GATCGTCCGC
AAGCCCTACA ACGTCCACCA GATCGCCCGG GCGCTGGCGT CGGTGACGGG GCGCGGCTGA
 
Protein sequence
MDDAGGTLAD DHEARRLRAL DALRALDGDP CDPRFDRIVR LASRLFNAPR AAIRLIGKDR 
VWLKARVGFD HVEEARPPGL SERLRETGVV SHPDLAHAAS DQTLRPWCAD SRFFACAPLK
SAAGDIVGLL TVEDPRPRDA VDAGLTEALA DLAALAMEEL LHDAETARNA AERALDSERI
ALALRAANLG EFVWDIVADT VRVSPRMSRI TEIPEGVAPA DGGKALYAFI HPDDREATRA
EIEAQLKAQG RYEVEFRRVT SDPDRVIWNR VAALMVLDAA DQPVRLIGVV QDVTARRDAD
DQRENLLTEL DHRIKNILAA VLSVAGQSAR KASSLDGFLK AFTGRLKSMS SAHDLLSAAR
WRGATLARIA AAELGGLAPN QTRWDGPELF LTPRAAAALS LTLHELAVNA VKFGALSSES
GRVEVVWRGS PEGGFNLEWL ETGGPMTSPP ATRGFGMTLI EDVVGRELGG RAKIEYKRSG
VTAMIHAAAD ALVETPEPEP AAPPNERIVE TVGGGDDSFR AGDIAGLRVL IVEDSLLLAM
ELEAGLEDSG VEVVGCAAEL SEALQMLELS FDAAVLDADL NGQSVAPVAE ILRREGRPFV
FATGYADKAA PMGFDAPIVR KPYNVHQIAR ALASVTGRG
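
The GC content field could be recomputed from the gene sequence block. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper, and it assumes the sequence is pasted as plain text in which spaces and line breaks are only layout. The short input in the usage lines is the first two 10-base groups of the gene sequence.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Usage on the first two 10-base groups of the gene sequence.
prefix = "ATGGACGATG CGGGCGGAAC"
print(gc_percent(prefix))
```

Passing the full gene sequence text to the same function gives a value that can be compared against the GC content listed in the record.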