Gene Caul_2969 details


Gene Information

Locus tag: Caul_2969
Symbol:
ID: 5900424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3223560
End bp: 3225464
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 68%
IMG OID: 641563466
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001684594
Protein GI: 167646931
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
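
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Neither assumption is stated in the record itself.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3223560
end_bp = 3225464
gene_length_bp = 1905
protein_length_aa = 634

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = gene_length_bp // 3
coding_codons = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks should agree with the values listed in the record.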


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.163827
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGGCG ATCCGGCGGG CCAGGTGCGA ACACGCGCCT TGAACAGGGC TGCGGCGTTA 
TCTCTGGCGA CGCGGTATGG CGGGGCGGTC GCCGCCGTGG CGGTTGCGGC GATGGTGCGG
CTGGCCATGG AGCCTTGGCT GGAGCATCGC TCGCCCTATC TGTTGTTCAC CGCCCCGGTT
GTCGCCGCCG TGGCCTTTGG CGGCTTGGGG CCGGCTCTGG TGGCCGCGGC CTTGGGTCTG
GTCGCCGGCG TGGTGTTCAG CGGGAGCCTG GCCGCCGAGC CGAGCGCGCC GCTGATGTTC
GTCCTGGTTT CGGTGGCGCT GATCCTGGTG GGCGGCGAGA TCCATCGGCT GCGACGACGG
TTCGTGGCCA GCGAACAGCG GGCCGTTGAG CGCGCCGCGA CCGCGGGCAG GATCGCCCGC
GAACTGAACC TGCTGATCGA CGGCGCGGCC GAATACGCCA TCTACATGCT CGACCCTCAA
GGGCGCGTGC TGATCTGGAA CGCCGGCGCC GAGCGTCTCA AGGGATGGAG CGAGGCCGAA
ATCATCGGTC AGGACGCCGC GGTCTTCTAT CCGGCCGACG CCCGCGCGGC CGGCAAGCCG
ATGGAGGATC TGCGGCGAGC CGAAGCTGTC GGCAAGCTGG AGCAAGAAGA CTGGCGCTTG
CGCAAGGACG GCTCGGAGTT CCTGGCCCAC GTCTCGATCA CCGCGCTCTA TGACGAGGAG
GGCGCGCTCC AGGGCTTTGG CAAGGTCGTG CGCGACGTCA CCGATCAGAG GGCCAGCGAG
CATGCGCTGC AGGCCAGCAC CAATCACCTG CAGTCGATCC TCTCCACGGT GCCGGACGCG
ATGATCGTCA TCGACGAGCA CGGCGCCATC CTGTCGTTCA GCGCCGCGGC CGAGCGGCTT
TTTGGCTATG CGGAGGCCGA GGTGATCGGC TCCAATATCA GCCGGCTGAT GCCCGAGCCC
GATCAGACGC GACATGACGG CTATCTGCAG CGTTATGTCG CCACCGGAGA GCGTCGGATC
ATCGGCATCG GGCGCGTCGT CGTGGGCCTA AAGCGCGACG GCGCGACCTT TCCGATGGAA
CTGTCGGTCG GCGAGGCGCG AGGCGAGGGC CAGCGGGTGT TCACCGGTTT CATCCGTGAC
CTGACCGACC GTCGCCGCAC CCAAGCCCGC CTGGAGGAGC TGCAGTCGGA GTTGATCCAT
GTGGCCCGGG TCAGCGCCAT GGGGACCATG GCGTCGACCC TGGCTCACGA GCTCAATCAA
CCGATCACAG CGGTCGCCAA CTATGTCGAG GCGATACGCG ACCTCCTGGC CCAGCCCGAA
CCCGACGACC TGCCGATGAT CCGTGACGCG CTCGGCGAGG CGGCCAGTCA AGCCATGCGC
GCCGGCCACA TCGTCCGTCG GCTGCGCGAC TTCGTCGCCC GCGGCGAGGT GGAGAAGACG
GTCGAGGACC TGCCGGCCTT GGTCGATGAG GCCGTGGCCC TCGGCTTGCT GGGCGCGCGC
GAGGATGGGG TGAAGGCCAG CTTCGATCTG GATCCGCAAG CGCGCCTGGT CTTGGTCGAC
AAGGTCCAGA TCCAACAGGT CCTGATCAAT CTGGCCCGCA ACGCCGTCCA GGCGACGGAG
GGGTGCGCGC AGCGGCAGGT GACCTTCCGC AGCCGCCAGG AGCCGGGAGG CCTGACCCGG
ATGACCGTGG CCGATACGGG CTGCGGCGTT CCGCCCGGAG TGGCCGAGCA GCTGTTCACC
GCCTTTGTCA CCACCAAGGC CGAGGGCATG GGCCTTGGGC TTTCGATCTG CCGAACCATC
GTCGAGGCCA ATGGCGGGCG AATCTGGTTC GAGCCGCGAG AGGGCGGCGG TTCGCAATTT
CACTTCACGC TGGTGCGCGC GGAGCCGGAG GCGGTTGATG TCTGA
 
Protein sequence
MVGDPAGQVR TRALNRAAAL SLATRYGGAV AAVAVAAMVR LAMEPWLEHR SPYLLFTAPV 
VAAVAFGGLG PALVAAALGL VAGVVFSGSL AAEPSAPLMF VLVSVALILV GGEIHRLRRR
FVASEQRAVE RAATAGRIAR ELNLLIDGAA EYAIYMLDPQ GRVLIWNAGA ERLKGWSEAE
IIGQDAAVFY PADARAAGKP MEDLRRAEAV GKLEQEDWRL RKDGSEFLAH VSITALYDEE
GALQGFGKVV RDVTDQRASE HALQASTNHL QSILSTVPDA MIVIDEHGAI LSFSAAAERL
FGYAEAEVIG SNISRLMPEP DQTRHDGYLQ RYVATGERRI IGIGRVVVGL KRDGATFPME
LSVGEARGEG QRVFTGFIRD LTDRRRTQAR LEELQSELIH VARVSAMGTM ASTLAHELNQ
PITAVANYVE AIRDLLAQPE PDDLPMIRDA LGEAASQAMR AGHIVRRLRD FVARGEVEKT
VEDLPALVDE AVALGLLGAR EDGVKASFDL DPQARLVLVD KVQIQQVLIN LARNAVQATE
GCAQRQVTFR SRQEPGGLTR MTVADTGCGV PPGVAEQLFT AFVTTKAEGM GLGLSICRTI
VEANGGRIWF EPREGGGSQF HFTLVRAEPE AVDV
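
Both sequences are printed in space-separated blocks of 10 characters, so the spacing has to be removed before any length or composition check. The following is a minimal sketch, not part of the original record: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide string."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGGTTGGCG ATCCGGCGGG CCAGGTGCGA ACACGCGCCT TGAACAGGGC TGCGGCGTTA"
seq = clean_sequence(first_line)

print(len(seq))                     # number of bases after removing spacing
print(round(gc_fraction(seq), 2))   # GC fraction of this fragment only
```

Running the same helpers on the full gene sequence would allow a comparison with the 68% GC content and the 1905 bp length reported in the Gene Information section.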