Gene Caul_2539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2539 
Symbol 
ID5899994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2753399 
End bp2755063 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content67% 
IMG OID641563030 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001684164 
Protein GI167646501 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.787523 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGCA TTATGCGATC ATCGACGATA GGAGACGCTC GCGCCCTTCA ATGGGTTGTC 
GGCTCGGCTA TCGTGCTGGT CATCGTCTCG ATCCTTGGCG GCGTGGCCCT GATGCTGTTC
GCCGCGCGCT CGCTCGACCA CATCGAAAGC CTCGACGAAC GCCAGCTGGT CGACCGCACC
ATCCAGCGCG GCCTGGAGCG GATGACGCAT GAGCTGACCT CGGCGACGGT TTGGGATGAG
GCCTATACTG CGACGACGCC CGTGGTCGAC ATGGCTTGGG CCGATATCAA CTACGCCGAC
TATTATCACC GCTATTTCAA TCACGACCTG AGCTTCGCGA TGCGCGACGG CCAGGTCGTC
TACGCGTCGC AGGCGGGCGC GCGCGTTTCG CCGCGCGCGA TCGGCGCTCT TCCCGGCGAT
GCGCGGGGAT TGGTCGATGA GGTCCTCGCC AAGGCCGCGA AAGCCCGCGC CGCGGGCCGG
CTGAACCTCG AAGGCGTCTC GACGGCGGCG GCGCTCGTCA AGTCGGGCGA CGAGATCTAT
CTCGTGGCCG CGTCCGACGT GGTCGCCGAA ACGCCGCGCG TGGCCGCCAG TCGGCATGCG
CCGCCGATCG TCATCATCAC GGCGCGTCGC ATCAGCGCGA CCTTTATTCG CGGAATGCAG
GAAGACCTCG CCGTTCCGGG CTTGACCCTG ACAGCCAAGC CTTCGGCCTC GCCCAGCGTG
CCTCTGCATG ATGCAAGGGG CGTCACCATC GGCGCCCTCA GCTGGACGCC GGCCAATCCC
GGCATGTCGC TGCTGAAGGC TGCCCTTCCG GCGATCGTGG CTGTCTTCCT GGTGCTGACC
CTGGCTAGCC TGGTCTTGTT TCGCCGAGTC GCCGAGACCC TGGACCGAAT GGCGCGGGGG
CGCGAGTCCC TGGTTGTGGC CAAGGAGCAG GCGGAGGCCG CCAACGTCGC CAAGACCCAG
TTCCTGGCCA ATATGAGCCA CGAGATCCGC ACCCCGCTGA ACGGCGTCCT GGGCATGGCC
CAGATCATGG AAAGCGACGC CCTCTCGGAT CGGCAGCGCG AGCGCCTGGC TGTGATCGAA
GAGTCGGGCA ACGCCCTGCT CGCTCTGCTC AACAGCATTC TCGACATGGC CCGGCTGGAA
ACCGGCGCGA TCCGGTTGCG CCGGGAAGCC TTCGATCTGG GCGCCCTGGT CGATAGCAGT
TGCGCGGTGT TTTCCGGCGC GGCGGTCAGC AAGGGGATCA AGCTCTGCCA GGCCCTGACG
CCAGAAAGCC TCGGGGCCTG GGTCGGCGAT CCGCTTCGCC TGCGCCAGGT GCTCGGCAAT
CTGGTCGCCA ACGCCGTCAA GTTCACCGAC AGCGGCACGG TGCGCGTCCA CGTCGAGGAG
TCGGCGCTGG GCCTGCGCTT CGAGGTCAGC GACACCGGAA TCGGCGTGGC GCCCCAGGAC
CAATTGCGCC TGTTCAAGAT GTTTTCCCAG GTGGACAGCT CGTCGACCCG TTCGCACGAA
GGCTCGGGCC TGGGCCTGGC CATCTGCCAC GACCTGGTCG AGCTGATGGG CGGGGCGATC
GGTATGCGCA GCGTTCCCGG CGAGGGATCG ACCTTCTTCT TCGATCTGCC GCTGCAGCGC
GCCCCGGCCG AGCGGCCCAC CCTGCGGATC GTCGGCCGCG CCTAA
 
Protein sequence
MAGIMRSSTI GDARALQWVV GSAIVLVIVS ILGGVALMLF AARSLDHIES LDERQLVDRT 
IQRGLERMTH ELTSATVWDE AYTATTPVVD MAWADINYAD YYHRYFNHDL SFAMRDGQVV
YASQAGARVS PRAIGALPGD ARGLVDEVLA KAAKARAAGR LNLEGVSTAA ALVKSGDEIY
LVAASDVVAE TPRVAASRHA PPIVIITARR ISATFIRGMQ EDLAVPGLTL TAKPSASPSV
PLHDARGVTI GALSWTPANP GMSLLKAALP AIVAVFLVLT LASLVLFRRV AETLDRMARG
RESLVVAKEQ AEAANVAKTQ FLANMSHEIR TPLNGVLGMA QIMESDALSD RQRERLAVIE
ESGNALLALL NSILDMARLE TGAIRLRREA FDLGALVDSS CAVFSGAAVS KGIKLCQALT
PESLGAWVGD PLRLRQVLGN LVANAVKFTD SGTVRVHVEE SALGLRFEVS DTGIGVAPQD
QLRLFKMFSQ VDSSSTRSHE GSGLGLAICH DLVELMGGAI GMRSVPGEGS TFFFDLPLQR
APAERPTLRI VGRA