Gene Caul_3136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3136 
Symbol 
ID5900591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3400082 
End bp3402403 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content65% 
IMG OID641563639 
ProductATP-dependent Clp protease, ATP-binding subunit clpA 
Protein accessionYP_001684761 
Protein GI167647098 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.121494 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCTCTT TTTCGCGCCC ACTTGAAGAA TCCCTGCATC GCGCCGTCGC TTACGCCAAC 
CAGCGCAAGC ACGAGTACGC CACGCTTGAA CACCTGCTGC TGTCGCTCAC CGACGACGAC
GACGCGGCCG GGGTTATGCG GGCCTGCGAT GTCGACCTCA ACGCTCTGAA GAAGAGCCTC
GTGAACTATC TCGACGTCGA ACTGGCCTCG CTGGTCGTCG ACGACGAGGA CGACGCCAAG
CCCACGGCCG GCTTCCAGCG CGTGATCCAG CGCGCCGTCA TTCACGTGCA GTCCTCGGGT
CGCGAAGAAG TGACCGGCGC CAACGTGCTG GTGGCGATCT TCTCCGAACG GGAGAGCCAC
GCCGCCTACT TCCTGCAAGA GCAGGACATG ACCCGCTATG ACGCGGTGAA CTTCATCGCC
CATGGCATCG CCAAGAAGGC CGGCGCCTCC GAGCCCAAGA CGGCCAAGGG CGCCGCGGTC
GAGGAAGATG TCGAGAAGCC CAACGCCAAG ACCGGCGGCG AGGCGCTCGA GGCCTATTGC
GTCGACCTCA ACGAAAAGGC CCGCCAGGGC AAGGTCGACC CGCTGATCGG CCGCGCGAGC
GAAGTGGAAC GCGCGATTCA GATCCTGTGC CGGCGGACCA AGAACAACCC GCTGCTGGTC
GGCGAACCCG GCGTCGGCAA GACCGCCATC GCCGAGGGCC TGGCCCGCAA GATCATTACC
CACCAGGTCC CCGAAGTCCT GGCCGGCGCC ACCATCTACT CGCTCGACAT GGGCGCGCTG
CTGGCCGGCA CCCGCTATCG CGGCGACTTC GAGGAACGCG TCAAGCAGGT GGTCAAGGAA
CTGGAGAACC ACCCCAACGC GGTGTTGTTC ATCGACGAGA TCCATACGGT GATCGGCGCT
GGCGCGACCT CGGGCGGGGC GATGGACGCC TCCAACCTGC TGAAGCCCGC CCTGGCCTCG
GGCACCCTGC GGTGCATGGG TTCGACCACC TACAAGGAGT TCCGTCAGCA CTTCGAAAAG
GATCGGGCCC TGGTCCGTCG CTTCCAGAAG ATCGACGTGA ACGAGCCGAC GGTGGAAGAC
ACCATCAAGA TCCTCAAGGG CCTGAAGACC TATTACGAGG ACTTCCACAA GCTGAAGTAC
ACCAACGAGG CCCTCAAGGT CGCGGTGGAG CTGTCGGCCA AGTACATCAC CGACCGCAAG
CTGCCCGACA AGGCGATCGA CGTGATCGAC GAGGCGGGCG CCGGCCAGAT GCTGCTGCCG
GAGAGCCGTC GCAAGAAGGT CCTGGGCGTC AAGGAGATCG AGGCCGTGGT GGCCAAGATC
GCCCGCATCC CGCCCAAATC CGTCAGCAAG TCGGACACCG AGTCCCTGCG CGAGCTGGAA
CGGGATCTGA AGGGCGCGGT GTTCGGTCAG GACGAAGCCC TGGCTCAGCT GTCCTCGGCC
ATGAAACTGG CCCGGGCCGG CCTGCGCGAT CCGGACAAGC CGATCGGCAG CTATCTGTTC
AGCGGCCCGA CCGGCGTCGG CAAGACCGAA GCGGCCAAGC AGCTCGCCTC GACCCTTGGG
ATCGAGATGA TCCGCTTCGA CATGTCGGAA TACATGGAGC GCCACACCGT CAGCCGGCTG
ATCGGGGCTC CCCCCGGCTA TGTCGGCTAC GATCAGGGCG GCCAGTTGAC CGACGCCGTC
GACCAGCACC CGCACGCCGT GGTGTTGCTG GACGAGATCG AGAAGGCCCA CGCCGACGTC
TACAACATCC TGCTCCAGGT CATGGACCAC GGGGTGCTGA CCGACAGCAA CGGCAAGAAG
GTCGACTTCA GGAACGTCAT CCTGATCATG ACCACCAACG CCGGCGCGTC GGACGCCCAG
CGCCAGTCGA TCGGTTTCGG CCGCGACAAG GTGCAGGGCG AGGAAGAGGC CGCGCTCAAG
CGCCTGTTCA CCCCGGAGTT CCGCAACCGT CTCGACGCGG TGGTGGCCTT CAAGCCGCTG
ACGCCGGAAA TCATCCGGAT GGTCGTGCAG AAGTTCGTCC TGCAGATGGA AGTCCAGCTG
GCCGACCGGA ACGTGACGAT CTCGCTGAGC GACGACGCGG CCGACTGGCT GGCCAAGAAC
GGCTTCGACG AGCTCTACGG CGCACGCCCA CTGGCGCGGG TCATCCAGGA GCACATCAAG
AAGCCGCTGG CCGACGACAT CCTGTTCGGG CGACTGGTCC GCGGCGGGCA TGTGAAGGTC
GTGCTCAAGG ACAGCAAGAT CGACTTCGAG ATCGAGAGCA CGCCAGAAAA GCCCGGCAAG
GCTCCCAAGG AAGACGAGGC CGAACCGGCC CTGGCCGAGT AG
 
Protein sequence
MPSFSRPLEE SLHRAVAYAN QRKHEYATLE HLLLSLTDDD DAAGVMRACD VDLNALKKSL 
VNYLDVELAS LVVDDEDDAK PTAGFQRVIQ RAVIHVQSSG REEVTGANVL VAIFSERESH
AAYFLQEQDM TRYDAVNFIA HGIAKKAGAS EPKTAKGAAV EEDVEKPNAK TGGEALEAYC
VDLNEKARQG KVDPLIGRAS EVERAIQILC RRTKNNPLLV GEPGVGKTAI AEGLARKIIT
HQVPEVLAGA TIYSLDMGAL LAGTRYRGDF EERVKQVVKE LENHPNAVLF IDEIHTVIGA
GATSGGAMDA SNLLKPALAS GTLRCMGSTT YKEFRQHFEK DRALVRRFQK IDVNEPTVED
TIKILKGLKT YYEDFHKLKY TNEALKVAVE LSAKYITDRK LPDKAIDVID EAGAGQMLLP
ESRRKKVLGV KEIEAVVAKI ARIPPKSVSK SDTESLRELE RDLKGAVFGQ DEALAQLSSA
MKLARAGLRD PDKPIGSYLF SGPTGVGKTE AAKQLASTLG IEMIRFDMSE YMERHTVSRL
IGAPPGYVGY DQGGQLTDAV DQHPHAVVLL DEIEKAHADV YNILLQVMDH GVLTDSNGKK
VDFRNVILIM TTNAGASDAQ RQSIGFGRDK VQGEEEAALK RLFTPEFRNR LDAVVAFKPL
TPEIIRMVVQ KFVLQMEVQL ADRNVTISLS DDAADWLAKN GFDELYGARP LARVIQEHIK
KPLADDILFG RLVRGGHVKV VLKDSKIDFE IESTPEKPGK APKEDEAEPA LAE