Gene Caul_2845 details

Gene Information

Locus tag: Caul_2845
Symbol:
ID: 5900300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3082861
End bp: 3085260
Gene Length: 2400 bp
Protein Length: 799 aa
Translation table: 11
GC content: 64%
IMG OID: 641563341
Product: ATP-dependent protease La
Protein accession: YP_001684470
Protein GI: 167646807
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0466] ATP-dependent Lon protease, bacterial type
TIGRFAM ID: [TIGR00763] ATP-dependent protease La
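
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon. The record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3082861
end_bp = 3085260
gene_length_bp = 2400
protein_length_aa = 799

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                  # coordinates agree with gene length
print(span_bp % 3 == 0)                           # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)   # protein length agrees
```

Under these assumptions all three checks pass: the span is 2400 bp, which is 800 codons, or 799 residues plus one stop codon.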


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0128404
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0142705
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAGA TTCGTACGCT TCCTGTGTTG CCCCTGCGGG ATATCGTTGT GTTCCCGCAC 
ATGGTGGTTC CCCTGTTCGT GGGCCGCGAC AAGTCGGTGC GCGCCCTCGA AGAGGTGATG
CGCGGCGGCA AGGAGATCCT TCTGGTCACC CAGAAGAACT CGGCCGACGA CGATCCGGCG
CCTTCCGACA TCTATGATGT CGGCGTGCTG GCCACCGTTC TGCAACTGCT GAAACTGCCG
GACGGCACCG TGAAGGTGCT GGTCGAAGGC AAGGGCCGCG CCGCCGTCGT GCGCTTCACC
GATCAGGAGG CCTATTACGA GGCCCAGATC AGCGAAGTGA ACGAGGACCA GGGCGTCGGC
CCCGAGGCCG AGGCCCTGTC GCGCGCCGTC GTCGAGCAGT TCGAGAACTA CGTCAAACTG
AACAAGAAGG TGCCGCCCGA GGCCCTGGCC TCGATCCCGC AGATCGCCGA GCCCGGCAAG
CTGGCCGACA GCATCTCGGC TCACCTGTCG GTCAAGATCG GCGACAAGCA GCACCTGCTG
GAAATCTTCG ACGTCGTGAA GCGCCTGGAG AAGGTCTTCG CCCTGATGGA GGGTGAGATC
TCGGTGCTGC AGGTCGAGAA GAAGATCCGC TCGCGCGTGA AGCGCCAGAT GGAGAAGACC
CAGCGCGAAT ATTACTTGAA CGAGCAGATG AAGGCGATCC AGCGCGAGCT GGGCGATCCC
GACGATCAGC GCGACGAACT GATCGAGCTC GAAAAGCGCA TCAAGAAGAC CAAGCTTTCC
AAGGAAGCCC GGGCCAAGGC CGAGGGCGAG CTGAAGAAGC TGCGCAACAT GAGCCCGATG
TCGGCCGAGA GCACCGTGGT CCGCAACTAT CTGGACTGGA TGCTGTCGAT CCCGTGGGGC
AAGGCCAAGA CCAAGAAGAT CGACCTGGTC GAGGCCGAAG CCGTGCTCGA AGAGGACCAC
TATGGCCTGG AGAAGGTCAA GGAACGCATC CTTGAGTACT TGGCCGTCCA GGCTCGTACC
GGTTCGCTGA AGGGGCCGAT CCTGTGCCTC GTCGGCCCTC CGGGCGTCGG CAAGACCTCG
CTGGGCCGTT CGCTGGCCAA GGCGACCGGG CGCGAATTCG CCCGCATCTC CCTGGGCGGC
GTGCGCGACG AGGCCGAGAT CCGCGGTCAC CGCCGCACCT ACATCGGCTC CATGCCCGGC
AAGATCATCC AGACCATGAA GAAGGCGAAG ACCACCAACG CCTTCGTCCT GTTGGACGAG
ATCGACAAGA TGGGCAGCGA CTATCGCGGC GACCCGTCGT CGGCTCTGCT GGAAGTGCTC
GACCCGGCCC AGAACTCGAC CTTCGGCGAC CACTATCTGG AAGTCGACTA CGACCTGAGT
CAGGTGATGT TCGTCACCAC GGCCAACAGC CTCAACATGC CCCAACCGCT GCTGGACCGC
ATGGAGATCA TCCGCATCCC CGGCTACACC GAGGATGAGA AGCTGGAGAT CGCCAAGCGC
CACGTGCTGC CCAAGCTCAT GAAGGACCAT GGCCTGAAGC CGGCTGAGTT CGTGGTGCCG
GAAAAGGCGA TCCGCGACCT AATCCGCTAC TACACCCGGG AAGCCGGCGT GCGGTCACTG
GAGCGGGAAC TGGGCGCTCT GGCGCGCAAG ACCGTCCGCG ATCTGGCGCG TGAGAAGGTC
GTCTCGATCA CGATCGACGA CGAGCGTCTG GCAAAGTACG CGGGCGTCAA GAAGTACCGC
TACGGCGAGA CCGACGAAGT CGACCAGGTC GGCATCGTCA CCGGTCTGGC CTGGACCGAG
TTTGGCGGCG ACATCCTGAC TATCGAAGCC GTGAAGATGC CGGGCAAGGG CCGCATGACG
GTCACCGGCA ACCTCAAGGA GGTGATGAAG GAGTCGATCT CGGCGGCCCA GTCCTACGTC
CGCTCGCGGG CGCTGCACTT CGGCGTCAAG CCGCCGATCT TCGAGAAGAC CGACGTCCAC
ATCCACGTGC CGGACGGTGC GACGCCCAAG GACGGCCCGT CGGCCGGCGC CGCCATGGCC
CTTGCCATGG TCTCGGTGCT GACCGGGATC CCGATCCGCA AGGACATCGC CATGACCGGC
GAGATCACCC TGCGCGGCCG GGTCACCGCG ATTGGCGGCC TCAAGGAGAA GCTGCTGGCC
GCCCTGCGGT CCGGCGTGAA GACCGTCCTG ATCCCTCAGG AGAACGAGAA GGATCTGGTC
GACGTGCCGC AAAGCGTGAA GGACGGCCTG GAAATCATCC CGGTCTCCAC GGTCGACGAA
GTGCTGAAAC ACGCGCTTAC CGGTCCGCTC ACGCCAATCG AGTGGCGGGA AGAGGACGAG
CCGATTGCTG CAACTGCGAA GGTTGACGAC GGCGATAGCG ACGCTGTCCT GACCCACTGA
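
The GC content reported in the gene information can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as plain text in the space-separated 10-base blocks used here. `gc_percent` is a hypothetical helper name, not part of any database tool, and the example call uses only the first row of the sequence rather than the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence only; paste all rows to check the full gene.
first_row = "ATGTCTGAGA TTCGTACGCT TCCTGTGTTG CCCCTGCGGG ATATCGTTGT GTTCCCGCAC"
print(round(gc_percent(first_row), 1))
```

A single row will not necessarily match the gene-wide figure, since base composition varies along the sequence.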
 
Protein sequence
MSEIRTLPVL PLRDIVVFPH MVVPLFVGRD KSVRALEEVM RGGKEILLVT QKNSADDDPA 
PSDIYDVGVL ATVLQLLKLP DGTVKVLVEG KGRAAVVRFT DQEAYYEAQI SEVNEDQGVG
PEAEALSRAV VEQFENYVKL NKKVPPEALA SIPQIAEPGK LADSISAHLS VKIGDKQHLL
EIFDVVKRLE KVFALMEGEI SVLQVEKKIR SRVKRQMEKT QREYYLNEQM KAIQRELGDP
DDQRDELIEL EKRIKKTKLS KEARAKAEGE LKKLRNMSPM SAESTVVRNY LDWMLSIPWG
KAKTKKIDLV EAEAVLEEDH YGLEKVKERI LEYLAVQART GSLKGPILCL VGPPGVGKTS
LGRSLAKATG REFARISLGG VRDEAEIRGH RRTYIGSMPG KIIQTMKKAK TTNAFVLLDE
IDKMGSDYRG DPSSALLEVL DPAQNSTFGD HYLEVDYDLS QVMFVTTANS LNMPQPLLDR
MEIIRIPGYT EDEKLEIAKR HVLPKLMKDH GLKPAEFVVP EKAIRDLIRY YTREAGVRSL
ERELGALARK TVRDLAREKV VSITIDDERL AKYAGVKKYR YGETDEVDQV GIVTGLAWTE
FGGDILTIEA VKMPGKGRMT VTGNLKEVMK ESISAAQSYV RSRALHFGVK PPIFEKTDVH
IHVPDGATPK DGPSAGAAMA LAMVSVLTGI PIRKDIAMTG EITLRGRVTA IGGLKEKLLA
ALRSGVKTVL IPQENEKDLV DVPQSVKDGL EIIPVSTVDE VLKHALTGPL TPIEWREEDE
PIAATAKVDD GDSDAVLTH
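
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a sketch, not the database's own pipeline. It relies on the fact that NCBI translation table 11 assigns amino acids to codons exactly as the standard code does (the two tables differ only in which codons are accepted as starts). `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI's TCAG ordering.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon, 'X' an unknown codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases of the gene sequence.
print(translate("ATGTCTGAGA TTCGTACGCT TCCTGTGTTG"))  # MSEIRTLPVL
# Last 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("ACCCACTGA"))  # TH*
```

The first call reproduces the opening block of the protein sequence (MSEIRTLPVL). The second gives the final two residues (TH) followed by the stop, which is why the 2400 bp gene yields 799 residues.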