Gene Caul_3450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3450 
Symbol 
ID5900905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3730244 
End bp3732175 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content67% 
IMG OID641563956 
ProductX-Pro dipeptidyl-peptidase domain-containing protein 
Protein accessionYP_001685075 
Protein GI167647412 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGG TGTTGGCGCT CGCGGCCGTG AGCCTTTTGC TGGCTGGCCC CAGCTTCGCG 
CAGACCACGA GCCTGCCCAG CGAGACCCCC GCGGTCTTCA AGCCCGCCGC CGAACGCCTC
GACTACGAGC GGCGCGACGT GATGATCCCG ATGCGCGACG GGGTGAAGCT GCACACCGTC
ATCCTGGTCC CCAGGGACGC CAAGCGGGCG CCGATCCTGC TGACCCGCAC CCCCTACGAC
GCCACGGCGA TGACCACGAT CAACGCCACC ACCCACATGG CCGACGCCAT CGCCGGTTAC
GACCATCCCG TCGATGTGGT GATCGAGGGC GGCTACATCC GCGTCGTCCA GGACGTGCGC
GGCAAGCATG GCTCCGAGGG CGACTACGTG ATGAACCGCC CCCTGAAGGG GCCGCTCAAT
CCCACGGCGG TCGACCACGC CACCGACACC TGGGACACCA TCGACTGGCT GGTCAAGAAC
ATACCCGAGA CCAACGGCAA GGTCGGGATC CTGGGCATCT CCTACGACGG CTTCACCGCG
CTGGAGGCGC TGTTCAACCC GCATCCGGCC CTGAAGGCGG CCGTGCCGAT GAACCCGATG
GTCGACGGCT GGATGGGCGA CGACTGGTTC CACAACGGGG CCTTCCGCCA GCAAAACATG
CCCTACATCT ATGAGCAGGT CGGCACGCGG AAGAACGAGG AGAAGTGGCT GTCGGGCGTT
CACGACGACT ACGACCTCTT CATGCGGGCC GGCTCGGCCG GGGCGCTGGG CGCCCAGATG
GGGCTGGAGC AGACCGGCTT CTGGCGCAAG ATCCTGGCCC ACCCGGACTA CGATGCCTTC
TGGAGCGACC AGGCGGTCGA CAAGCTGTTG GCCAGGGAGC CGCTGAAGGT GCCGGTCATG
CTGGTCCACG GCTTGTGGGA CCAGGAGGAC ATCTACGGCG CCCCAGCCGT CTACAAGGCG
ATCGAGCCCA AGGACACGGC CAACGACAAG GTGTTCCTGG TGCTAGGTCC CTGGTTCCAC
GGCCAGCAGA TCGAGGAGGC CTCCAGCCTG GGAGCGATCA AGTTCGGCGC CGACACCGCC
CTGCGGTTTC GCCAGGACGT GCTGGCCCCG TTTCTGGCCC ACTATCTGAA GGACGAGGCC
CCCGCCATGG ACGTGGCGCC GGTCACCGCC TTCGAGACCG GAACCAACCG CTGGCGCAGG
CTGGACGCCT GGCCCTCGGG TTGCGCCAAG GGCTGCGCGA CGACACAGAC TCCGCTCTAC
CTGCACGCCG ACGCCAAGGC CGACTTCACC CCGCCCAAGA CCGGCGAGAC GGCCAGTGAC
GCCTACGTCT CCGACCCAGC CAAGCCCGTG CCCTACCGCG CCCGCCCCAG CCAGCCGACC
GGCTACACGC CGCCCCTGAC ATGGACCCAG TGGTTGGTCG ACGACCAGCG CGAGGCTTCG
GGCCGCACCG ACGTGCTGAC CTACACCACC GACGTGCTGA CCGCGCCGAT GAAGATCAGC
GGCGAGCCGA TCGTCCACCT GACCGCCTCG ACCAGCGGGA CCGACAGCGA CTGGGTGGTC
AAGCTGATCG ACGTCTATCC CGACGAGGTC CCGGCCGATC CGGCCATGGG CGGCTACCAG
TTGCCCGTGG CCATGGACAT CCTGCGCGGC CGCTATCGCG AAGGCTTCGC CCAGGCCAAG
CCGATCACGG CGGGCGCGCC GCTCAGCTAC CGTTTCGCCC TGCCCAACGC CAACCACGTG
TTCCTGCCGG GCCACCGGAT CATGGTCCAG GTGCAGTCCA GCTGGTTCCC GCTCTACGAC
CGCAACCCGC AGACCTTCAC CCCCAACATC TTCCTGGCCA AGCCGAGCGA CTACGTGAAG
GCGACCCAGA CGGTGTTCCA CGCGCCGGAC AAGGCTAGCT TTGTGGAACT GCCGGTGGTG
AAGGCGCCCT AG
 
Protein sequence
MKMVLALAAV SLLLAGPSFA QTTSLPSETP AVFKPAAERL DYERRDVMIP MRDGVKLHTV 
ILVPRDAKRA PILLTRTPYD ATAMTTINAT THMADAIAGY DHPVDVVIEG GYIRVVQDVR
GKHGSEGDYV MNRPLKGPLN PTAVDHATDT WDTIDWLVKN IPETNGKVGI LGISYDGFTA
LEALFNPHPA LKAAVPMNPM VDGWMGDDWF HNGAFRQQNM PYIYEQVGTR KNEEKWLSGV
HDDYDLFMRA GSAGALGAQM GLEQTGFWRK ILAHPDYDAF WSDQAVDKLL AREPLKVPVM
LVHGLWDQED IYGAPAVYKA IEPKDTANDK VFLVLGPWFH GQQIEEASSL GAIKFGADTA
LRFRQDVLAP FLAHYLKDEA PAMDVAPVTA FETGTNRWRR LDAWPSGCAK GCATTQTPLY
LHADAKADFT PPKTGETASD AYVSDPAKPV PYRARPSQPT GYTPPLTWTQ WLVDDQREAS
GRTDVLTYTT DVLTAPMKIS GEPIVHLTAS TSGTDSDWVV KLIDVYPDEV PADPAMGGYQ
LPVAMDILRG RYREGFAQAK PITAGAPLSY RFALPNANHV FLPGHRIMVQ VQSSWFPLYD
RNPQTFTPNI FLAKPSDYVK ATQTVFHAPD KASFVELPVV KAP