Gene Information |
Locus tag | Caul_3450 |
Symbol | |
ID | 5900905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3730244 |
End bp | 3732175 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641563956 |
Product | X-Pro dipeptidyl-peptidase domain-containing protein |
Protein accession | YP_001685075 |
Protein GI | 167647412 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | [TIGR00976] putative hydrolase, CocE/NonD family |
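The length fields in this record are consistent with the coordinates. The sketch below reproduces that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3730244, 3732175)
print(length)                     # 1932
print(protein_length_aa(length))  # 643
```

Both values match the Gene Length (1932 bp) and Protein Length (643 aa) fields listed above.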
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGG TGTTGGCGCT CGCGGCCGTG AGCCTTTTGC TGGCTGGCCC CAGCTTCGCG CAGACCACGA GCCTGCCCAG CGAGACCCCC GCGGTCTTCA AGCCCGCCGC CGAACGCCTC GACTACGAGC GGCGCGACGT GATGATCCCG ATGCGCGACG GGGTGAAGCT GCACACCGTC ATCCTGGTCC CCAGGGACGC CAAGCGGGCG CCGATCCTGC TGACCCGCAC CCCCTACGAC GCCACGGCGA TGACCACGAT CAACGCCACC ACCCACATGG CCGACGCCAT CGCCGGTTAC GACCATCCCG TCGATGTGGT GATCGAGGGC GGCTACATCC GCGTCGTCCA GGACGTGCGC GGCAAGCATG GCTCCGAGGG CGACTACGTG ATGAACCGCC CCCTGAAGGG GCCGCTCAAT CCCACGGCGG TCGACCACGC CACCGACACC TGGGACACCA TCGACTGGCT GGTCAAGAAC ATACCCGAGA CCAACGGCAA GGTCGGGATC CTGGGCATCT CCTACGACGG CTTCACCGCG CTGGAGGCGC TGTTCAACCC GCATCCGGCC CTGAAGGCGG CCGTGCCGAT GAACCCGATG GTCGACGGCT GGATGGGCGA CGACTGGTTC CACAACGGGG CCTTCCGCCA GCAAAACATG CCCTACATCT ATGAGCAGGT CGGCACGCGG AAGAACGAGG AGAAGTGGCT GTCGGGCGTT CACGACGACT ACGACCTCTT CATGCGGGCC GGCTCGGCCG GGGCGCTGGG CGCCCAGATG GGGCTGGAGC AGACCGGCTT CTGGCGCAAG ATCCTGGCCC ACCCGGACTA CGATGCCTTC TGGAGCGACC AGGCGGTCGA CAAGCTGTTG GCCAGGGAGC CGCTGAAGGT GCCGGTCATG CTGGTCCACG GCTTGTGGGA CCAGGAGGAC ATCTACGGCG CCCCAGCCGT CTACAAGGCG ATCGAGCCCA AGGACACGGC CAACGACAAG GTGTTCCTGG TGCTAGGTCC CTGGTTCCAC GGCCAGCAGA TCGAGGAGGC CTCCAGCCTG GGAGCGATCA AGTTCGGCGC CGACACCGCC CTGCGGTTTC GCCAGGACGT GCTGGCCCCG TTTCTGGCCC ACTATCTGAA GGACGAGGCC CCCGCCATGG ACGTGGCGCC GGTCACCGCC TTCGAGACCG GAACCAACCG CTGGCGCAGG CTGGACGCCT GGCCCTCGGG TTGCGCCAAG GGCTGCGCGA CGACACAGAC TCCGCTCTAC CTGCACGCCG ACGCCAAGGC CGACTTCACC CCGCCCAAGA CCGGCGAGAC GGCCAGTGAC GCCTACGTCT CCGACCCAGC CAAGCCCGTG CCCTACCGCG CCCGCCCCAG CCAGCCGACC GGCTACACGC CGCCCCTGAC ATGGACCCAG TGGTTGGTCG ACGACCAGCG CGAGGCTTCG GGCCGCACCG ACGTGCTGAC CTACACCACC GACGTGCTGA CCGCGCCGAT GAAGATCAGC GGCGAGCCGA TCGTCCACCT GACCGCCTCG ACCAGCGGGA CCGACAGCGA CTGGGTGGTC AAGCTGATCG ACGTCTATCC CGACGAGGTC CCGGCCGATC CGGCCATGGG CGGCTACCAG TTGCCCGTGG CCATGGACAT CCTGCGCGGC CGCTATCGCG AAGGCTTCGC CCAGGCCAAG CCGATCACGG CGGGCGCGCC GCTCAGCTAC CGTTTCGCCC TGCCCAACGC CAACCACGTG TTCCTGCCGG GCCACCGGAT CATGGTCCAG GTGCAGTCCA GCTGGTTCCC GCTCTACGAC 
CGCAACCCGC AGACCTTCAC CCCCAACATC TTCCTGGCCA AGCCGAGCGA CTACGTGAAG GCGACCCAGA CGGTGTTCCA CGCGCCGGAC AAGGCTAGCT TTGTGGAACT GCCGGTGGTG AAGGCGCCCT AG
|
Protein sequence | MKMVLALAAV SLLLAGPSFA QTTSLPSETP AVFKPAAERL DYERRDVMIP MRDGVKLHTV ILVPRDAKRA PILLTRTPYD ATAMTTINAT THMADAIAGY DHPVDVVIEG GYIRVVQDVR GKHGSEGDYV MNRPLKGPLN PTAVDHATDT WDTIDWLVKN IPETNGKVGI LGISYDGFTA LEALFNPHPA LKAAVPMNPM VDGWMGDDWF HNGAFRQQNM PYIYEQVGTR KNEEKWLSGV HDDYDLFMRA GSAGALGAQM GLEQTGFWRK ILAHPDYDAF WSDQAVDKLL AREPLKVPVM LVHGLWDQED IYGAPAVYKA IEPKDTANDK VFLVLGPWFH GQQIEEASSL GAIKFGADTA LRFRQDVLAP FLAHYLKDEA PAMDVAPVTA FETGTNRWRR LDAWPSGCAK GCATTQTPLY LHADAKADFT PPKTGETASD AYVSDPAKPV PYRARPSQPT GYTPPLTWTQ WLVDDQREAS GRTDVLTYTT DVLTAPMKIS GEPIVHLTAS TSGTDSDWVV KLIDVYPDEV PADPAMGGYQ LPVAMDILRG RYREGFAQAK PITAGAPLSY RFALPNANHV FLPGHRIMVQ VQSSWFPLYD RNPQTFTPNI FLAKPSDYVK ATQTVFHAPD KASFVELPVV KAP
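The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The sketch below is dependency-free and handles the space-separated 10-character blocks used in this record. It uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in the start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are illustrative, and only the first 30 bases of the gene sequence are used here as a demonstration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGAAAATGG TGTTGGCGCT CGCGGCCGTG"
print(translate(prefix))  # MKMVLALAAV
```

The output matches the first block of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.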