Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1847 |
Symbol | |
ID | 5899302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1967178 |
End bp | 1968848 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641562337 |
Product | carboxylesterase type B |
Protein accession | YP_001683474 |
Protein GI | 167645811 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2272] Carboxylesterase type B |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.105867 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0362661 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCCT TCATTCGCGA CAGAGGGATC GACCGGCGTC GGTTGCTGGC TGGTGGGGCC GCCGCCGCCG GCGCCCTGGC CGCGCCCTTG GTCGCGCGCG CCGCCGACGC GCCGTCCGAG TCCGTGGAGA TCGCGATCGC CGAAGGGCGC CTGCGCGGCC TGCGCACCGG CGGGGTCGAC GTCTACAAGG GCGTGCCCTA CGGGGCGAGC GTCTCCGGGG CTGGCCGGTT CAAGCCCGCC AACCCCGTCG CGCCCTGGAC CGGAGTGCGG GACGCCACGC GGCTCGGCAC GCCGTCGCTG CAGGATCCCG GCACCGTCTA CGGCGTGAAC GAGCCGGCGC CGGGCGAGGA TTGCCTGGTG CTGAACCTCT GGACCCCGGC CGGCGGCGGT AAGGGCAAGC CTGTGATGTT CTACAGTCAC GGCGGCGGCT ACACGACCGG TTCGGGCGGC AGCACGGCCC AGGACGGCTC GAACCTGGCG CGCGAGCACG ACGTGGTGGT GGTGTCGACC AATCACCGAT TGGGCGTGCT GGGCTATCTC TACATGGGCG AACTGGGCGG GGCCGAATAC GCCGGCTCGG GCAACCAGGG CCTGTCCGAC ATCGTGCTGG CGCTCAAGTG GGTGGCTCGC AACATCGCCG CGTTCGGAGG CGACCCCGGC AATGTGACGA TCTTCGGCGA GTCCGGCGGC GGGGCAAAGA CCTCCTGCCT CTACGCCATG CCCTCGGTCG CGCCGCTGTT TTCCAAGGCG ATCATCCAGA GCGGGCCGGT GGTGCGGGTG ACGACCCCGG ACGTCGCCGC CCAGACGACA CGCATGTTCC TCGAACAACT GGGGATCGCC CCGGCCGACT GGCGCAAGGT GCTCGATGTT CCCGCGCCAC AGATCCTGGC CGCGCAGAAG GCGCTGAACG CCAAGGTGAA GAGCGACAGC GGCGGCTGGC GGGGGATCCA GTCGCTGACG CCCGGCACGT ACGGGCCGAT CGTCGATGGC GGCCTGCTGC CGCGCCATCC GTTCGATCCC GCCGCCCCGG CCAGCGCCGC CGACAAGCCG CTGATGATCG GCTGGCTTGA CAGCGAAGCG GCCTTTTTCG CCTGGACGGG CAAGGACGTC GAGGCTTTCA GGCTGGACGA GGCGGGGCTG AAGGCGCGGC TGTCGGGCAA GTTCGGCGAC AAGGCCCAGA CATTAATCGA CACCTACCGG GCCGACCGTC CCGGCGCGAC GCCGAGCGAC ATCTATCTGG CGGCGGCCAG CTTCCACGCC ATGGGCGCCG GCTCGGTGGT CACCGCCGAG CGCAAGGCCA CGCAGGGCAG GGCGCCGGTC TATGTCTACA ACATCGCCTA TCGCTCGAAC CGCAAGATGG ACGGGACCGA CATCGAGCTG GGGGCCATGC ACGCCAGCGA CATCCCCCTG GTGTTCAACA CCGTCGCCTC GCCCACCACC CTGGCCGGAG ACCGCGCCGA CCGTTTCGCG GCGGCGCGCA ACGTCAGCAC GATGTGGGCC AATTTCGCGC GGACCGGACG GCCGGCGGCG CCGGGCCAGC CGGTCTGGCC CGCCTACGAC CTGAAGACGC GACAGACGAT GGTGCTCGAT GTCGGCTGCA GCGTGGTTCC AGACCGGTTC GGGGCGGAGC GGAAGCTGTG GGCGCGGTTG GATCCGCCGA CCGCGGGATG A
|
Protein sequence | MTSFIRDRGI DRRRLLAGGA AAAGALAAPL VARAADAPSE SVEIAIAEGR LRGLRTGGVD VYKGVPYGAS VSGAGRFKPA NPVAPWTGVR DATRLGTPSL QDPGTVYGVN EPAPGEDCLV LNLWTPAGGG KGKPVMFYSH GGGYTTGSGG STAQDGSNLA REHDVVVVST NHRLGVLGYL YMGELGGAEY AGSGNQGLSD IVLALKWVAR NIAAFGGDPG NVTIFGESGG GAKTSCLYAM PSVAPLFSKA IIQSGPVVRV TTPDVAAQTT RMFLEQLGIA PADWRKVLDV PAPQILAAQK ALNAKVKSDS GGWRGIQSLT PGTYGPIVDG GLLPRHPFDP AAPASAADKP LMIGWLDSEA AFFAWTGKDV EAFRLDEAGL KARLSGKFGD KAQTLIDTYR ADRPGATPSD IYLAAASFHA MGAGSVVTAE RKATQGRAPV YVYNIAYRSN RKMDGTDIEL GAMHASDIPL VFNTVASPTT LAGDRADRFA AARNVSTMWA NFARTGRPAA PGQPVWPAYD LKTRQTMVLD VGCSVVPDRF GAERKLWARL DPPTAG
|
| |