Gene Information |
Locus tag | Caul_4728 |
Symbol | |
ID | 5902190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5116051 |
End bp | 5117625 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641565247 |
Product | arylesterase-related protein |
Protein accession | YP_001686346 |
Protein GI | 167648683 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2267] Lysophospholipase |
TIGRFAM ID | |
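
The coordinate and length fields in this record are mutually consistent, and a short sketch can check that. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    return gene_length_bp // 3 - 1


length = gene_length(5116051, 5117625)
print(length)                  # 1575
print(protein_length(length))  # 524
```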
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0151687 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCAAGA CCATTCTCAT GATCCACGGC TTTGGCGCCG CGGGTGAAAG CTGGGCGCCG GTGGCGGCTC GTTTCAAGTC GGCCGGCTAC ACGGTCGAAG CCCCGACGAT CCAGGCCGCG CTCCGCACGG TGGGCGCGCC CCCAGCGGGC CTGGCCGGCA AGACCCTCTC GGACTATGTC GGCGAAATGA GCGACCTGGC CGAGGCGATC GCGACGCGTG ATGGCATCAA GCCCGTGGTG TTCGGCCATT CGATGGGCGG GCTGATCGCC CAGAAGCTAG CCGAGGCGGG CCTGGTCTCG GGCGCCGTGC TGTTCGCCCC GGCCTCGCCG GCCGACGCGC GCGGCAAGCC CAGCCTGTCG GCCTTGTTCA CCTTCCTCAA CATCGTCGCC GCCTCCAAGC CGGAGACCAA GGCGGTGAAG ATCTGGAGGA CCGGCTTCCT GTGGGGCGTG CTGAACAAGA CGCCGCCCGA ACGGCGCGAG GCGATCTTCG CCACCACCGT TCACGACAGC GGCCAGGTGC TGACCGACCT GGCCTATCCC GAGCGCGATC CGCGCAGGAC CGCCCACGTC GACGCCTCCA AGGTCACCGC GCCGGTGCTG ATCCTGGGTG GAGCCCAGGA CCGCACCACC CCGATCGCCG ACCTGCGCCT GGTGGCCAAG AAGTACGCCG GGTCGACGTT CAAGGAATAT CCCAACAACG GCCACTGGCT GGTCGATGAG CCCGGCAGCG CCGGGATCCT GGCCGACGTC GCCGCCTGGC TGGACGCCAA GGGCCTGGGC GTCAAGGCGC CGGTCGTGAC GGCCACGCCC GCTCCGGCCG CCAAGGCCGC CGCGCCGAAG GCCGAACCCG CCAAGCCGGT GGTCGCCGCA GCTCCCGTCC CGGCCGCGAA GCCCGCGCCC AAGGCCAAGG CTCCAGCTTC CGCCAAGACC GCTCCGGCGT CCAAGGCCCC CGCGAAGACG GATCCAGCCC CGAAGGCCGC CGCGCCCAAG GCCGCTCCTG CTCCAAAGGT CGTCAAGGCC GCGCCCGCCC CGAAGGCGGC GACCTCAGCC CCCAAGGCTC CGTCCAAGCC CGCCGCCAAG GCTGCTCCCG CACCCAAGGC GAAGACTCCC GTCAAGGCCG TGCCGGTCGC GACGCCCGCC GCCGCGCCGG CCAAGACCGC TGCCAAGCCG CCGGCGAAGG CCAAGGCCGT TCCGGCGCCC AAGGCCGCCC CCGCCGCGAA GCCTGCCCCC GCTCCCAAGC CCGAGGCGGC CAAGGCCAAG CCGGCTGCCG CGCCCACCGC CGCCAAGGCC CCGGCCAAGG CCGTCTCGCC GAAGCCGAAG GCCGCCGCTC CGGCCGCCAA GGACACGGCC GTGAGGACCC CCGCGGCGAA GGCTCCGGCG GCCAAGGCCC CCGCCAAGGC TGCGAACCCG GCTCCGAAGG TCGCTCCAAC CAAGGTCAAG TCGGCCGCTG CCAAGCCCGC CGCCGCCAAG GCTCCGGCCG CAACGAAGGC CGCTCCGCCG GCCAAGACGC CGGCCAAGGC CCCCGCGACA AAGTCGACCG CCAAATCTTC GCCATCAGCC AAGAAGAGCG TGTAG
Protein sequence | MTKTILMIHG FGAAGESWAP VAARFKSAGY TVEAPTIQAA LRTVGAPPAG LAGKTLSDYV GEMSDLAEAI ATRDGIKPVV FGHSMGGLIA QKLAEAGLVS GAVLFAPASP ADARGKPSLS ALFTFLNIVA ASKPETKAVK IWRTGFLWGV LNKTPPERRE AIFATTVHDS GQVLTDLAYP ERDPRRTAHV DASKVTAPVL ILGGAQDRTT PIADLRLVAK KYAGSTFKEY PNNGHWLVDE PGSAGILADV AAWLDAKGLG VKAPVVTATP APAAKAAAPK AEPAKPVVAA APVPAAKPAP KAKAPASAKT APASKAPAKT DPAPKAAAPK AAPAPKVVKA APAPKAATSA PKAPSKPAAK AAPAPKAKTP VKAVPVATPA AAPAKTAAKP PAKAKAVPAP KAAPAAKPAP APKPEAAKAK PAAAPTAAKA PAKAVSPKPK AAAPAAKDTA VRTPAAKAPA AKAPAKAANP APKVAPTKVK SAAAKPAAAK APAATKAAPP AKTPAKAPAT KSTAKSSPSA KKSV
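
The protein sequence can be reproduced from the gene sequence with a small translation routine, sketched below. The codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in which codons may act as starts), so the sketch builds the standard table. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the sketch ignores alternative start codons because this gene begins with ATG.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spacing used in the record (blocks of 10) and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGACCAAGA CCATTCTCAT GATCCACGGC"))  # MTKTILMIHG
print(translate("AAGAAGAGCG TGTAG"))                  # KKSV*
```

Passing the full gene sequence to `translate` and removing the trailing `*` should yield the protein sequence listed above. `gc_fraction` on the full sequence can be compared against the reported GC content.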