Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4233 |
Symbol | |
ID | 5901694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4597934 |
End bp | 4600591 |
Gene Length | 2658 bp |
Protein Length | 885 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564753 |
Product | erythromycin esterase |
Protein accession | YP_001685853 |
Protein GI | 167648190 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG0412] Dienelactone hydrolase and related enzymes [COG1926] Predicted phosphoribosyltransferases [COG2312] Erythromycin esterase homolog |
TIGRFAM ID | [TIGR02019] bacteriochlorophyll 4-vinyl reductase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCGC CACACAGCGT CCCGCTCTTC GCGGACCGCG CCGAGGCCGG TCGGCAGCTG GCGGCGCGTC TGGCGACCTT GAAGTTGCCG CGTCCGGTGG TTTACGCCCT GCCGCGTGGC GGGGTCCCCC TGGCGATGGA GATCGGCAGG ATTTTGGCCG CGCCGATCGA CCTGATCCTC GTGCGCAAAT TGGGCGCCCC AGGGCAACCT GAGCTCGCCG TCGGCGCGGT GGTGGACGGC CACGAGGCGC AGATGGTCAT CAACGAGGAC CTGGCCGCCG CGACCGGCGC CGACGCCGCC TATTTGCATG ACGCCCAGCG ACGCGAACTG GGTGAGATCG AGCGACGCAG GGCGCTCTAT CTGGGCGATC GTCCTCGCGT CAGTCCCGTC GGGCGCACGG CGATCGTCGT CGATGATGGC CTGGCGACCG GCGCCACCGC CAAGGCGGCG TTGCGAGCGC TGCGCCGCCA GGGCGCGGCC CGCATCATCC TGGCCGTCCC GCTGGCGCCA ATTGAGACCC TGGAAGCCAT GCGCGCCGAG GCCGACGAGG TGGTCTGTCT GGCCACGCCG TCCCCGTTCC TGGGCGTTGG GCGATTCTAT GGCGACTTCC ATCAACTGAC CGACGACGAA ACCATCGCGT TGTTGCGTCA GGCCTGGGAT GGGGCCGCCT CCAGCGGGCC GGGCGTGGGG ATTGGGGCCG TCGAACGACG CGAGGTGCGG CTGCCGCCGA TCGGCTTGAC GGGCGACCTT CAGGTTCCGA AGGGGGCAAA GGGGATCGTG GTGTTCGCCC ACGGCAGCGG TTCGAGCCGG CTCAGCCCGC GCAATCGCGC CGTGGCCGAC GATCTCAACG CGCGCGGGAT GGCGACGCTG CTGTTTGACC TGCTCGGCGA GGCCGAGGCC GCGGACCGCC GCAAGGTCTT CGACATCGAC CTGCTGGCCG AGCGGCTTGT CGACGCCACG GCCTGGATCG CCGGCCAGCC GGATCTGGCC GGGCTGCCCC TGGGTCTGTT CGGCGCCAGC ACCGGCGCCG GCGCCGCCCT GGTCGCGGCC GCCAAGCTGG GCGAGCGCGT GTGCGCCGTC GTCTCTCGCG GCGGGCGGCC GGACTTGGCC GGGGCCGCGC TCCGGCGCGT TTCAGCGCCC ACCTTGCTGA TCGTAGGCGG CGCGGATCAT CAGGTTATCG AACTCAACCG GCAAGCCCTC GCCGAACTGC GGGGTGACAA GGCGCTCCGG ATCGTGCCCG GCGCTGGGCA TCTGTTCGAG GAAGCCGGCG CGCTCGAACA GGTCATGCAT CTGGCCGGCG ACTGGTTTGG AGCCAAGTTT TGCGTCCCGC CCGCGACCAT CGATACACGG GCGCTGGCCG TCACGCCGCA GGCCCGTCTT GCCTCCGCGG CCGAACCCCT GCCGGACATC GACGATCCCG CCTTCGCCGC CGCCTTCGAC CGCTACGCCG AGGCGCGGGT CGTCCTATTG GGCGAGGGCA GTCACGGGAC GAGCGAATTC TATCGCGCCC GGGCGGCGAT CACGCGCCGG CTCATCGAAC AGCATGGCTT CGAGATCGTC GCGGTCGAGG CCGACTGGCC CGACGCGGCG GCCATAGATC GACACGTGCG ATTGAAGCCG CACCAGGCGA TGACGCCCGC GCCGTTCACC CGCTTTCCCA CCTGGATGTG GCGCAACACC GAGGTCGAGG CCTTCACCCG CTGGCTGCGC GATCACAACG CTGGCAAGCC GGTCGATCAC CGTGTCGGCT TCTATGGCCT TGACCTCTAC AACATGCGCG CCTCGATGGC GGCCGTGCTG GCCTATCTCG ACGAGGTCGA TCCGGCCGCC GCCGCCGAAG CGCGCGACCG CTACGCCTGC ATGTCGCCCT GGAACGCCGC GCCGGCGACC TACGGCCGGG CGGCGCTCAG CGAAGGTTAC GCGATCTGCG AGCGTCAGGT GGTCAGTATC CTCGTCGATC TGGTGCGCAA GGCCGCAGAC TACGCCGCCC AGGACGGTGA GACCCTGTTT GACGCCACCC AGAACGCGCG ACTGGTGGTC GACGCCGAAC GCTACTACCG GGCCATGTAT TATGGCGCCC ACGAGTCCTG GAACCTGCGC GACCGGCACA TGTTCGAAAC CCTCGATCGC GTCCTCAGGC TGCGCGGCCC CGAGGCCAAG GCCGTGGTCT GGGCGCACAA CTCCCATATC GGCGACGCCC GCTACACCGA AATGGGCGCG GCGCGCGGCG AACTGAACAT CGGCCAGCTT TGCCGCGAAC GCTTCGGAAG CGACGCCGTC CTGATCGGCT TGGGCACGCA CGGCGGGACG GTGATGGCGT CTTCGGACTG GGACGCCCCG GCGGAGGTGA AGACGGTAAA GCCCTCGCGC CCTGACAGCT ATGAGGCGCT TTGTCACGAG GTGGGCGTCG AACGGTTCCT GGTGGATCTG CGGCCAGGCC ACAACGAGGC CTTGCGCGCC GACCTGCGCG AGCCACGTCT GGAGCGCTAT ATCGGCGTGG TCTATCGGCC CGACAGCGAG CGTCTGAGCC ACTACGCCCA CGCCAGCCTG TCCGAACAAT ACGACGCCTT CGTTTGGTTC GAAGAAACCC AGGCCCTGAC CCAGCTCCCC ACCCAGGTTC GCCCGGGTGA AGATGACACC TATCCCTTCG GCCTTTGA
|
Protein sequence | MNAPHSVPLF ADRAEAGRQL AARLATLKLP RPVVYALPRG GVPLAMEIGR ILAAPIDLIL VRKLGAPGQP ELAVGAVVDG HEAQMVINED LAAATGADAA YLHDAQRREL GEIERRRALY LGDRPRVSPV GRTAIVVDDG LATGATAKAA LRALRRQGAA RIILAVPLAP IETLEAMRAE ADEVVCLATP SPFLGVGRFY GDFHQLTDDE TIALLRQAWD GAASSGPGVG IGAVERREVR LPPIGLTGDL QVPKGAKGIV VFAHGSGSSR LSPRNRAVAD DLNARGMATL LFDLLGEAEA ADRRKVFDID LLAERLVDAT AWIAGQPDLA GLPLGLFGAS TGAGAALVAA AKLGERVCAV VSRGGRPDLA GAALRRVSAP TLLIVGGADH QVIELNRQAL AELRGDKALR IVPGAGHLFE EAGALEQVMH LAGDWFGAKF CVPPATIDTR ALAVTPQARL ASAAEPLPDI DDPAFAAAFD RYAEARVVLL GEGSHGTSEF YRARAAITRR LIEQHGFEIV AVEADWPDAA AIDRHVRLKP HQAMTPAPFT RFPTWMWRNT EVEAFTRWLR DHNAGKPVDH RVGFYGLDLY NMRASMAAVL AYLDEVDPAA AAEARDRYAC MSPWNAAPAT YGRAALSEGY AICERQVVSI LVDLVRKAAD YAAQDGETLF DATQNARLVV DAERYYRAMY YGAHESWNLR DRHMFETLDR VLRLRGPEAK AVVWAHNSHI GDARYTEMGA ARGELNIGQL CRERFGSDAV LIGLGTHGGT VMASSDWDAP AEVKTVKPSR PDSYEALCHE VGVERFLVDL RPGHNEALRA DLREPRLERY IGVVYRPDSE RLSHYAHASL SEQYDAFVWF EETQALTQLP TQVRPGEDDT YPFGL
|
| |